Anchoring in the African AI Ecosystem
The WAXAL project has made significant strides in enriching the African AI landscape. At its heart, this initiative embodies a profound commitment to collaboration with local academic and community organizations across the continent. This endeavor is not just about technology; it’s fundamentally about empowering local communities and fostering an inclusive, representative AI ecosystem.
Collaborative Approach to Data Collection
The data collection phase of the WAXAL project was led entirely by African institutions, ensuring that the insights and nuances of local languages are authentically represented. By partnering with universities like Makerere University and the University of Ghana, WAXAL successfully tapped into local expertise and community knowledge. Makerere University, for instance, collected Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) data across nine different languages, employing a methodology that aligns with global best practices, yet tailored for local contexts.
Ghana’s University took it a step further by focusing on eight additional languages, employing an innovative ASR image-prompted methodology. This approach not only supports language preservation but significantly enhances the quality and reliability of AI applications tailored for diverse linguistic groups.
Key Collaborators and Diverse Efforts
WAXAL’s success is attributed to an extensive network of collaborators, including Digital Umuganda and Addis Ababa University, who played a pivotal role in leading the ASR collection for several regional dialects. This collaborative framework ensures that the entire data collection process is steeped in local knowledge and practices, enhancing the overall reliability of AI models developed in the future.
Quality is paramount; hence, organizations such as Media Trust, Loud n Clear, and the African Institute for Mathematical Sciences in Senegal focused on high-quality TTS recordings in various languages. This commitment to excellence showcases a dedication not only to quantity but also to ensuring that the data collected is of the highest caliber.
Ownership and Accessibility of Data
A critical component of the WAXAL project’s framework is the principle of data ownership. By allowing partner organizations to retain ownership of the collected data, WAXAL emphasizes a shared commitment to making datasets freely accessible for the broader community. This open-access philosophy is groundbreaking, as it promotes transparency, fosters further research, and cultivates an environment of shared learning among researchers and practitioners.
The collaboration has already led to significant advancements, such as the development of a community-driven cookbook for collecting data on impaired speech. This pioneering research yielded the first open-source dataset for Akan speakers facing challenges like cerebral palsy and stammering. The findings demonstrate that image-prompted elicitation techniques are notably more efficient than traditional text-based prompts for these populations, paving the way for more inclusive speech technologies in low-resource settings.
Pioneering New Research
The potential for groundbreaking research stemming from the WAXAL project is enormous. One of the major initiatives led to the creation of a comprehensive 5,000-hour speech corpus covering five Ghanaian languages: Akan, Ewe, Dagbani, Dagaare, and Ikposo. This ambitious work builds infrastructural capacity for creating robust ASR and TTS systems that cater to West Africa’s rich linguistic diversity. Utilizing a controlled crowdsourcing approach allows for natural audio collections that accurately reflect spontaneous speech.
Moreover, the project has facilitated evaluations of advanced AI models like Whisper, XLS-R, MMS, and W2v-BERT, tested across 13 African languages. This kind of benchmarking not only provides valuable insights into the efficiency of data usage but also underscores the interdependence of linguistic complexity and domain alignment regarding model performance.
A Comprehensive Literature Review
In addition to practical data collection and technological development, the WAXAL project conducted a systematic literature review, cataloging 74 datasets across 111 African languages. This review has established a mapping of the current state of speech technology on the continent and has underscored a critical need for more multi-domain conversational corpora. Furthermore, it advocates for adopting linguistically informed evaluation metrics, such as Character Error Rate (CER), to enhance performance assessments in linguistically rich and tonal contexts.
By flushing out the existing research landscape, WAXAL emphasizes the necessity of targeted efforts to bolster the development of linguistic resources right across Africa.
Conclusion
The WAXAL project serves as a vital framework for advancing the African AI ecosystem. Through its commitment to collaboration, data ownership, and open access, it not only cultivates local capabilities but also sets the stage for the next generation of technological advancements that are inclusive, diverse, and reflective of the continent’s rich tapestry of languages and cultures.
Inspired by: Source

