19 African Languages · 57 Models · Open Source

WAXALNet
African ASR Benchmark

57 open-source ASR models fine-tuned across 19 African languages. Compact edge models outperform zero-shot giants by 26.9pp WER — using models 3–40× smaller.

Read Paper ↗ 🤗 Browse Models View Results ↓

scroll

African Languages

Open-Source Models

26.9pp

Avg. WER Reduction

2,279h

Training Speech

Publication

The Paper

📄

The WAXAL ASR Benchmark: Fine-Tuned Edge Models Across 19 African Languages

2026 · arXiv:2606.02375

We benchmark three zero-shot foundation models against compact fine-tuned edge models across 19 African languages using the conversational WAXAL corpus. Fine-tuned edge models achieve a macro-averaged WER of 38.0% compared to 64.9% for the best zero-shot baseline — a 26.9pp reduction using models 3–40× smaller. An audit by native speakers across all 19 languages reveals systematic architectural failure patterns aligned with language family, script system, and morphological typology.

arXiv:2606.02375 ↗ CC-BY 4.0 Data Open-Source Models

Benchmark

Results

Word Error Rate (%) on the WAXAL test set. Lower is better.

Language	Family	MMS-300M	Whisper S	Whisper T
Acholi	Nilo-Saharan	42.3	42.3	57.7
Akan	Niger-Congo	34.2	31.7	37.9
Amharic	Afro-Asiatic	37.8	33.6	41.3
Dagaare	Niger-Congo	34.9	34	37.3
Dagbani	Niger-Congo	35	34	39.5
Ewe	Niger-Congo (Kwa)	31.3	32.3	35.5
Fula	Atlantic-Congo	40.6	42.6	35.5
Ikposo	Niger-Congo (Kwa)	75.3	77.5	80.9
Lingala	Niger-Congo (Bantu)	42.6	42.7	49
Luganda	Niger-Congo (Bantu)	16.9	21.6	33.8
Malagasy	Austronesian	12.8	13.1	17.7
Masaaba	Niger-Congo (Bantu)	49.5	75.5	59.6
Nyankole	Niger-Congo (Bantu)	38.6	44.7	46.7
Oromo	Afro-Asiatic	26.9	25.2	29.3
Shona	Niger-Congo (Bantu)	25	26.9	31.4
Sidama	Afro-Asiatic	35.6	30.1	34.4
Soga	Niger-Congo (Bantu)	47.2	57.1	69
Tigrinya	Afro-Asiatic	57.1	53.5	60.3
Wolaytta	Afro-Asiatic	38.8	39.5	42.6
Macro Average		38.0	39.9	44.2

Bold = best fine-tuned model per language

Open Source

WAXALNet Models

Three model families, 19 languages each. All available on HuggingFace.

🎙

300M · CTC

MMS-300M

Best character-level accuracy. Wins on all 6 Bantu languages. Immune to repetition loops.

19 languages · View on HuggingFace →

🎙

244M · Autoregressive

Whisper Small

Preferred for Afro-Asiatic languages. Strong language model prior aids complex morphology.

19 languages · View on HuggingFace →

🎙

39M · Autoregressive

Whisper Tiny

Ultra-lightweight edge deployment. Leads on Fula. Runs on mobile hardware.

19 languages · View on HuggingFace →

Quick Start

from transformers import pipeline

# Replace {language} with any of the 19 ISO codes
# e.g. lug, amh, sna, ewe, orm ...
asr = pipeline("automatic-speech-recognition",
               model="waxal-benchmarking/mms-300m-waxal-{language}")

result = asr("audio.wav")
print(result["text"])

People

The Team

31 researchers across 3 continents

Victor Tolulope Olufemi· Oreoluwa Babatunde· Ramsey Njema· Bolarinwa Gbotemi· Wanchi Lucia Yen· John Uzodinma· Sunday Ajayi· Oluwademilade Williams· Kausar Moshood· Innocent Elendu Anyaele· Akebert Tesfahunegn Arefaine· Candace Hunzwi· Wongel Dawit Daniel· Emmilly Immaculate Namuganga· Cleophas Kadima· Athanase Biluge Bahizire· Onitsiky Ranaivoson· Emmanuel Aaron· Nicholaus Dismas Ladislaus· Idris Muhammed· Jonathan Enoch Simenya· Martin Koome· Matewos Tegete Endaylalu· Peter Ifeoluwa Adeyemo· Hondi Prisca Birindwa· Ukachi Agnes Eze-Mbey· Yacoba Oduro-Yeboah· Toluwani Aremu· Pericles Adjovi· Mikel K Ngueajio· Prasenjit Mitra

🧪

LyngualLabs

Compute · Researchers · Storage

Carnegie Mellon University Africa

Researchers · Native Speakers

Linguistic Acknowledgements: Ajara Oyinloye · Abubakari Sadic Mohammed · Hafiz Adjei · Aliga Norah Lele · Marie-Louise B. Ndamuso · Odong Diana

Reference

Cite this work

BibTeX

@article{waxalnet2026,
  title  = {The WAXAL ASR Benchmark: Fine-Tuned Edge Models Across 19 African Languages},
  author = {Olufemi, Victor Tolulope and Babatunde, Oreoluwa and Njema, Ramsey and
            Gbotemi, Bolarinwa and Yen, Wanchi Lucia and Uzodinma, John and
            Ajayi, Sunday and Williams, Oluwademilade and Moshood, Kausar and
            Anyaele, Innocent Elendu and Arefaine, Akebert Tesfahunegn and
            Hunzwi, Candace and Daniel, Wongel Dawit and Namuganga, Emmilly Immaculate and
            Kadima, Cleophas and Bahizire, Athanase Biluge and Ranaivoson, Onitsiky and
            Aaron, Emmanuel and Ladislaus, Nicholaus Dismas and Muhammed, Idris and
            Simenya, Jonathan Enoch and Koome, Martin and Endaylalu, Matewos Tegete and
            Adeyemo, Peter Ifeoluwa and Birindwa, Hondi Prisca and Eze-Mbey, Ukachi Agnes and
            Oduro-Yeboah, Yacoba and Aremu, Toluwani and Adjovi, Pericles and
            Ngueajio, Mikel K and Mitra, Prasenjit},
  year   = {2026},
  note   = {arXiv preprint arXiv:2606.02375}
}

WAXALNet African ASR Benchmark

The Paper

The WAXAL ASR Benchmark: Fine-Tuned Edge Models Across 19 African Languages

Results

WAXALNet Models

MMS-300M

Whisper Small

Whisper Tiny

The Team

Cite this work

WAXALNet
African ASR Benchmark