The Moment Nigeria

Pleias and GSMA launch CommonLingua, an Open-Source AI Model supporting 61 African languages

by Honesty Victor
April 29, 2026

Pleias and the GSMA have announced the release of CommonLingua, an open-source language identification (LID) model purpose-built to unlock African language data at scale. The model is delivered under the GSMA’s “AI Language Models in Africa, by Africa, for Africa” initiative, a coalition dedicated to closing the African language gap in AI.

Africa is home to more than 2,000 living languages, many of which remain underrepresented in AI training data. As a result, language identification systems often perform less reliably on African-language content, particularly when distinguishing between closely related languages or handling code-mixed text. Before a Swahili, Yoruba, or Wolof language model can be built, the underlying text must first be correctly identified by language, and this is the step at which existing tools often fail on African content.

This is because leading LID systems such as fastText, GlotLID, and OpenLID were built around European and Asian high-resource languages and frequently mislabel African-language text as English or French. Even state-of-the-art frontier models drop roughly 30 points in accuracy on African languages compared to major world languages.

CommonLingua is designed to fix this first step of the pipeline. On the new CommonLID benchmark, CommonLingua achieves 83% accuracy and a macro F1 score of 0.79, outperforming leading LID models by more than 10 percentage points under comparable evaluation conditions while using roughly one three-hundredth of their parameters. The model is lightweight: it has 2 million parameters and ships as an 8 MB checkpoint. It is designed for efficient deployment, running approximately 20 texts per second on CPU and up to 3,000 texts per second on a single GPU.
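For readers unfamiliar with the two metrics quoted above, the following is a minimal sketch of how accuracy and macro F1 are commonly computed for a language identification task. The labels are hypothetical toy data, not CommonLID results, and the function names are illustrative. Macro F1 is averaged here over the languages present in the gold labels; some toolkits average over the union of gold and predicted labels instead, which can give a different figure.

```python
from collections import Counter


def accuracy(gold, pred):
    """Fraction of texts whose predicted language matches the gold label."""
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def macro_f1(gold, pred):
    """Unweighted mean of per-language F1, taken over the gold label set."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted language gains a false positive
            fn[g] += 1  # true language gains a false negative
    scores = []
    for lang in sorted(set(gold)):
        denom = 2 * tp[lang] + fp[lang] + fn[lang]
        scores.append(2 * tp[lang] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical toy labels (ISO 639-3 codes): one Yoruba text mislabelled as English.
gold = ["swh", "swh", "swh", "swh", "yor", "wol"]
pred = ["swh", "swh", "swh", "swh", "eng", "wol"]

print(f"accuracy = {accuracy(gold, pred):.3f}")
print(f"macro F1 = {macro_f1(gold, pred):.3f}")
```

In this toy case a single error on the only Yoruba text lowers macro F1 far more than accuracy, because every language counts equally in the average regardless of how many texts it has. That property is why macro F1 is often reported alongside accuracy when many low-resource languages are evaluated together.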

CommonLingua covers 334 languages in total, including 61 African languages across eight language families: Bantu (21), Niger-Congo / West African (18), Afro-Asiatic and Semitic (7), Cushitic and Chadic (4), Berber (3), Nilo-Saharan (3), and pidgins, creoles, and other (5). The model operates directly on UTF-8 byte sequences rather than relying on a language-specific tokenizer, enabling consistent handling across scripts including Latin, Arabic, Ethiopic, N’Ko, and Tifinagh.
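To illustrate what operating on UTF-8 bytes means in practice, here is a minimal sketch, assuming only that the input is the raw UTF-8 byte values of the text. It is not CommonLingua's actual preprocessing code; the function name and sample strings are illustrative.

```python
def to_byte_ids(text):
    """Encode text as UTF-8 and return its byte values, each in the range 0-255."""
    return list(text.encode("utf-8"))


# Illustrative samples in several of the scripts mentioned above.
samples = {
    "Latin (Swahili)": "Habari",
    "Arabic": "سلام",
    "Ethiopic (Amharic)": "ሰላም",
    "N'Ko": "ߒߞߏ",
    "Tifinagh": "ⵜⵉⴼⵉⵏⴰⵖ",
}

for script, text in samples.items():
    ids = to_byte_ids(text)
    # Characters outside ASCII expand to two or three bytes each,
    # but every value still falls in the same fixed 256-symbol range.
    print(f"{script}: {len(text)} characters -> {len(ids)} bytes, first bytes {ids[:6]}")
```

Whatever the script, the input alphabet stays fixed at 256 possible byte values, so no per-language vocabulary or tokenizer has to be built or maintained.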

The model is trained exclusively on open-licensed and public domain content aggregated through the Common Corpus project, including Wikipedia, scientific publications in OpenAlex, VOA Africa, WaxalNLP, Cultural Heritage, and Pralekha. All datasets are released under permissive licenses.

This conversation will continue at MWC26 Kigali, where GSMA and partners will bring together industry leaders to accelerate progress on African-language AI. Register now to be part of the discussion.

About Us

Themomentng.com is an online community of reporters and social advocates dedicated to bringing you features, news reports by Africans, but from a global perspective.

Contact Us

+447771081433
+2348051966180(WhatsApp/SMS Only)
Email: themomentng@gmail.com

Copyright © Themomentng.com. All Rights Reserved.