Close Menu
TechCentralTechCentral

    Subscribe to the newsletter

    Get the best South African technology news and analysis delivered to your e-mail inbox every morning.

    Facebook X (Twitter) YouTube LinkedIn
    WhatsApp Facebook X (Twitter) LinkedIn YouTube
    TechCentralTechCentral
    • News
      Icasa caught in the political crossfire over Starlink - Elon Musk

      Icasa caught in the political crossfire over Starlink

      24 April 2026
      Malatsi runs out of patience with Icasa on BEE reform - Solly Malatsi

      Malatsi runs out of patience with Icasa on BEE reform

      24 April 2026
      DeepSeek's long-awaited V4 model enters preview

      DeepSeek’s long-awaited V4 model enters preview

      24 April 2026
      South Africa planning big overhaul of public sector IT - State IT Agency Sita

      South Africa planning big overhaul of public sector IT

      23 April 2026
      Usaasa's 30-year run nears its end - Communications minister Solly Malatsi. Image c/o DCDT

      Usaasa’s 30-year run nears its end

      23 April 2026
    • World
      More organic compounds detected on Mars - Nasa Curiosity rover

      More organic compounds detected on Mars

      21 April 2026
      Adobe bets on AI agents to fend off cheaper rivals

      Adobe bets on AI agents to fend off cheaper rivals

      16 April 2026
      Google poised to lose ad crown to Meta

      Google poised to lose ad crown to Meta

      14 April 2026
      Grand Theft Data - hackers hit Rockstar Games - Grand Theft Auto

      Grand Theft Data – hackers hit Rockstar Games

      14 April 2026
      UK PM Keir Starmer declares war on doomscrolling

      UK PM Keir Starmer declares war on doomscrolling

      13 April 2026
    • In-depth
      Africa switches on as Europe dims the lights

      Africa switches on as Europe dims the lights

      9 April 2026
      The biggest untapped EV market on Earth is hiding in plain sight

      The biggest untapped EV market on Earth is hiding in plain sight

      1 April 2026
      The R18-billion tech giant hiding in plain sight - Jens Montanana

      The R16-billion tech giant hiding in plain sight

      26 March 2026
      The last generation of coders

      The last generation of coders

      18 February 2026
      Sentech is in dire straits

      Sentech is in dire straits

      10 February 2026
    • TCS

      TCS+ | ‘The ISP for ISPs’: Vox’s shift to wholesale aggregator

      20 April 2026
      TCS | Werner Lindemann on how AI is rewriting the infosec rulebook

      TCS | Werner Lindemann on how AI is rewriting the infosec rulebook

      15 April 2026
      TCS | Donovan Marsh on AI and the future of filmmaking

      TCS | Donovan Marsh on AI and the future of filmmaking

      7 April 2026
      TCS+ | Vodacom Business moves to crack the SME tech gap - Andrew Fulton, Sannesh Beharie

      TCS+ | Vodacom Business moves to crack the SME tech gap

      7 April 2026
      TCS | MTN's Divysh Joshi on the strategy behind Pi - Divyesh Joshi

      TCS | MTN’s Divyesh Joshi on the strategy behind Pi

      1 April 2026
    • Opinion
      The conflict of interest at the heart of PayShap's slow adoption - Cheslyn Jacobs

      The conflict of interest at the heart of PayShap’s slow adoption

      26 March 2026
      South Africa's energy future hinges on getting wheeling right - Aishah Gire

      South Africa’s energy future hinges on getting wheeling right

      10 March 2026
      Hold the doom: the case for a South African comeback - Duncan McLeod

      Apple just dropped a bomb on the Windows world

      5 March 2026
      R230-million in the bag for Endeavor's third Harvest Fund - Alison Collier

      VC’s centre of gravity is shifting – and South Africa is in the frame

      3 March 2026
      Hold the doom: the case for a South African comeback - Duncan McLeod

      Hold the doom: the case for a South African comeback

      26 February 2026
    • Company Hubs
      • 1Stream
      • Africa Data Centres
      • AfriGIS
      • Altron Digital Business
      • Altron Document Solutions
      • Altron Group
      • Arctic Wolf
      • Ascent Technology
      • AvertITD
      • BBD
      • Braintree
      • CallMiner
      • CambriLearn
      • CYBER1 Solutions
      • Digicloud Africa
      • Digimune
      • Domains.co.za
      • ESET
      • Euphoria Telecom
      • HOSTAFRICA
      • Incredible Business
      • iONLINE
      • IQbusiness
      • Iris Network Systems
      • Kaspersky
      • LSD Open
      • Mitel
      • NEC XON
      • Netstar
      • Network Platforms
      • Next DLP
      • Ovations
      • Paracon
      • Paratus
      • Q-KON
      • SevenC
      • SkyWire
      • Solid8 Technologies
      • Telit Cinterion
      • Telviva
      • Tenable
      • Vertiv
      • Videri Digital
      • Vodacom Business
      • Wipro
      • Workday
      • XLink
    • Sections
      • AI and machine learning
      • Banking
      • Broadcasting and Media
      • Cloud services
      • Contact centres and CX
      • Cryptocurrencies
      • Education and skills
      • Electronics and hardware
      • Energy and sustainability
      • Enterprise software
      • Financial services
      • HealthTech
      • Information security
      • Internet and connectivity
      • Internet of Things
      • Investment
      • IT services
      • Lifestyle
      • Motoring
      • Policy and regulation
      • Public sector
      • Retail and e-commerce
      • Satellite communications
      • Science
      • SMEs and start-ups
      • Social media
      • Talent and leadership
      • Telecoms
    • Events
    • Advertise
    TechCentralTechCentral
    Home » Sections » AI and machine learning » Haibo! AI language models for Zulu and Sotho in the works

    Haibo! AI language models for Zulu and Sotho in the works

    New language tools are being built that allow speakers of indigenous African languages to interact with the latest AI apps.
    By Nkosinathi Ndlovu3 April 2024
    Twitter LinkedIn Facebook WhatsApp Email Telegram Copy Link
    News Alerts
    WhatsApp
    Jade Abbott

    New language tools are being built that allow speakers of indigenous African languages to interact with the latest artificial intelligence applications.

    Lelapa AI is a local AI research and product lab that is building “language technology”, including large language models (LLMs), using indigenous African languages such as isiZulu and seSotho, to help speakers of these languages interact with the latest tools.

    TechCentral spoke with Jade Abbott, chief technology officer at Lelapa (the seSotho word for “home”), to learn more about the challenges and opportunities in the natural language processing (NLP) space.

    Building NLP tools for indigenous languages is not as easy as it is for languages such as English and French

    “The internet is over 90% English; this means that only certain parts of the world have access to this powerful tool,” said Abbott. “We need to build the language technology that ensures we are represented as a continent, that makes digital knowledge and services accessible to us.”

    But building NLP tools for indigenous languages is not as easy as it is for languages such as English and French, Abbott said. Described by NLP experts as “high-resource” languages, French and English have large data sets available on the internet that can be “scraped” and used to train new NLP tools. In contrast, “low-resource” languages such as isiZulu and seSotho do not have vast data sets available for scraping, which makes developing computational tools for processing these languages more difficult.

    ‘Do it from scratch’

    To get around this problem, Lelapa uses a “do it from scratch” approach and creates the data required to train the models that they produce. This methodology has its own complexities:

    • Firstly, languages are large and nuanced, so training models on them requires massive data sets;
    • Secondly, the computing capacity required to train these models is vast and therefore costly; and
    • Thirdly, the standard tools used to evaluate the efficacy of language processing tools work well for languages like English but are less useful for indigenous languages.

    Lelapa employs various strategies to get around these complexities. The first involves shrinking the application domain for the model being built so that the resulting model is as small at it can be to solve the problem being addressed.

    This has the added benefit that the compute resources required to build the model are also minimised, which drives down costs.

    “We build our models similarly to how an engineer might build a bridge,” said Abbott. “We know exactly how well the model works within a specified domain and what the tolerances are. We don’t try to build a generalisable tool that is going to work everywhere because there is not enough data – it is not going to work.”

    The specified domain can be finance or agriculture, for example. But Lelapa also makes use of native language speakers throughout the development process to ensure its models are accurate. This is especially important in the evaluation phase of the process, where standardised tools such as the Bleu score are not as effective for indigenous languages.

    A third component of Lelapa’s development strategy is to use tools that fit the problem, a methodology that sometimes leads to the exclusion of AI in lieu of a more straightforward computational solution, said Abbott.

    “When the application domain is well understood, you sometimes don’t want to add a generative tool because of the complexity that comes with that,” she said.

    Before deciding on using these tools, companies must evaluate how well they work for their specific use case

    According to Abbott, the company is seeing most demand for its transcription and conversational products. Lelapa tools are being used in the financial sector where clients such as banks are able to coax their less digitally savvy clients onto digital platforms knowing that the customer support for these apps can be facilitated in the customer’s native language wherever it is needed.

    Call centres are also making use of Lelapa’s tools, especially for quality control, where AI is being used to evaluate interactions between agents and customers to ensure that company representatives are “not overpromising” in sales calls to non-English speaking clients, for example.

    Read: Google apologises for ‘woke’ AI tool

    “Before deciding on using these tools, companies must evaluate how well they work for their specific use case and see how it will augment their people rather than replace them. We are still a long way off from AI being powerful enough to replace humans, but carefully considering how it might augment workers will help derive more value from it,” said Abbott.  – © 2024 NewsCentral Media

    Get breaking news alerts from TechCentral on WhatsApp

    Follow TechCentral on Google News Add TechCentral as your preferred source on Google


    Jade Abbott Lelapa Lelapa AI
    WhatsApp YouTube
    Share. Facebook Twitter LinkedIn WhatsApp Telegram Email Copy Link
    Previous ArticleMicrosoft claims breakthrough in quantum computing
    Next Article Google may charge for AI-powered search engine

    Related Posts

    Company News
    Cybersecurity in the age of AI: why speed and trust now define resilience - iqbusiness

    Cybersecurity in the AI age: speed and trust define resilience

    24 April 2026
    Security by design is the channel's strongest pitch - Othelo Vieira

    Security by design is the channel’s strongest pitch

    23 April 2026
    Your brand is invisible to the AI that's choosing your competitor - Michelle Losco

    Your brand is invisible to the AI that’s choosing your competitor

    23 April 2026
    Opinion
    The conflict of interest at the heart of PayShap's slow adoption - Cheslyn Jacobs

    The conflict of interest at the heart of PayShap’s slow adoption

    26 March 2026
    South Africa's energy future hinges on getting wheeling right - Aishah Gire

    South Africa’s energy future hinges on getting wheeling right

    10 March 2026
    Hold the doom: the case for a South African comeback - Duncan McLeod

    Apple just dropped a bomb on the Windows world

    5 March 2026

    Subscribe to Updates

    Get the best South African technology news and analysis delivered to your e-mail inbox every morning.

    Latest Posts
    Icasa caught in the political crossfire over Starlink - Elon Musk

    Icasa caught in the political crossfire over Starlink

    24 April 2026
    Cybersecurity in the age of AI: why speed and trust now define resilience - iqbusiness

    Cybersecurity in the AI age: speed and trust define resilience

    24 April 2026
    Malatsi runs out of patience with Icasa on BEE reform - Solly Malatsi

    Malatsi runs out of patience with Icasa on BEE reform

    24 April 2026
    DeepSeek's long-awaited V4 model enters preview

    DeepSeek’s long-awaited V4 model enters preview

    24 April 2026
    © 2009 - 2026 NewsCentral Media
    • Cookie policy (ZA)
    • TechCentral – privacy and Popia

    Type above and press Enter to search. Press Esc to cancel.

    Manage consent

    TechCentral uses cookies to enhance its offerings. Consenting to these technologies allows us to serve you better. Not consenting or withdrawing consent may adversely affect certain features and functions of the website.

    Functional Always active
    The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
    Preferences
    The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
    Statistics
    The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
    Marketing
    The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
    • Manage options
    • Manage services
    • Manage {vendor_count} vendors
    • Read more about these purposes
    View preferences
    • {title}
    • {title}
    • {title}