Close Menu
TechCentralTechCentral

    Subscribe to the newsletter

    Get the best South African technology news and analysis delivered to your e-mail inbox every morning.

    Facebook X (Twitter) YouTube LinkedIn
    WhatsApp Facebook X (Twitter) LinkedIn YouTube
    TechCentralTechCentral
    • News
      Altron walked away from multiple M&A deals - Werner Kapp

      Altron walked away from multiple M&A deals

      25 May 2026
      Altron expects big jump in full-year earnings - Werner Kapp

      Altron surprises with special dividend

      25 May 2026
      Sita, Sars rubbish reports they were hacked

      Sita, Sars rubbish reports they were hacked

      25 May 2026
      Cape Town pioneers pooled wheeling of renewable electricity

      Cape Town pioneers pooled wheeling of renewable electricity

      25 May 2026
      Pick n Pay's online growth slows as Sixty60 lead widens - Sean Summers

      Pick n Pay’s online growth slows as Sixty60 lead widens

      25 May 2026
    • World
      Pope urges world to hit brakes on AI - Pope Leo

      Pope urges world to hit brakes on AI

      25 May 2026
      SpaceX's record-setting IPO is here

      SpaceX’s record-setting IPO is here

      21 May 2026
      The Mythos hacking threat is looking overblown

      The Mythos hacking threat is looking overblown

      20 May 2026
      Vatican confronts the age of artificial intelligence. Edgar Beltrán/The Pillar 

      Vatican confronts the age of artificial intelligence

      19 May 2026
      The walkout that could hit every laptop and AI server - Samsung

      The walkout that could hit every laptop and AI server

      18 May 2026
    • In-depth
      Alfa's electric rebel - Alfa Romeo Junior Elettrica Veloce

      Alfa’s electric rebel

      29 April 2026
      Africa switches on as Europe dims the lights

      Africa switches on as Europe dims the lights

      9 April 2026
      The biggest untapped EV market on Earth is hiding in plain sight

      The biggest untapped EV market on Earth is hiding in plain sight

      1 April 2026
      Datatec is firing on all cylinders - Jens Montanana

      The R16-billion tech giant hiding in plain sight

      26 March 2026
      The last generation of coders

      The last generation of coders

      18 February 2026
    • TCS
      TCS+ | The Up&Up Group on the hidden cost of AI - Jason Harrison

      TCS+ | The Up&Up Group on the hidden cost of AI

      13 May 2026
      Michael Rossouw

      TCS+ | The retirement decision most South Africans get wrong

      6 May 2026
      TCS | The Cape Town start-up listening for TB with AI - Braden van Breda

      TCS | The Cape Town start-up listening for TB with AI

      4 May 2026

      TCS+ | ‘The ISP for ISPs’: Vox’s shift to wholesale aggregator

      20 April 2026
      TCS | Werner Lindemann on how AI is rewriting the infosec rulebook

      TCS | Werner Lindemann on how AI is rewriting the infosec rulebook

      15 April 2026
    • Opinion
      Treasury's crypto crackdown is a betrayal of Mandela's promise - Duncan McLeod

      Treasury’s crypto crackdown is a betrayal of Mandela’s promise

      22 May 2026
      South Africa is sleepwalking into another AI policy failure - Celeste Labuschagne

      South Africa is sleepwalking into another AI policy failure

      20 May 2026
      AI won't fix your culture - it will expose it - Jackie Kennedy

      AI won’t fix your culture – it will expose it

      19 May 2026
      Treasury's crypto crackdown is a betrayal of Mandela's promise - Duncan McLeod

      Free calls, dead voice and Shameel Joosub’s Spanish ghost

      22 April 2026
      The conflict of interest at the heart of PayShap's slow adoption - Cheslyn Jacobs

      The conflict of interest at the heart of PayShap’s slow adoption

      26 March 2026
    • Company Hubs
      • 1Stream
      • Africa Data Centres
      • AfriGIS
      • Altron Digital Business
      • Altron Document Solutions
      • Altron Group
      • Arctic Wolf
      • Ascent Technology
      • AvertITD
      • BBD
      • Braintree
      • CallMiner
      • CambriLearn
      • CM Telecom
      • Contactable
      • CYBER1 Solutions
      • Digicloud Africa
      • Digimune
      • Domains.co.za
      • ESET
      • Euphoria Telecom
      • HOSTAFRICA
      • Incredible Business
      • iONLINE
      • IQbusiness
      • Iris Network Systems
      • Kaspersky
      • LSD Open
      • Mitel
      • NEC XON
      • Netstar
      • Network Platforms
      • Next DLP
      • Ovations
      • Paracon
      • Paratus
      • Q-KON
      • SevenC
      • SkyWire
      • Solid8 Technologies
      • Telit Cinterion
      • Telviva
      • Tenable
      • Vertiv
      • Videri Digital
      • Vodacom Business
      • Wipro
      • Workday
      • XLink
    • Sections
      • AI and machine learning
      • Banking
      • Broadcasting and Media
      • Cloud services
      • Contact centres and CX
      • Cryptocurrencies
      • Education and skills
      • Electronics and hardware
      • Energy and sustainability
      • Enterprise software
      • Financial services
      • HealthTech
      • Information security
      • Internet and connectivity
      • Internet of Things
      • Investment
      • IT services
      • Lifestyle
      • Motoring
      • Policy and regulation
      • Public sector
      • Retail and e-commerce
      • Satellite communications
      • Science
      • SMEs and start-ups
      • Social media
      • Talent and leadership
      • Telecoms
    • Events
    • Advertise
    TechCentralTechCentral
    Home » Sections » AI and machine learning » Jack Ma-backed Ant Group touts AI breakthrough using Chinese chips

    Jack Ma-backed Ant Group touts AI breakthrough using Chinese chips

    Ant Group has reportedly used Chinese chips to develop techniques for training AI models that would cut costs by 20%.
    By Agency Staff24 March 2025
    Twitter LinkedIn Facebook WhatsApp Email Telegram Copy Link
    News Alerts
    WhatsApp
    Jack Ma-backed Ant touts AI breakthrough on Chinese chips
    Billionaire Chinese businessman Jack Ma

    Jack Ma-backed Ant Group used Chinese-made chips to develop techniques for training AI models that would cut costs by 20%, according to people familiar with the matter.

    Ant used domestic chips, including from affiliate Alibaba Group and Huawei Technologies, to train models using the so-called Mixture of Experts machine learning approach, the people said.

    It got results similar to those from Nvidia chips like the H800, they said, asking not to be named as the information isn’t public. Ant is still using Nvidia for AI development but is now relying mostly on alternatives including from AMD and Chinese chips for its latest models, one of the people said.

    As companies pour significant money into AI, Mixture of Experts models have emerged as a popular option

    The models mark Ant’s entry into a race between Chinese and US companies that’s accelerated since DeepSeek demonstrated how capable models can be trained for far less than the billions invested by OpenAI and Google. It underscores how Chinese companies are trying to use local alternatives to the most advanced Nvidia semiconductors. While not the most advanced, the H800 is a relatively powerful processor and currently barred by the US from China.

    The company published a research paper this month that claimed its models at times outperformed Meta Platforms in certain benchmarks. If they work as advertised, Ant’s platforms could mark another step forward for Chinese AI development by slashing the cost of inferencing or supporting AI services.

    ‘Without premium GPUs’

    As companies pour significant money into AI, MoE models have emerged as a popular option, gaining recognition for their use by Google and Hangzhou start-up DeepSeek, among others. That technique divides tasks into smaller sets of data, very much like having a team of specialists who each focus on a segment of a job, making the process more efficient. Ant declined to comment.

    However, the training of MoE models typically relies on high-performing chips like the graphics processing units Nvidia sells. The cost has to date been prohibitive for many small firms and limited broader adoption. Ant has been working on ways to train LLMs more efficiently and eliminate that constraint. Its paper title makes that clear, as the company sets the goal to scale a model “without premium GPUs”.

    Read: OpenAI study finds links between ChatGPT use and loneliness

    That goes against the grain of Nvidia. CEO Jensen Huang has argued that computation demand will grow even with the advent of more efficient models like DeepSeek’s R1, positing that companies will need better chips to generate more revenue, not cheaper ones to cut costs. He’s stuck to a strategy of building big GPUs with more processing cores, transistors and increased memory capacity.

    Ant said it cost about C¥6.35 million yuan (R16-million) to train one trillion tokens using high-performance hardware, but its optimised approach would cut that down to C¥5.1-million using lower-specification hardware. Tokens are the units of information that a model ingests in order to learn about the world and deliver useful responses to user queries.

    The company plans to leverage the recent breakthrough in the large language models it has developed, Ling-Plus and Ling-Lite, for industrial AI solutions including health care and finance, the people said.

    Ant bought Chinese online platform Haodf.com this year to beef up its AI services in health care. It also has an AI “life assistant” app called Zhixiaobao and a financial advisory AI service Maxiaocai.

    On English-language understanding, Ant said in its paper that the Ling-Lite model did better in a key benchmark compared with one of Meta’s Llama models. Both Ling-Lite and Ling-Plus models outperformed DeepSeek’s equivalents on Chinese-language benchmarks.

    Ling-Plus has 290 billion parameters, which is considered relatively large in the realm of language models

    “If you find one point of attack to beat the world’s best kung fu master, you can still say you beat them, which is why real-world application is important,” said Robin Yu, chief technology officer of Beijing-based AI solution provider Shengshang Tech.

    Ant has made the Ling models open source. Ling-Lite contains 16.8 billion parameters, which are the adjustable settings that work like knobs and dials to direct the model’s performance. Ling-Plus has 290 billion parameters, which is considered relatively large in the realm of language models. For comparison, experts estimate that ChatGPT’s GPT-4.5 has 1.8 trillion parameters, according to the MIT Technology Review. DeepSeek-R1 has 671 billion.

    Ant faced challenges in some areas of the training, including stability. Even small changes in the hardware or the model’s structure led to problems, including jumps in the models’ error rate, it said in the paper.  — Lulu Yilun Chen, with Debby Wu, (c) 2025 Bloomberg LP

    Get breaking news from TechCentral on WhatsApp. Sign up here

    Don’t miss:

    Tesla is flailing in China – and the rapid rise of BYD is to blame

    Follow TechCentral on Google News Add TechCentral as your preferred source on Google


    Ant Ant Financial Ant Group Jack Ma Tencent
    WhatsApp YouTube
    Share. Facebook Twitter LinkedIn WhatsApp Telegram Email Copy Link
    Previous Article7 ways you can fortify your financial business against modern threats
    Next Article Google names graduates of its South African start-ups programme

    Related Posts

    M-Net pioneer Cobus Stofberg steps down from Naspers, Prosus boards

    M-Net pioneer Cobus Stofberg steps down from Naspers, Prosus boards

    20 August 2025
    Huawei claims chip design breakthrough

    China is behind in AI chips – but for how much longer?

    13 June 2025
    Nvidia CEO says China is catching up fast in AI chip race - Jensen Huang

    Nvidia CEO says China is catching up fast in AI chip race

    29 May 2025
    Company News
    Retro Rabbit / SmarTek21 refines the art and science of product delivery - Rouan van der Walt

    Retro Rabbit / SmarTek21 refines the art and science of product delivery

    25 May 2026
    Webinar today: a 30-day plan to protect your SME from cyberattacks - SevenC

    Webinar today: a 30-day plan to protect your SME from cyberattacks

    25 May 2026
    How African enterprises can leapfrog the AI infrastructure trap - Huawei Cloud

    How African enterprises can leapfrog the AI infrastructure trap

    22 May 2026
    Opinion
    Treasury's crypto crackdown is a betrayal of Mandela's promise - Duncan McLeod

    Treasury’s crypto crackdown is a betrayal of Mandela’s promise

    22 May 2026
    South Africa is sleepwalking into another AI policy failure - Celeste Labuschagne

    South Africa is sleepwalking into another AI policy failure

    20 May 2026
    AI won't fix your culture - it will expose it - Jackie Kennedy

    AI won’t fix your culture – it will expose it

    19 May 2026

    Subscribe to Updates

    Get the best South African technology news and analysis delivered to your e-mail inbox every morning.

    Latest Posts
    Altron walked away from multiple M&A deals - Werner Kapp

    Altron walked away from multiple M&A deals

    25 May 2026
    Altron expects big jump in full-year earnings - Werner Kapp

    Altron surprises with special dividend

    25 May 2026
    Sita, Sars rubbish reports they were hacked

    Sita, Sars rubbish reports they were hacked

    25 May 2026
    Cape Town pioneers pooled wheeling of renewable electricity

    Cape Town pioneers pooled wheeling of renewable electricity

    25 May 2026
    © 2009 - 2026 NewsCentral Media
    • Cookie policy (ZA)
    • TechCentral – privacy and Popia

    Type above and press Enter to search. Press Esc to cancel.

    Manage consent

    TechCentral uses cookies to enhance its offerings. Consenting to these technologies allows us to serve you better. Not consenting or withdrawing consent may adversely affect certain features and functions of the website.

    Functional Always active
    The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
    Preferences
    The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
    Statistics
    The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
    Marketing
    The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
    • Manage options
    • Manage services
    • Manage {vendor_count} vendors
    • Read more about these purposes
    View preferences
    • {title}
    • {title}
    • {title}