Close Menu
TechCentralTechCentral

    Subscribe to the newsletter

    Get the best South African technology news and analysis delivered to your e-mail inbox every morning.

    Facebook X (Twitter) YouTube LinkedIn
    WhatsApp Facebook X (Twitter) LinkedIn YouTube
    TechCentralTechCentral
    • News
      MultiChoice scraps annual DStv price hikes for 2026 - David Mignot

      MultiChoice scraps annual DStv price hike

      20 February 2026
      What Gen Z really thinks about the tech world it inherited - Tinashe Mazodze

      What Gen Z really thinks about the tech world it inherited

      20 February 2026
      Showmax 'can't continue' in its current form

      Showmax ‘can’t continue’ in its current form

      20 February 2026
      Free Market Foundation slams treasury's proposed gambling tax

      Free Market Foundation slams treasury’s proposed gambling tax

      20 February 2026
      South Africa's dynamic spectrum breakthrough - Paul Colmer

      South Africa’s dynamic spectrum breakthrough

      20 February 2026
    • World
      Prominent Southern African journalist targeted with Predator spyware

      Prominent Southern African journalist targeted with Predator spyware

      18 February 2026
      More drama in Warner Bros tug of war

      More drama in Warner Bros tug of war

      17 February 2026
      Russia bans WhatsApp

      Russia bans WhatsApp

      12 February 2026
      EU regulators take aim at WhatsApp

      EU regulators take aim at WhatsApp

      9 February 2026
      Musk hits brakes on Mars mission

      Musk hits brakes on Mars mission

      9 February 2026
    • In-depth
      How liberalisation is rewiring South Africa's power sector

      How liberalisation is rewiring South Africa’s power sector

      21 January 2026
      The top-performing South African tech shares of 2025

      The top-performing South African tech shares of 2025

      12 January 2026
      Digital authoritarianism grows as African states normalise internet blackouts

      Digital authoritarianism grows as African states normalise internet blackouts

      19 December 2025
      TechCentral's South African Newsmakers of 2025

      TechCentral’s South African Newsmakers of 2025

      18 December 2025
      Black Friday goes digital in South Africa as online spending surges to record high

      Black Friday goes digital in South Africa as online spending surges to record high

      4 December 2025
    • TCS
      Watts & Wheels S1E4: 'We drive an electric Uber'

      Watts & Wheels S1E4: ‘We drive an electric Uber’

      10 February 2026
      TCS+ | How Cloud On Demand is helping SA businesses succeed in the cloud - Xhenia Rhode, Dion Kalicharan

      TCS+ | Cloud On Demand and Consnet: inside a real-world AWS partner success story

      30 January 2026
      Watts & Wheels S1E4: 'We drive an electric Uber'

      Watts & Wheels S1E3: ‘BYD’s Corolla Cross challenger’

      30 January 2026
      Watts & Wheels S1E4: 'We drive an electric Uber'

      Watts & Wheels S1E2: ‘China attacks, BMW digs in, Toyota’s sublime supercar’

      23 January 2026

      TCS+ | Why cybersecurity is becoming a competitive advantage for SA businesses

      20 January 2026
    • Opinion
      A million reasons monopolies don't work - Duncan McLeod

      A million reasons monopolies don’t work

      10 February 2026
      The author, Business Leadership South Africa CEO Busi Mavuso

      Eskom unbundling U-turn threatens to undo hard-won electricity gains

      9 February 2026
      South Africa's skills advantage is being overlooked at home - Richard Firth

      South Africa’s skills advantage is being overlooked at home

      29 January 2026
      Why Elon Musk's Starlink is a 'hard no' for me - Songezo Zibi

      Why Elon Musk’s Starlink is a ‘hard no’ for me

      26 January 2026
      A million reasons monopolies don't work - Duncan McLeod

      South Africa’s new fibre broadband battle

      20 January 2026
    • Company Hubs
      • Africa Data Centres
      • AfriGIS
      • Altron Digital Business
      • Altron Document Solutions
      • Altron Group
      • Arctic Wolf
      • AvertITD
      • Braintree
      • CallMiner
      • CambriLearn
      • CYBER1 Solutions
      • Digicloud Africa
      • Digimune
      • Domains.co.za
      • ESET
      • Euphoria Telecom
      • Incredible Business
      • iONLINE
      • IQbusiness
      • Iris Network Systems
      • LSD Open
      • Mitel
      • NEC XON
      • Netstar
      • Network Platforms
      • Next DLP
      • Ovations
      • Paracon
      • Paratus
      • Q-KON
      • SevenC
      • SkyWire
      • Solid8 Technologies
      • Telit Cinterion
      • Tenable
      • Vertiv
      • Videri Digital
      • Vodacom Business
      • Wipro
      • Workday
      • XLink
    • Sections
      • AI and machine learning
      • Banking
      • Broadcasting and Media
      • Cloud services
      • Contact centres and CX
      • Cryptocurrencies
      • Education and skills
      • Electronics and hardware
      • Energy and sustainability
      • Enterprise software
      • Financial services
      • Information security
      • Internet and connectivity
      • Internet of Things
      • Investment
      • IT services
      • Lifestyle
      • Motoring
      • Public sector
      • Retail and e-commerce
      • Satellite communications
      • Science
      • SMEs and start-ups
      • Social media
      • Talent and leadership
      • Telecoms
    • Events
    • Advertise
    TechCentralTechCentral
    Home » Sections » AI and machine learning » Jack Ma-backed Ant Group touts AI breakthrough using Chinese chips

    Jack Ma-backed Ant Group touts AI breakthrough using Chinese chips

    Ant Group has reportedly used Chinese chips to develop techniques for training AI models that would cut costs by 20%.
    By Agency Staff24 March 2025
    Twitter LinkedIn Facebook WhatsApp Email Telegram Copy Link
    News Alerts
    WhatsApp
    Jack Ma-backed Ant touts AI breakthrough on Chinese chips
    Billionaire Chinese businessman Jack Ma

    Jack Ma-backed Ant Group used Chinese-made chips to develop techniques for training AI models that would cut costs by 20%, according to people familiar with the matter.

    Ant used domestic chips, including from affiliate Alibaba Group and Huawei Technologies, to train models using the so-called Mixture of Experts machine learning approach, the people said.

    It got results similar to those from Nvidia chips like the H800, they said, asking not to be named as the information isn’t public. Ant is still using Nvidia for AI development but is now relying mostly on alternatives including from AMD and Chinese chips for its latest models, one of the people said.

    As companies pour significant money into AI, Mixture of Experts models have emerged as a popular option

    The models mark Ant’s entry into a race between Chinese and US companies that’s accelerated since DeepSeek demonstrated how capable models can be trained for far less than the billions invested by OpenAI and Google. It underscores how Chinese companies are trying to use local alternatives to the most advanced Nvidia semiconductors. While not the most advanced, the H800 is a relatively powerful processor and currently barred by the US from China.

    The company published a research paper this month that claimed its models at times outperformed Meta Platforms in certain benchmarks. If they work as advertised, Ant’s platforms could mark another step forward for Chinese AI development by slashing the cost of inferencing or supporting AI services.

    ‘Without premium GPUs’

    As companies pour significant money into AI, MoE models have emerged as a popular option, gaining recognition for their use by Google and Hangzhou start-up DeepSeek, among others. That technique divides tasks into smaller sets of data, very much like having a team of specialists who each focus on a segment of a job, making the process more efficient. Ant declined to comment.

    However, the training of MoE models typically relies on high-performing chips like the graphics processing units Nvidia sells. The cost has to date been prohibitive for many small firms and limited broader adoption. Ant has been working on ways to train LLMs more efficiently and eliminate that constraint. Its paper title makes that clear, as the company sets the goal to scale a model “without premium GPUs”.

    Read: OpenAI study finds links between ChatGPT use and loneliness

    That goes against the grain of Nvidia. CEO Jensen Huang has argued that computation demand will grow even with the advent of more efficient models like DeepSeek’s R1, positing that companies will need better chips to generate more revenue, not cheaper ones to cut costs. He’s stuck to a strategy of building big GPUs with more processing cores, transistors and increased memory capacity.

    Ant said it cost about C¥6.35 million yuan (R16-million) to train one trillion tokens using high-performance hardware, but its optimised approach would cut that down to C¥5.1-million using lower-specification hardware. Tokens are the units of information that a model ingests in order to learn about the world and deliver useful responses to user queries.

    The company plans to leverage the recent breakthrough in the large language models it has developed, Ling-Plus and Ling-Lite, for industrial AI solutions including health care and finance, the people said.

    Ant bought Chinese online platform Haodf.com this year to beef up its AI services in health care. It also has an AI “life assistant” app called Zhixiaobao and a financial advisory AI service Maxiaocai.

    On English-language understanding, Ant said in its paper that the Ling-Lite model did better in a key benchmark compared with one of Meta’s Llama models. Both Ling-Lite and Ling-Plus models outperformed DeepSeek’s equivalents on Chinese-language benchmarks.

    Ling-Plus has 290 billion parameters, which is considered relatively large in the realm of language models

    “If you find one point of attack to beat the world’s best kung fu master, you can still say you beat them, which is why real-world application is important,” said Robin Yu, chief technology officer of Beijing-based AI solution provider Shengshang Tech.

    Ant has made the Ling models open source. Ling-Lite contains 16.8 billion parameters, which are the adjustable settings that work like knobs and dials to direct the model’s performance. Ling-Plus has 290 billion parameters, which is considered relatively large in the realm of language models. For comparison, experts estimate that ChatGPT’s GPT-4.5 has 1.8 trillion parameters, according to the MIT Technology Review. DeepSeek-R1 has 671 billion.

    Ant faced challenges in some areas of the training, including stability. Even small changes in the hardware or the model’s structure led to problems, including jumps in the models’ error rate, it said in the paper.  — Lulu Yilun Chen, with Debby Wu, (c) 2025 Bloomberg LP

    Get breaking news from TechCentral on WhatsApp. Sign up here

    Don’t miss:

    Tesla is flailing in China – and the rapid rise of BYD is to blame

    Follow TechCentral on Google News Add TechCentral as your preferred source on Google


    Ant Ant Financial Ant Group Jack Ma Tencent
    WhatsApp YouTube
    Share. Facebook Twitter LinkedIn WhatsApp Telegram Email Copy Link
    Previous Article7 ways you can fortify your financial business against modern threats
    Next Article Google names graduates of its South African start-ups programme

    Related Posts

    M-Net pioneer Cobus Stofberg steps down from Naspers, Prosus boards

    M-Net pioneer Cobus Stofberg steps down from Naspers, Prosus boards

    20 August 2025
    China is behind in AI chips - but for how much longer?

    China is behind in AI chips – but for how much longer?

    13 June 2025
    Nvidia CEO says China is catching up fast in AI chip race - Jensen Huang

    Nvidia CEO says China is catching up fast in AI chip race

    29 May 2025
    Company News
    Service is everyone's problem now - and that's exactly why the Atlassian Service Collection matters

    Service is everyone’s problem now – why the Atlassian Service Collection matters

    20 February 2026
    Customers have new expectations. Is your CX ready? 1Stream

    Customers have new expectations. Is your CX ready?

    19 February 2026
    South Africa's cybersecurity challenge is not a tool problem - Nicholas Applewhite, Trinexia South Africa

    South Africa’s cybersecurity challenge is not a tool problem

    19 February 2026
    Opinion
    A million reasons monopolies don't work - Duncan McLeod

    A million reasons monopolies don’t work

    10 February 2026
    The author, Business Leadership South Africa CEO Busi Mavuso

    Eskom unbundling U-turn threatens to undo hard-won electricity gains

    9 February 2026
    South Africa's skills advantage is being overlooked at home - Richard Firth

    South Africa’s skills advantage is being overlooked at home

    29 January 2026

    Subscribe to Updates

    Get the best South African technology news and analysis delivered to your e-mail inbox every morning.

    Latest Posts
    MultiChoice scraps annual DStv price hikes for 2026 - David Mignot

    MultiChoice scraps annual DStv price hike

    20 February 2026
    What Gen Z really thinks about the tech world it inherited - Tinashe Mazodze

    What Gen Z really thinks about the tech world it inherited

    20 February 2026
    Showmax 'can't continue' in its current form

    Showmax ‘can’t continue’ in its current form

    20 February 2026
    Free Market Foundation slams treasury's proposed gambling tax

    Free Market Foundation slams treasury’s proposed gambling tax

    20 February 2026
    © 2009 - 2026 NewsCentral Media
    • Cookie policy (ZA)
    • TechCentral – privacy and Popia

    Type above and press Enter to search. Press Esc to cancel.

    Manage consent

    TechCentral uses cookies to enhance its offerings. Consenting to these technologies allows us to serve you better. Not consenting or withdrawing consent may adversely affect certain features and functions of the website.

    Functional Always active
    The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
    Preferences
    The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
    Statistics
    The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
    Marketing
    The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
    • Manage options
    • Manage services
    • Manage {vendor_count} vendors
    • Read more about these purposes
    View preferences
    • {title}
    • {title}
    • {title}