TechCentral


    Next-gen Nvidia AI chips coming to cloud platforms

    Nvidia has showcased the technology it hopes will fuel the next wave of AI breakthroughs.
    By Agency Staff, 22 March 2023
    Nvidia CEO Jensen Huang

    Riding the surge of hype around ChatGPT and other artificial intelligence products, Nvidia introduced new chips, supercomputing services and a raft of high-profile partnerships on Tuesday intended to showcase how its technology will fuel the next wave of AI breakthroughs.

    At the chip maker’s annual developer conference on Tuesday, CEO Jensen Huang positioned Nvidia as the engine behind “the iPhone moment of AI”, as he’s taken to calling this inflection point in computing. Spurred by a boom in consumer and enterprise applications, such as advanced chatbots and eye-popping graphics generators, “generative AI will reinvent nearly every industry”, Huang said.

    The idea is to build infrastructure that can make AI apps faster and more accessible to customers. Nvidia’s graphics processing units have become the brains behind ChatGPT and its ilk, helping them digest and process ever-greater sums of training data. Microsoft revealed last week it had to string together tens of thousands of Nvidia’s A100 GPUs in data centres in order to handle the computational workloads in the cloud for OpenAI, ChatGPT’s developer.


    Other tech giants are following suit with similarly colossal cloud infrastructures geared for AI. Oracle announced that its platform will feature 16 000 Nvidia H100 GPUs, the A100’s successor, for high-performance compute applications, and Nvidia said a forthcoming system from Amazon Web Services will be able to scale up to 20 000 interconnected H100s. Microsoft has likewise started adding the H100 to its server racks.

    These kinds of chip superclusters are part of a push by Nvidia to rent out supercomputing services through a new program called DGX Cloud, hosted by Oracle and soon Microsoft Azure and Google Cloud. Nvidia said the goal is to make accessing an AI supercomputer as easy as opening a webpage, enabling companies to train their models without the need for on-premises infrastructure that’s costly to install and manage.

    Pricing

    “Provide your job, point to your data set, and you hit go, and all of the orchestration and everything underneath is taken care of,” said Manuvir Das, Nvidia’s vice president of enterprise computing. The DGX Cloud service will start at US$37 000 (about R684 000) per instance per month, with each “instance” (essentially the amount of computing horsepower being rented) equating to eight H100 GPUs.
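
To put the quoted price in per-GPU terms, the arithmetic can be sketched roughly as follows. The monthly price, the rand equivalent and the eight-GPU instance size come from the figures above. The 30-day month of continuous use is an assumption made here for illustration, not something Nvidia has stated, and the function names are hypothetical.

```python
# Figures quoted in the article.
PRICE_USD_PER_INSTANCE_MONTH = 37_000
PRICE_ZAR_PER_INSTANCE_MONTH = 684_000
GPUS_PER_INSTANCE = 8

# Assumption (not from the article): a 30-day month of round-the-clock use.
HOURS_PER_MONTH = 30 * 24


def per_gpu_month(price_per_instance: float, gpus: int) -> float:
    """Monthly price attributable to a single GPU in an instance."""
    return price_per_instance / gpus


def per_gpu_hour(price_per_instance: float, gpus: int, hours: int) -> float:
    """Hourly price per GPU, given the assumed number of hours in a month."""
    return price_per_instance / (gpus * hours)


def implied_exchange_rate(zar: float, usd: float) -> float:
    """Rand-per-dollar rate implied by the two quoted prices."""
    return zar / usd


if __name__ == "__main__":
    usd_month = per_gpu_month(PRICE_USD_PER_INSTANCE_MONTH, GPUS_PER_INSTANCE)
    usd_hour = per_gpu_hour(
        PRICE_USD_PER_INSTANCE_MONTH, GPUS_PER_INSTANCE, HOURS_PER_MONTH
    )
    rate = implied_exchange_rate(
        PRICE_ZAR_PER_INSTANCE_MONTH, PRICE_USD_PER_INSTANCE_MONTH
    )
    print(f"US${usd_month:,.2f} per GPU per month")
    print(f"US${usd_hour:.2f} per GPU per hour (30-day month assumed)")
    print(f"Implied exchange rate: R{rate:.2f} to the dollar")
```

On these assumptions, the starting price works out to US$4 625 per GPU per month, or a little over US$6 per GPU-hour. Actual costs would depend on usage patterns and on any terms not covered here.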

    Nvidia also launched two new chips, one focused on enhancing AI video performance and the other an upgrade to the H100. The latter GPU is designed specifically to improve the deployment of large language models like those used by ChatGPT. Called the H100 NVL, it can perform 12 times faster when handling inferences — that is, how AI responds to real-life queries — compared to the prior generation of A100s at scale in data centres.


    Ian Buck, vice president of hyperscale and high-performance computing at Nvidia, said it will help “democratise ChatGPT use cases and bring that capability to every server and every cloud”.  — Austin Carr, (c) 2023 Bloomberg LP

    © 2009 - 2025 NewsCentral Media
