Nvidia’s Next-Gen AI Chips Are Coming to AWS and Google Cloud

(Bloomberg) -- Riding the surge of hype around ChatGPT and other artificial intelligence products, Nvidia Corp. introduced new chips, supercomputing services and a raft of high-profile partnerships on Tuesday, all intended to showcase how its technology will fuel the next wave of AI breakthroughs.

At the chipmaker’s annual developer conference, Chief Executive Officer Jensen Huang positioned Nvidia as the engine behind “the iPhone moment of AI,” as he’s taken to calling this inflection point in computing. Pointing to a boom in consumer and enterprise applications, such as advanced chatbots and eye-popping graphics generators, Huang said “generative AI will reinvent nearly every industry.”

The idea is to build infrastructure that makes AI apps faster and more accessible to customers. Nvidia’s graphics processing units have become the brains behind ChatGPT and its ilk, helping them digest and process ever-greater volumes of training data. Microsoft Corp. revealed last week that it had to string together tens of thousands of Nvidia’s A100 GPUs in data centers to handle the cloud computing workload for OpenAI, ChatGPT’s developer.

Other tech giants are following suit with similarly colossal cloud infrastructure geared for AI. Oracle Corp. announced that its platform will feature 16,000 Nvidia H100 GPUs, the A100’s successor, for high-performance computing applications, and Nvidia said a forthcoming system from Amazon Web Services will be able to scale up to 20,000 interconnected H100s. Microsoft has likewise started adding the H100 to its server racks.

These kinds of chip superclusters are part of a push by Nvidia to rent out supercomputing services through a new program called DGX Cloud, hosted by Oracle and soon Microsoft Azure and Google Cloud. Nvidia said the goal is to make accessing an AI supercomputer as easy as opening a webpage, enabling companies to train their models without on-premises infrastructure that’s costly to install and manage.

“Provide your job, point to your data set, and you hit go — and all of the orchestration and everything underneath is taken care of,” said Manuvir Das, Nvidia’s vice president of enterprise computing. The DGX Cloud service will start at $36,999 per instance per month, with each “instance” (essentially the amount of computing horsepower being rented) equating to eight H100 GPUs.
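Those figures imply a simple per-GPU rate, which is worth working out. The breakdown below is our arithmetic based only on the numbers quoted above, not a price Nvidia publishes:

```python
# DGX Cloud pricing figures as reported in the article.
MONTHLY_PRICE_USD = 36_999   # one DGX Cloud instance per month
GPUS_PER_INSTANCE = 8        # each instance bundles eight H100 GPUs

# Derived, illustrative rates (our arithmetic, not Nvidia's published pricing).
per_gpu_month = MONTHLY_PRICE_USD / GPUS_PER_INSTANCE
per_gpu_hour = per_gpu_month / (30 * 24)  # assuming a 30-day month

print(f"Per H100 per month: ${per_gpu_month:,.2f}")  # ~$4,624.88
print(f"Per H100 per hour:  ${per_gpu_hour:,.2f}")   # ~$6.42
```

By that rough math, a single rented H100 works out to about $4,625 a month, which gives a sense of why renting capacity by the instance can look attractive next to installing and managing the hardware on-premises.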

Nvidia also launched two new chips, one focused on enhancing AI video performance and the other an upgrade to the H100.

The latter GPU is designed specifically to improve the deployment of large language models like those used by ChatGPT. Called the H100 NVL, it can run inference (that is, the stage in which a trained model responds to real-world queries) up to 12 times faster than the prior-generation A100 at scale in data centers.
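To make the term concrete: inference is the serving side of machine learning, where an already-trained model answers new queries, as opposed to training, where the model learns from data. Here is a minimal, illustrative sketch; the article names no software stack, so the open-source Hugging Face transformers library and the small GPT-2 model below are stand-ins for the far larger systems served on H100-class hardware:

```python
# Illustrative only: the article specifies no software, so transformers and
# GPT-2 stand in for the much larger models deployed on H100-class GPUs.
from transformers import pipeline

# Load a small pretrained language model for text generation.
generator = pipeline("text-generation", model="gpt2")

# Inference: the trained model responds to a real-world query.
result = generator("The next wave of AI breakthroughs will", max_new_tokens=40)
print(result[0]["generated_text"])
```

At data-center scale it is this serving step, repeated across millions of user queries, that the H100 NVL is built to accelerate.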

Ian Buck, vice president of hyperscale and high-performance computing at Nvidia, said it will help “democratize ChatGPT use cases and bring that capability to every server and every cloud.”

©2023 Bloomberg L.P.
