AI hypocrisy: OpenAI, Google and Anthropic won't let their data be used to train other AI models, but they use everyone else's content

Alistair Barr

Updated Fri, Jun 2, 2023, 2:35 PM3 min read

Sam Altman testifying before Congress in May 2023 — Samuel Altman, CEO of OpenAI, testifies before the Senate Judiciary Subcommittee on Privacy, Technology, and the Law May 16, 2023 in Washington, DC.Win McNamee/Getty Images

Microsoft-backed OpenAI, Google and Anthropic ban the use of their content to train other AI models.
However, these companies have been using other online content for their own model training.
Can Big Tech have it both ways? Reddit and others are trying to stop this.

In the new age of generative AI, big tech companies are following a "do as I say, not as I do" strategy when it comes to the use of online content.

Microsoft-backed OpenAI, along with Google, and Google-backed Anthropic have for years been using online content created by companies to train their generative AI models. This was done without asking for specific permission, and it's part of a brewing legal battle that will decide the future of the web and how copyright laws are applied in this new world.

The tech industry will likely argue that their approach is fair use. That has yet to be decided. However, these big tech companies won't let their own content be used to train other AI models. So why should they be allowed to do this to everyone else?

Take a look at the terms of service for Claude, Anthropic's AI assistant:

"You may not access or use the Services in the following ways, and if any of these restrictions are inconsistent with or ambiguous in relation to the Acceptable Use Policy, the Acceptable Use Policy controls: To develop any products or services that compete with our Services, including to develop or train any artificial intelligence or machine learning algorithms or models."

Here's an excerpt from the top of Google's generative AI terms of use:

"You may not use the Services to develop machine learning models or related technology."

And here's the relevant section from OpenAI's terms of use. This is the company behind ChatGPT.

"You may not... use output from the Services to develop models that compete with OpenAI."

These companies are not dumb, but they are hypocritical

These companies are not dumb. They know that quality content is vital for training new AI models. So it makes sense that they won't allow their output to be used this way.

But why would any other website or company let their content be freely used by these giant tech companies to train their models?

Insider asked OpenAI, Google and Anthropic for comment on Friday. At the time of publication, they had not responded.

Reddit and other companies say enough is enough

Other companies are just beginning to realize what's been happening, and they are not happy. Reddit, which has been used for years in AI model training, plans to start charging for access to its data.

"The Reddit corpus of data is really valuable. But we don't need to give all of that value to some of the largest companies in the world for free," said Steve Huffman, CEO of Reddit.

In April, Elon Musk accused Microsoft, the main backer of OpenAI, of illegally using Twitter's data to train AI models. "Lawsuit time," he tweeted.

"There is so much wrong w/ this premise I don't even know where to start," a Microsoft spokesman wrote in an email to Insider when asked for comment.

OpenAI's CEO Sam Altman is trying to be more thoughtful on this issue, by working on new AI models that respect copyright. "We're trying to work on new models where if an AI system is using your content, or if it's using your style, you get paid for that," he said recently, according to Axios.

Publishers, including Insider which produced this story, have a vested interest here. Some publishers, including News Corp., are already pushing tech companies to pay to use their content for training AI models.

The current way AI models are trained 'breaks' the web

One former Microsoft executive believes something is wrong here. Steven Sinofsky recently said the current way AI models are trained "breaks" the web.

"Crawling used to be allowed in exchange for clicks. But now the crawling simply trains a model and no value is ever delivered to the creator(s) / copyright holders," he tweeted. Insider asked him for comment, but he was traveling on Friday and couldn't respond.

Read the original article on Business Insider

California McDonald's Franchise Owner Says, 'The Focus Is On Survival' With 'Unprecedented' $20 Per Hour Minimum Wage Forcing Higher Prices
In response to California's new $20 minimum wage law, fast food franchises are being forced to rethink their business strategies to stay afloat. Scott Rodrick, who owns 18 McDonald's franchises in the state, is considering measures to manage the increased labor costs without resorting to layoffs, which he sees as a last resort. Don't Miss: 82% of Americans aren’t using this government secured 5% passive income stream, are you one of them? The average American couple has saved this much money for
Benzinga•21h ago
Sam Bankman-Fried Agrees to Help FTX Investors Go After Celeb Promoters
Sam Bankman-Fried has inked a settlement agreement with a group of FTX customers who have agreed to drop their class action lawsuit against him in exchange for his help going after celebrity promoters of the collapsed exchange.
CoinDesk•22h ago
I Was Incredibly Wrong About the Tesla Cybertruck
Unit sales appear considerably below my estimates.
Motley Fool•6h ago
The Bond Market Is Sounding Its Most Severe Alarm in Decades, and It Could Mean Trouble for the Stock Market
This bond market indicator has predicted past recessions with near-perfect accuracy since the mid-1960s, and it's sending Wall Street a warning right now.
Motley Fool•5h ago
Silicon Valley and Hollywood worlds collide as David Ellison bids for Paramount
David Ellison, 41, would not be the first rich guy to arrive in Hollywood with a fat bank account and dreams of making movies, though the son of billionaire Oracle founder Larry Ellison boasts the rarest of attributes for a budding media mogul: a Silicon Valley pedigree. In an industry where many get their start fetching coffee or moving props, Ellison spent summers writing computer code for his father's software company and getting insights on the movie business from Pixar Animation Studios co-founder Steve Jobs. Ellison is orchestrating a multi-step transaction that would result in the merger of his independent studio, Skydance Media, with Paramount.
Reuters•4h ago
Time to Pounce: 2 Phenomenal Ultra-High-Yield REITs That Haven't Been This Cheap in Years
The premier name among retail REITs, along with a 14%-yielding REIT that's returned $25 billion to its shareholders since going public, make for sensational buys right now.
Motley Fool•5h ago
Trump poised to clinch $1.3 billion social media company stock award
Donald Trump is set to secure on Tuesday a stock bonus worth $1.3 billion from the company that operates his social media app Truth Social, equivalent to about half the majority stake he already owns in it, thanks to the wild rally in its shares. The award will take the former U.S. President's overall stake in the company, Trump Media & Technology Group (TMTG), to $4.1 billion. While Trump has agreed not to sell any of his TMTG shares before September, the windfall represents a significant boost to his wealth, which Forbes pegs at $4.7 billion.
Reuters•4h ago
American Airlines (AAL) Q1 Earnings on the Horizon: Analysts' Insights on Key Performance Measures
Get a deeper insight into the potential performance of American Airlines (AAL) for the quarter ended March 2024 by going beyond Wall Street's top -and-bottom-line estimates and examining the estimates for some of its key metrics.
Zacks•1d ago
Former House Speaker Nancy Pelosi Can't Stop Buying the 1 Artificial Intelligence (AI) Stock Billionaires Have Been Eager to Sell
Though this highflying stock is making Nancy Pelosi and her venture capitalist husband richer, more than a half-dozen billionaires have sent it to the chopping block.
Motley Fool•1d ago
Forget Nvidia: Billionaires Are Selling It and Buying These 2 High-Octane Artificial Intelligence (AI) Growth Stocks Instead
Billionaire investors are ditching the "infrastructure backbone" of the artificial intelligence (AI) revolution in favor of two industry-leading, irreplaceable AI stocks.
Motley Fool•5h ago

News

Life

Entertainment

Finance

Sports

New on Yahoo

Yahoo Finance

AI hypocrisy: OpenAI, Google and Anthropic won't let their data be used to train other AI models, but they use everyone else's content

These companies are not dumb, but they are hypocritical

Reddit and other companies say enough is enough

The current way AI models are trained 'breaks' the web

Recommended Stories

California McDonald's Franchise Owner Says, 'The Focus Is On Survival' With 'Unprecedented' $20 Per Hour Minimum Wage Forcing Higher Prices

Sam Bankman-Fried Agrees to Help FTX Investors Go After Celeb Promoters

I Was Incredibly Wrong About the Tesla Cybertruck

The Bond Market Is Sounding Its Most Severe Alarm in Decades, and It Could Mean Trouble for the Stock Market

Silicon Valley and Hollywood worlds collide as David Ellison bids for Paramount

Time to Pounce: 2 Phenomenal Ultra-High-Yield REITs That Haven't Been This Cheap in Years

Trump poised to clinch $1.3 billion social media company stock award

American Airlines (AAL) Q1 Earnings on the Horizon: Analysts' Insights on Key Performance Measures

Former House Speaker Nancy Pelosi Can't Stop Buying the 1 Artificial Intelligence (AI) Stock Billionaires Have Been Eager to Sell

Forget Nvidia: Billionaires Are Selling It and Buying These 2 High-Octane Artificial Intelligence (AI) Growth Stocks Instead