NYT vs. Microsoft: AI copyright suit will be 'complex'

The New York Times (NYT) is suing OpenAI and Microsoft (MSFT) for allegedly using millions of articles without permission to train their AI chatbots. MIT Initiative on the Digital Economy Director Sinan Aral and Newsroom Robots Podcast host Nikita Roy discuss the details of the copyright lawsuit and potential implications for the publishing industry with Yahoo Finance Live.

Roy says she was "completely expecting" this scenario, suggesting the Times is "helping" smaller publishers lacking resources to take legal action. However, she notes accusing chatbots of infringement is "complex," hinging on whether courts deem AI a tool or if liability falls on the user. Still, Roy stresses that "we are facing a very ethical issue" regarding how creators' work gets utilized.

"This is a debate about whether the companies training large language models on content from the web... is fair use or infringement of copyright," Aral states. If courts mandate payments to original data producers, costs could rise significantly for AI firms. However, Aral believes "this was expected," and the only question that remains is "where does the price point lie?"

For more expert insight and the latest market action, click here to watch this full episode of Yahoo Finance Live.

Video Transcript

- One, Nikita, were you surprised that the Times brought this lawsuit? And two, give us your thoughts on it. What did you make of it?

NIKITA ROY: Yeah. I was completely expecting this to happen, actually, for a long time. And I think the New York Times doing this is really helping a lot of the media organizations who probably don't have the ability to go out and take on these tech giants.

But the issue of copyright is really so complex, and it comes down to how the courts are going to define generative AI and specifically large language models, because who is liable in this case? Is it considered a tool, or is it the user? And the problem is that all of these tech companies, like OpenAI and Anthropic with Claude, are completely shifting the way they are thinking about these large language models.

So one of the things that I think is really important to take note of is that last month, OpenAI CEO Sam Altman said in his keynote speech that they would defend their customers and pay the costs incurred if they face legal claims around copyright. And I think that really shows how confident the tech companies are regarding their claims, to make sure that these are considered just as tools and to push that liability over to users. But at the end of the day, we are really facing a very ethical issue in terms of how we are going to be using people's work, taking it away, and then becoming a competitor in that space as well.

- And Sinan, I want to bring you into this, because as someone who both studies these issues and invests in startups that use AI, I'm curious. You know, when Nikita says that some of these companies are ready to face the costs, how substantial do you think those could potentially be, for both smaller startups and really big ones like OpenAI?

SINAN ARAL: Well, I mean, I think that the costs could be very large, Julie. It's great to see you. Happy Holidays. This is a debate about whether the companies training large language models on content from the web, The New York Times, and content from lots of other places is fair use or infringement of copyright.

And if the courts or a settlement determine that payments need to be made to the original producers of the training data, that could increase the costs for generative AI companies and generative AI startups, and that makes a big difference for how the industry runs.

However, this was expected, as Nikita said. So these costs in large part have been planned for, have been thought about, and it's not something that is brand new. This has been a debate that's been ongoing for months, if not the better part of a year. And this is just the first opening of the conversation.

And really the determination will be where does the price point lie? Is it going to be a very large transfer to those who are creating the content that the models are trained on, or is it going to be smaller? Is it going to be settled, or is it going to go to court and have judicial case law created about copyright infringement based on this particular case? We will see. However, this was inevitable. And therefore, I think a lot of these costs have already been thought about in the long run.

- But Sinan, let me just get your take, because tech companies have made this case before, right? And it seems like their argument is: this is publicly available information we're scraping from the public internet. It's just oceans of data, oceans of text. So it is fair use. Do you buy that argument?

SINAN ARAL: No, I don't buy that argument, and it depends on how you use it. So if I were to take New York Times articles, start a website, sinanaral.com, post New York Times articles on my website, and charge for them, that would be copyright infringement, even if I scraped them from the web. That is not considered fair use.

If I were, however, to copy a single New York Times article for one class at MIT for educational purposes, that would likely be considered fair use. And training very large language models on millions of pieces of content by The New York Times, as the lawsuit indicates, is a new use. In other words, it hasn't been considered in the past as a traditional use of copyrighted material. And therefore, we have not decided as a society whether this is fair use or not. And that's why this case is so important: it will decide, either through case law or through settlement, how much we believe content producers deserve for this type of use of their content, and that's what's new about this case.
