Reddit sues AI firm for allegedly ‘scraping’ comments to train chatbot
By Matt O'Brien
FILE - Reddit Inc. signage is seen on the New York Stock Exchange trading floor, prior to the Reddit IPO, Thursday, March 21, 2024. (AP Photo/Yuki Iwamura, File)
Social media platform Reddit has sued the artificial intelligence company Anthropic, alleging that it is illegally “scraping” the comments of Reddit users to train its chatbot Claude.
Reddit claims that Anthropic has used automated bots to access Reddit’s content despite being asked not to do so, and “intentionally trained on the personal data of Reddit users without ever requesting their consent.”
Anthropic didn’t immediately return a request for comment Wednesday. Reddit filed the lawsuit Wednesday in California Superior Court in San Francisco, where both companies are based.
“AI companies should not be allowed to scrape information and content from people without clear limitations on how they can use that data,” said Ben Lee, Reddit’s chief legal officer, in a statement Wednesday.
Those agreements “enable us to enforce meaningful protections for our users, including the right to delete your content, user privacy protections, and preventing users from being spammed using this content,” Lee said.
Anthropic was formed by former OpenAI executives in 2021 and its flagship Claude chatbot remains a key competitor to OpenAI’s ChatGPT. Much like other AI companies, it’s relied heavily on websites such as Wikipedia and Reddit, which are rich sources of written material for teaching an AI assistant the patterns of human language.
In a 2021 paper co-authored by Anthropic CEO Dario Amodei — cited in the lawsuit — researchers at the company identified the subreddits, or subject-matter forums, that contained the highest quality data, such as those focused on gardening, history or thoughts people have in the shower.
Anthropic in 2023 argued in a letter to the U.S. Copyright Office that the “way Claude was trained qualifies as a quintessentially lawful use of materials,” by making copies of information to perform a statistical analysis of a large body of data.
But Reddit’s lawsuit is different from others brought against AI companies because it doesn’t allege copyright infringement. Instead, it focuses on the alleged breach of Reddit’s terms of use and the unfair competition that Reddit says resulted.