Authors sue Anthropic for training AI using pirated books

A group of authors has sued Anthropic, accusing it of training its models on pirated books, as reported by Reuters. The proposed class action lawsuit was filed in a California court on Monday and alleges Anthropic “built a multibillion-dollar business by stealing hundreds of thousands of copyrighted books.”

In the lawsuit, the authors say that Anthropic used a sprawling, open-source dataset known as “The Pile” to train its family of Claude AI chatbots. Within this dataset is something called Books3, a massive library of pirated ebooks that includes works from Stephen King, Michael Pollan, and thousands of other authors. Earlier this month, Anthropic confirmed to Vox that it used The Pile to train Claude.

“It is apparent that Anthropic...



source https://www.theverge.com/2024/8/20/24224450/anthropic-copyright-lawsuit-pirated-books-ai
