Anthropic accused of using pirated books to train AI, authors file class action lawsuit

According to Reuters, a group of writers onArtificial Intelligence Companies Anthropic filed a lawsuit accusing the company of using pirated books to train its AI Models.

Anthropic accused of using pirated books to train AI, authors file class action lawsuit

The class action lawsuit was filed Monday in a California court, according to reportsThe plaintiffs claim Anthropic "has built a multi-billion dollar business by stealing hundreds of thousands of copyrighted books."

In their lawsuit, the authors say Anthropic used a massive open-source dataset called "The Pile" to train its Claude series of AI chatbots.This dataset contains a section called Books3, which is a huge library of pirated ebooksThe Pile, which includes works by Stephen King, Michael Pollan, and thousands of other authors. Earlier this month, Anthropic confirmed to Vox that it used The Pile to train Claude.

"It is clear that Anthropic downloaded and reproduced copies of The Pile and Books3, knowing that these datasets contained a significant amount of copyrighted content from pirate sites such as Bibiliotik," the lawsuit reads. The authors want the court to approve their class action lawsuit and require Anthropic to pay the proposed damages and prohibit the company from using the copyrighted material in the future.

Last year former Arkansas Governor Mike Huckabee and other authors filed a similar lawsuit against Meta, Microsoft, and EleutherAI, the nonprofit organization behind The Pile, alleging that they misappropriated their work to train AI models.George R.R. Martin, Jodi Picoult, Michael Chabon, and others have also sued OpenAI for allegedly using their copyrighted content. George R.R. Martin, Jodi Picoult, Michael Chabon and several other authors have also sued OpenAI over the company's alleged use of their copyrighted content.

statement:The content is collected from various media platforms such as public websites. If the included content infringes on your rights, please contact us by email and we will deal with it as soon as possible.
Information

Google opens HeAR AI model API: 100 million cough sound training to assist in screening, diagnosis and monitoring of tuberculosis

2024-8-22 9:14:16

Information

Baidu, SenseTime, and Zhipu are the top three. IDC releases the first report on the market share of large model platforms and applications

2024-8-22 9:16:33

Search