According to Reuters, a group of writers onArtificial Intelligence Companies Anthropic filed a lawsuit accusing the company of using pirated books to train its AI Models.
The class action lawsuit was filed Monday in a California court, according to reportsThe plaintiffs claim Anthropic "has built a multi-billion dollar business by stealing hundreds of thousands of copyrighted books."
In their lawsuit, the authors say Anthropic used a massive open-source dataset called "The Pile" to train its Claude series of AI chatbots.This dataset contains a section called Books3, which is a huge library of pirated ebooksThe Pile, which includes works by Stephen King, Michael Pollan, and thousands of other authors. Earlier this month, Anthropic confirmed to Vox that it used The Pile to train Claude.
"It is clear that Anthropic downloaded and reproduced copies of The Pile and Books3, knowing that these datasets contained a significant amount of copyrighted content from pirate sites such as Bibiliotik," the lawsuit reads. The authors want the court to approve their class action lawsuit and require Anthropic to pay the proposed damages and prohibit the company from using the copyrighted material in the future.
Last year former Arkansas Governor Mike Huckabee and other authors filed a similar lawsuit against Meta, Microsoft, and EleutherAI, the nonprofit organization behind The Pile, alleging that they misappropriated their work to train AI models.George R.R. Martin, Jodi Picoult, Michael Chabon, and others have also sued OpenAI for allegedly using their copyrighted content. George R.R. Martin, Jodi Picoult, Michael Chabon and several other authors have also sued OpenAI over the company's alleged use of their copyrighted content.