OpenAI In Trouble: US Comedian, TV Writers, and Authors Sue AI Bot Makers for Content Theft


Comedian Sarah Silverman and authors Christopher Golden and Richard Kadrey have taken legal action against OpenAI and Meta, accusing the companies of training their AI models, ChatGPT and LLaMA respectively, on unlawfully obtained datasets that contained the plaintiffs’ work. The lawsuits, filed in a US District Court, allege copyright infringement against both companies.

The complaint alleges that the datasets used to train the AI models included content obtained from websites notorious for sharing books through torrent systems, such as Bibliotik, Library Genesis, and Z-Library. The plaintiffs claim that their works were included in these datasets without their consent.

Silverman, Golden, and Kadrey assert that the AI models, when prompted, provide summaries of their books, thereby violating their copyrights. The exhibits presented as evidence include specific examples of ChatGPT summarizing Silverman’s The Bedwetter, Golden’s Ararat, and Kadrey’s Sandman Slim. Notably, the chatbot fails to include any of the copyright information originally included with the published works.

Meta is targeted in a separate lawsuit over its LLaMA models, a series of open-source AI models unveiled in February 2023. The authors assert that their books were included in the datasets used to train LLaMA, and they allege that these datasets were acquired unlawfully.

The complaint against Meta highlights the company’s own documentation on LLaMA, which acknowledges the sources of its training datasets, including one called The Pile, compiled by EleutherAI. The lawsuit argues that The Pile was created using the content of the Bibliotik private tracker, among others. The authors claim that Bibliotik and similar “shadow libraries” are overtly illegal.


In both lawsuits, the authors stress that they never granted permission for their copyrighted books to be used as training material for these AI models. The allegations include multiple counts of copyright infringement, negligence, unjust enrichment, and unfair competition. The plaintiffs are seeking statutory damages, restitution of profits, and other appropriate remedies.

Joseph Saveri and Matthew Butterick, the authors’ legal representatives, state on their LLMlitigation website that they have been contacted by other writers, authors, and publishers concerned about ChatGPT’s ability to generate text similar to copyrighted materials, potentially encompassing thousands of books.

Saveri and Butterick have also brought legal action against AI companies on behalf of programmers and artists, and they represent authors Mona Awad and Paul Tremblay in a similar case over OpenAI’s ChatGPT. In a separate ongoing case, Getty Images has filed a lawsuit alleging that Stability AI, the creator of the AI image generation tool Stable Diffusion, trained its model using millions of copyrighted images.

The lawsuits serve as a call to address the concerns of content creators regarding the unauthorized use and potential infringement of their copyrighted works by AI models. These developments signal the need for increased oversight and regulations within the AI industry to protect intellectual property rights and ensure fair compensation for creators.

Frequently Asked Questions (FAQs) Related to the Above News

Who has filed lawsuits against OpenAI and Meta for content theft?

Comedian Sarah Silverman, along with authors Christopher Golden and Richard Kadrey, has taken legal action against OpenAI and Meta.

What are the allegations against OpenAI and Meta?

The lawsuits accuse OpenAI and Meta of stealing the plaintiffs' work and training AI models, such as ChatGPT and LLaMA, with unlawfully obtained datasets.

What datasets were allegedly used without consent?

The datasets used to train the AI models are claimed to include content obtained from websites known for sharing books through torrent systems, such as Bibliotik, Library Genesis, and Z-Library.

How do the AI models violate copyrights?

The AI models, when prompted, provide summaries of the plaintiffs' books, which is seen as a violation of their copyrights. The summaries do not include the original copyright information.

What specific examples are presented as evidence?

The exhibits include ChatGPT summarizing Sarah Silverman's The Bedwetter, Christopher Golden's Ararat, and Richard Kadrey's Sandman Slim.

What is the separate lawsuit against Meta about?

Meta is being targeted in a separate lawsuit regarding its LLaMA models. The authors claim that their books were included in the datasets used to train LLaMA, and they allege that these datasets were obtained unlawfully.

What documentation supports the allegations against Meta?

The plaintiffs point to Meta's own documentation on LLaMA, which acknowledges the sources of its training datasets, including a dataset called The Pile, allegedly compiled using content from Bibliotik and other similar sources.

What legal claims are made in these lawsuits?

The lawsuits include claims of copyright infringement, negligence, unjust enrichment, and unfair competition. The plaintiffs are seeking statutory damages, restitution of profits, and other appropriate remedies.

Are there other similar cases or complaints?

Yes, the legal representatives for the authors say they have been contacted by other writers, authors, and publishers who are concerned about AI models generating text similar to copyrighted materials.

What does this situation highlight?

These lawsuits highlight the need to address the concerns of content creators regarding the unauthorized use and potential infringement of their copyrighted works by AI models. They also underscore the necessity for increased oversight and regulations within the AI industry to safeguard intellectual property rights and ensure fair compensation for creators.


Aryan Sharma
