TechCentral


    New claims in Meta fight over copyrighted books used in AI

    Lawyers had warned Meta about the legal perils of using pirated books to train its AI models, according to a new filing.
    By Katie Paul, 13 December 2023

    Meta Platforms’ lawyers had warned it about the legal perils of using thousands of pirated books to train its AI models, but the company did it anyway, according to a new filing in a copyright infringement lawsuit initially brought earlier this year.

    The new filing, made late on Monday night, consolidates two lawsuits brought against the Facebook and Instagram owner by comedian Sarah Silverman, Pulitzer Prize winner Michael Chabon and other prominent authors, who allege that Meta has used their works without permission to train its artificial intelligence language model, Llama.

    A California judge last month dismissed part of the Silverman lawsuit and indicated that he would give the authors permission to amend their claims. Meta did not immediately respond to a request for comment on the allegations.

    The new complaint includes chat logs of a Meta-affiliated researcher discussing procurement of the dataset in a Discord server, a potentially significant piece of evidence indicating that Meta was aware that its use of the books may not be protected by US copyright law.

    In the chat logs quoted in the complaint, researcher Tim Dettmers describes his back-and-forth with Meta’s legal department over whether use of the book files as training data would be “legally okay”.

    “At Facebook, there are a lot of people interested in working with The Pile, including myself, but in its current form, we are unable to use it for legal reasons,” Dettmers wrote in 2021, referring to a dataset Meta has acknowledged using to train its first version of Llama, according to the complaint.

    ‘Active copyrights’

    The month prior, Dettmers wrote that Meta’s lawyers had told him “the data cannot be used or models cannot be published if they are trained on that data”, the complaint said.

    While Dettmers does not describe the lawyers’ concerns, his counterparts in the chat identify “books with active copyrights” as the biggest likely source of worry. They say training on the data should “fall under fair use”, a US legal doctrine that protects certain unlicensed uses of copyrighted works.

    Dettmers, a doctoral student at the University of Washington, said he was not immediately able to comment on the claims.

    Tech companies have been facing a slew of lawsuits this year from content creators who accuse them of ripping off copyright-protected works to build generative AI models that have created a global sensation and spurred a frenzy of investment.

    If successful, those cases could dampen the generative AI craze, as they could raise the cost of building the data-hungry models by compelling AI companies to compensate artists, authors and other content creators for the use of their works.

    At the same time, new provisional rules in Europe regulating artificial intelligence could force companies to disclose the data they use to train their models, potentially exposing them to more legal risk.

    Meta released a first version of its Llama large language model in February and published a list of datasets used for training, including “the Books3 section of The Pile”. The person who assembled that dataset has said elsewhere that it contains 196 640 books, according to the complaint.

    The company did not disclose training data for its latest version of the model, Llama 2, which it made available for commercial use this northern hemisphere summer.

    Llama 2 is free to use for companies with fewer than 700 million monthly active users. Its release was seen in the tech sector as a potential game-changer in the market for generative AI software, threatening to upend the dominance of players like OpenAI and Google that charge for use of their models.

    (c) 2023 NewsCentral Media
