Academic publishing giant Elsevier, along with other publishers and authors, has filed a lawsuit against Meta, accusing the company of illegally scraping and using copyrighted research papers to train its Llama large language models. The plaintiffs allege that Meta obtained these papers through the Common Crawl dataset and, more directly, through notorious piracy platforms like LibGen and Sci-Hub. Meta is defending its actions by invoking the 'fair use' doctrine, arguing that using copyrighted material for AI training constitutes transformative use and promotes innovation, citing a previous court ruling that allowed Anthropic to use purchased books for AI training. AI
Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →
IMPACT This lawsuit could set a crucial legal precedent for AI training data, potentially impacting how models are developed and the cost of accessing copyrighted materials.
RANK_REASON Major academic publisher Elsevier initiates a lawsuit against Meta, alleging copyright infringement in AI model training, marking a significant legal battle over data usage.