AIs can generate near-verbatim copies of novels from training data

Internet erupts: is your chatbot a sneaky e‑book pirate or just predictable?

TLDR: A new study shows top AI chatbots can reproduce large chunks of famous novels, challenging claims that they don’t store texts. Commenters split between “obvious and overblown” and “copyright and privacy nightmare,” with jailbreak caveats and search‑engine comparisons stoking the fight over what counts as copying.

The internet’s book club turned courtroom today after a new Stanford/Yale study showed big-name chatbots can cough up near-verbatim chunks of bestselling novels when nudged just right. Gemini reportedly reproduced 76.8% of Harry Potter and the Philosopher’s Stone, Grok hit 70.3%, and researchers even jailbroke Claude into spilling almost entire books. Cue chaos: half the comment section yelled “no duh,” arguing that if a bot learned from books, of course it can finish their sentences. The other half screamed copyright alarm, asking whether this turns your friendly assistant into a stealth e‑book machine.

The hottest fight is over intent: some argue it’s like asking a trivia whiz to recall a passage, not theft. Others say if it can spit out pages on command, that’s not “transformative”—that’s copying, with real legal and privacy fallout. One camp shrugged it off as a “nothing burger,” noting the models needed carefully crafted prompts or even jailbreaks to behave badly. Another called the “AI-as-library” analogy cute but misleading—libraries have licenses; LLMs just ate the shelves.

Humor broke through the tension with memes about wizard bots “summoning Harry Potter paragraphs,” and quips comparing LLMs to search engines on steroids. Legal vibes were heavy—people warned the fair‑use defense might be wobbling. Expect more lawsuits, more guardrails, and a lot more drama every time someone types “continue this sentence…”

Key Points

  • Studies show LLMs from leading AI firms can reproduce near-verbatim text from copyrighted books when strategically prompted.
  • Gemini 2.5 regurgitated 76.8% of Harry Potter and the Philosopher’s Stone, and Grok 3 generated 70.3% of it via text-continuation requests.
  • Claude 3.7 Sonnet was jailbroken to extract almost entire novels near-verbatim, revealing guardrail limitations.
  • Prior research found open models like Meta’s Llama memorize large portions of specific books.
  • Findings challenge industry claims that models do not store copies of training data and raise legal and privacy concerns.

Hottest takes

"This feels like a 'no shit' moment" — bena
"This seems like a total nothing burger" — rowanG077
"You can also do this with most search engines" — xnx
Made with <3 by @siedrix and @shesho from CDMX. Powered by Forge&Hive.