Language Models Are Injective and Hence Invertible

Researchers say AI can be 'rewound'—commenters cry hype, nitpick, and make jokes

TLDR: Researchers claim a language model's inner states uniquely determine the exact input, and they even show an algorithm to reconstruct it. The comments split between excitement and eye-rolls, arguing the title is misleading, the tests are shaky, and the real story is privacy and semantics.

A new paper claims modern language models, the engines behind chatbots, are "injective," a math-y way of saying their internal representations trace back uniquely to the exact input. Translation: in theory, you can reconstruct what you typed from the model's hidden gears. The authors even ship an algorithm, SipIt, that promises to rebuild the original text efficiently, and report billions of "collision tests" (checks that different inputs never land on the same internal state) with zero collisions found.

Cue the crowd getting spicy. One camp cheers the transparency angle; another yells "marketing!" fxwin grumbles that the title misleads, since real users see tokens (words), not distributions (probabilities), so "invertible" doesn't mean what people think. kobelb asks the blunt question: what's actually invertible, the weights or the prompts? sigmoid10 questions the testing itself, calling the collision threshold "arbitrary" and implying "no collisions" could just be a statistical lucky break. Meanwhile, fatherrhyme goes full meme: "Congrats on proving the sky is blue," warning that stateful systems revealing state isn't shocking, it's a privacy hazard. There's even laughter at the author contribution note, and a tweet gets dragged for semantics. Neuroscience fans wonder if this helps crack brain codes; skeptics say it mostly rebrands old worries with new math. Dramatic? Absolutely.
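If "collision test" sounds abstract, here is a toy sketch of the idea (ours, not the paper's code): treat the model's hidden state as a function of the token sequence, feed it many different sequences, and check that no two sequences ever produce the same state. We use a hash digest as a hypothetical stand-in for a real transformer's activations; the `hidden_state` name and the tiny vocabulary are our own inventions for illustration.

```python
import hashlib
from itertools import product

def hidden_state(tokens):
    # Hypothetical stand-in for a transformer's hidden state:
    # a deterministic digest of the exact token sequence.
    h = hashlib.sha256()
    for t in tokens:
        # Separator byte so ("ab",) and ("a", "b") hash differently.
        h.update(t.encode() + b"\x00")
    return h.hexdigest()

# Exhaustively test every length-3 sequence over a tiny vocabulary.
vocab = ["the", "cat", "sat", "mat"]
seen = {}
collisions = 0
for seq in product(vocab, repeat=3):
    state = hidden_state(seq)
    if state in seen and seen[state] != seq:
        collisions += 1  # two different inputs, same internal state
    seen[state] = seq

print(collisions)  # → 0: injective on this toy domain
```

The paper's claim is that real transformer layers behave like this injective map; the skeptics' point is that a finite batch of tests, however large, only samples the input space rather than proving it.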

Key Points

  • The paper proves transformer language models are injective when mapping discrete input sequences to continuous representations.
  • Injectivity is established at initialization and preserved throughout training.
  • Empirical validation includes billions of collision tests across six state-of-the-art language models, with no collisions observed.
  • The authors introduce SipIt, an algorithm that reconstructs exact input text from hidden activations with linear-time guarantees.
  • The findings have implications for transparency, interpretability, and safe deployment of language models.
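SipIt's core idea, per the summary above, is recovering the input one token at a time: at each position, test candidate tokens and keep the one that reproduces the observed hidden activation, giving a number of model calls linear in sequence length (times vocabulary size). A miniature sketch, where a hash digest stands in for real activations and every name (`hidden_state`, `sipit_sketch`, the vocabulary) is a hypothetical of ours, not the paper's implementation:

```python
import hashlib

def hidden_state(tokens):
    # Hypothetical injective stand-in for the model's hidden state
    # after reading a prefix of tokens.
    h = hashlib.sha256()
    for t in tokens:
        h.update(t.encode() + b"\x00")
    return h.hexdigest()

def sipit_sketch(prefix_states, vocab):
    """Recover the sequence left to right: at step i, try every vocab
    token and keep the one whose prefix state matches the observation."""
    recovered = []
    for target in prefix_states:
        for tok in vocab:
            if hidden_state(recovered + [tok]) == target:
                recovered.append(tok)
                break
    return recovered

vocab = ["the", "cat", "sat", "on", "mat"]
secret = ["the", "cat", "sat"]
# What an observer with access to per-position activations would see:
states = [hidden_state(secret[:i + 1]) for i in range(len(secret))]

print(sipit_sketch(states, vocab))  # → ['the', 'cat', 'sat']
```

Injectivity is what makes the inner loop unambiguous: if two different tokens could yield the same prefix state, the greedy match could pick the wrong one and the reconstruction guarantee would evaporate.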

Hottest takes

"I don't like the title of this paper" — fxwin
"This sounds like a mistake." — sigmoid10
"Why not just write a paper 'The sky may usually be blue'" — fatherrhyme
Made with <3 by @siedrix and @shesho from CDMX. Powered by Forge&Hive.