Chomsky and the Two Cultures of Statistical Learning

Back in 2011 at MIT, Noam Chomsky threw shade at purely statistical language models, basically saying, “Congrats, you copied the vibes, not the meaning.” Today, that old fight just got reheated—hard. Peter Norvig’s classic response, “On Chomsky and the Two Cultures of Statistical Learning”, is making the rounds again, and the comments are pure fireworks.

On one side: the “causality or bust” crowd, insisting that curve‑fitting isn’t science. One top comment gripes that Norvig “confuses the map for the territory,” demanding real explanations over clever predictions. Another asks if this is just a statistics turf war—Bayes vs. frequentist—like it’s Team Red vs. Team Blue for math nerds.

On the other side: the “LLMs made it real” squad, gleefully resurfacing a Chomsky line from 1969 dismissing the “probability of a sentence” as useless. With today’s chatbots and auto‑translate everywhere, they’re calling that take “aged terribly.” Cue the memes: “OK boomer linguistics,” “butterfly collecting vs. bug‑squashing,” and screenshots of chatbots writing passable essays.

There’s also meta‑drama: folks asking “Is this from 2011?” like they stumbled into a time capsule, and one commenter tries to inject unrelated personal scandals—swiftly flagged as off‑topic. Beneath the snark, the fight is real: Is building useful systems enough, or must we explain language like a scientist? Norvig says both. The crowd can’t agree—and that’s why they won’t stop clicking.

December 20, 2025

Grammar vs. Gigabytes

Internet brawl: Did Chomsky call it—or did the internet pass him by

Key Points

Hottest takes

December 20, 2025

Grammar vs. Gigabytes

Chomsky and the Two Cultures of Statistical Learning

Internet brawl: Did Chomsky call it—or did the internet pass him by

Key Points

Hottest takes

Save News