May 2, 2026
Extra! Extra! Search all about it
Show HN: Large Scale Article Extract of Newspapers 1730s-1960s
AI digs up 250 years of old news, and commenters instantly demand receipts, demos, and less mystery
TLDR: A new site says it has organized 6 million old American newspaper stories into a searchable history engine, and that’s a big deal because much of this material is hard to find online. Commenters were impressed but pushed hard for clearer demos and easier access, prompting the creator to open part of it for free.
A new project called snewpapers is promising something catnip-level irresistible for history nerds: 6 million newspaper stories from the 1730s to the 1960s, pulled out and organized so people can search by idea, not just exact words. Translation for normal humans: instead of hunting one dusty phrase, you can look for bigger themes like events, scandals, or social trends across centuries of American papers. The pitch is huge. The community reaction? Equal parts wow, curiosity, and “okay, but show me how this actually works.”
That tension became the real show. One early commenter basically gave the project the gold star for ambition, calling it “awesome and impressive,” but then immediately hit the brakes: the tool may be powerful, yet the site doesn’t make it obvious how regular people would use it. Ouch—but constructive. In a very live-demo moment, the creator jumped into the thread with example links, then went full public-service mode and opened a free “Today in History” page to everyone after the feedback. That’s the kind of comment-section plot twist people love.
Meanwhile, fellow data tinkerers piled on with battle scars. One said the real nightmare with old newspapers is page layout—giant headlines, weird columns, stacked text, pure chaos. Another asked the dreaded question: if this works on newspapers, what about magazines, where text splashes across pictures and everything gets messy? The vibe wasn’t hostile; it was more like a room full of fascinated skeptics yelling, “Cool! Now survive our follow-up questions.”
Key Points
- •The archive contains 6 million newspaper stories and is described as growing daily.
- •It spans roughly 250 years of American history, from the 1730s to the 1960s.
- •The stories were extracted and organized using AI.
- •The platform emphasizes semantic search, allowing users to search by meaning rather than only by keywords.
- •Users can refine searches with 24 categories, more than 1,000 sub-categories, and state and date filters.