December 2, 2025
One file, many feelings
Show HN: Marmot – Single-binary data catalog (no Kafka, no Elasticsearch)
HN cheers 'one-file data finder' while skeptics demand integrations
TLDR: Marmot launches a one-file, open-source tool to quickly find company data. The crowd loved the simplicity but debated missing integrations and how it compares to Amundsen and OpenMetadata, fueling a fresh round of the “simple vs full-featured” data tools showdown.
Hacker News lit up as Marmot promised “find any company data in seconds” with a single, easy-to-deploy file—no heavy add‑ons like Kafka or Elasticsearch. For the non-nerds: a “data catalog” is basically Google for your organization’s scattered data. Marmot’s pitch is simple: one file, a clean interface, and fast search across tables, message queues, buckets, and APIs, plus clickable maps that show how data moves from point A to B.
The crowd quickly split. Fans swooned over the minimal setup—one commenter cheered, “That’s it!”—and another dreamed up a plugin to rope in their SQLite and D1 data. The vibes got meme-y fast: “one file to rule them all” jokes and marmot puns burrowed through the thread. But the skeptics showed up with receipts. How’s this different from Amundsen? And why not choose the heavyweight champ OpenMetadata? One team said they picked OpenMetadata precisely because of ready-made integrations with Airflow (a workflow tool) and dashboards like Tableau and Power BI.
The spiciest debate? Simplicity vs. ecosystem. Marmot’s no-frills approach charmed the “ship it” crowd, while others asked when a lightweight catalog is enough—and when you need the big, integrated platforms. One commenter even challenged the basics: when should you reach for a catalog instead of a data warehouse or data lake? Translation: the feature checklist fight has officially begun.
Key Points
- •Marmot is an open-source data catalog focused on simplicity and speed, shipping as a single binary with an intuitive UI.
- •It catalogs diverse assets (tables, topics, queues, buckets, APIs, pipelines) across an organization’s data stack.
- •Search combines full-text with structured queries, metadata filters, boolean logic, and comparison operators.
- •Interactive lineage visualization provides dependency graphs to trace data flows and assess impact.
- •Integrations include CLI, REST API, Terraform, and Pulumi; Marmot is PostgreSQL-backed and licensed under MIT.