Fine-tuning an LLM to write docs like it's 1995

An AI learned to write like it’s 1995, and the comments instantly turned into a nerd family feud

TLDR: A writer trained a local AI to imitate 1990s software manuals using old archived docs, turning retro tech style into a modern experiment. Commenters were split between nostalgia, skepticism that style matters without real substance, and jokes that nobody reads documentation anymore anyway.

A writer set out to do something gloriously retro: teach a local artificial intelligence program to write software manuals like the glory days of beige computers and chunky Microsoft help files. To pull it off, they dug through Bitsavers, a giant archive of old computer manuals, cleaned up millions of words of scanned text, and trained the model on that vintage style. The result? A lovingly nerdy experiment in making modern machine writing sound like it came bundled with Windows on a stack of floppy disks.

But the real show started in the comments, where readers immediately split into camps. One side was charmed and nostalgic, basically saying: yes, bring back the era when manuals actually explained things. Another side was far less romantic, arguing that old-school style is the easy part and that real documentation needs deep understanding, not just a retro coat of paint. That kicked off the big debate: is this a clever creative experiment, or just cosplay for manuals?

Then came the spicy practical crowd. One commenter flatly questioned the whole “local first” dream, saying the biggest barrier isn’t philosophy, it’s the painful price of the hardware needed to run these tools at home. Another dropped the ultimate 2026 hot take: “Who is reading docs these days?” Apparently the new workflow is letting AI read the instructions so humans can vibe freely. And in peak internet fashion, one person responded with the classic challenge: cool demo, now do it without fine-tuning. Retro docs, modern snark, zero chill.

Key Points

  • The author ran an independent experiment to fine-tune a local instruct model to emulate the style of 1980s and 1990s software technical documentation.
  • The training corpus came from the Bitsavers Microsoft collection, which contains more than 37 million words of out-of-print manuals published between 1977 and 2005.
  • OCR text was cleaned with Python scripts and filtered further by using gemma-4-26b via OpenRouter to classify paragraphs as keep or drop, at a reported cost of about $8.
  • The processed corpus was split into roughly 512-token chunks and converted into 192,456 JSONL training examples paired with synthetic instructions.
  • The author argues that fine-tuning is more suitable than training from scratch or using RAG when the goal is to reproduce writing style rather than retrieve facts.

Hottest takes

"The trick about documentation is depth, not prose" — v1ne
"Who is reading docs these days?" — holoduke
"The big part is there’s significant sticker shock to buying capable hardware" — mock-possum
Made with <3 by @siedrix and @shesho from CDMX. Powered by Forge&Hive.