Running local models is good now

Your laptop might finally do AI at home — if you can afford the fancy Mac

TLDR: A developer says home-run AI on personal computers is finally good enough for real work, especially coding and writing help. Commenters are split between excitement that pricey online AI may face real competition and a very loud joke that this "cheap" future still seems to require an expensive Mac.

The big claim in this post is simple: running artificial intelligence models on your own computer has gone from clunky science project to actually useful. The writer says that on a powerful Mac, these tools can now help with coding, proofreading, tests, and basic project setup without constantly needing to phone home to a paid online service. In plain English: the "do it yourself" AI crowd thinks the home setup is finally growing up.

But the comments? That’s where the real fireworks are. The instant crowd favorite was a brutally relatable joke: "Just get a 64GB Mac with 1TB of storage!" Cue the eye-rolls from everyone whose budget does not, in fact, include a luxury laptop. That became the thread’s unofficial meme — local AI is amazing, sure, as long as your wallet also runs frontier hardware.

Still, plenty of people were genuinely hyped. One commenter said moving off Amazon’s cloud and onto self-hosted tools was a budget win, and marveled that text, images, audio, and even video can now run locally. Another dropped the spiciest industry take of the thread: companies charging huge monthly fees for online AI should be nervous, because buyers will start comparing subscription bills to the cost of just buying a machine outright.

Not everyone is ready to declare victory. Skeptics pushed back, asking for proof, speed numbers, and actual code results. Their complaint was blunt: local models still hit weird bumps, make mistakes, and can slow work down. So the mood is clear: huge progress, big excitement, but the comments are not letting the hype off easy.

Key Points

  • The article says the author tested multiple local models, including Mistral 7B, OpenAI OSS-20B, Qwen 3 MOE, and Qwen 2.5 Coder, on a 2022 M2 Mac with 64 GB RAM and 1 TB storage.
  • It states that early local models were slower, harder to use, and less accurate for programming tasks, but the author saw a notable improvement starting with GPT-OSS.
  • The author says Google’s Gemma 4 family made local agentic coding feasible for their workflow, with gemma-4-26b-a4b used as the default model in LM Studio.
  • Example tasks completed locally included Python refactoring, type-hint linting, proofreading, unit-test generation, and bootstrapping a two-tower recommendation repository.
  • The article describes a setup using Pi as the agent harness, LM Studio as the inference server, and Docker containers with restricted permissions for security.

Hottest takes

"Just get a 64GB Mac with 1TB of storage!" — _doctor_love
"This is the kind of thing that Anthropic et al should be worried about" — rmunn
"they aren't quite ready to be replaced yet, sadly" — embedding-shape
Made with <3 by @siedrix and @shesho from CDMX. Powered by Forge&Hive.