Norway's 2 petabytes of Huawei flash storage and LLM training

Norway wants its own AI brain, but the internet is already side-eyeing the storage brag

TLDR: Norway’s National Library is using a massive national archive and Huawei storage to build an AI that understands Norwegian language and culture. Commenters were split between praising the ambition and mocking the hype, with jokes that the storage size sounded less like a national moonshot and more like a nerd’s basement project.

Norway’s National Library is building a homegrown artificial intelligence model that actually understands Norwegian history, culture, newspapers, books, and all the stuff global English-first bots tend to blur past. The big flex: it has a giant national archive, a special deal to train on newspaper content, and 2 petabytes of super-fast Huawei storage helping move data into the training process before the real number-crunching happens on the country’s national supercomputer. In plain English: Norway is trying to make sure its future AI speaks like Norway, not Silicon Valley.

But the comments absolutely refused to clap on cue. One camp instantly pounced on the numbers, with skeptics basically saying, “2 petabytes? Cute.” Another commenter waved around a recent Dell server launch to argue that the hardware specs didn’t sound especially jaw-dropping in 2026. Others were even harsher about the bigger idea, rolling their eyes at the claim that every country needs a “sovereign” AI model, with one reader snarking that this kind of argument sounds true enough to unlock funding.

And then came the comedy relief: one of the funniest reactions compared 2 petabytes to the sort of stash the average r/datahoarder user supposedly keeps in a basement. The sharpest divide wasn’t really about Norway’s culture mission — it was whether this is visionary nation-building or just another very expensive AI vanity project with good branding and even better buzzwords

Key Points

  • Norway’s National Library is building a Norwegian-language sovereign LLM using national archive materials and licensed newspaper content.
  • The library uses 2 PB of Huawei OceanStor Dorado all-flash storage in its AI data pipeline, alongside an Nvidia DGX H200 system and a 384-core CPU cluster.
  • The institution has digitized collections since 2005 and now holds 20 PB of unique data, stored in a 3-2-1 model for about 60 PB total.
  • Prepared data is sent to the Sigma2 Olivia national supercomputer, an HPE Cray Supercomputing EX system with 448 GPUs and 64,512 CPU cores, for training runs.
  • According to Marius Husnes, the main technical challenge is not compute availability but moving and preparing PB-scale archival data for high-throughput AI training, along with unresolved issues in evaluation, governance, and orchestration.

Hottest takes

"whatever sounds true enough and gets funding" — Den_VR
"This is how much storage the average r/datahoarder user has in their basement" — ipsum2
"I’m afraid the answer is, mostly you don’t" — Levitz
Made with <3 by @siedrix and @shesho from CDMX. Powered by Forge&Hive.
Norway's 2 petabytes of Huawei flash storage and LLM training - Weaving News | Weaving News