GLM 5.2 Performance Benchmarks

This new AI is fast and clever, but the comments are split over the hype and the price

TLDR: GLM-5.2 (max) is a powerful new open AI model with strong scores, high speed, and a huge reading limit, but it also costs a lot. Commenters are torn between cheering its progress, praising its honesty on tricky questions, and doubting whether the benchmark hype is fully believable.

GLM-5.2 (max) just dropped into the AI rankings like a reality-show contestant who is obviously talented but also arrives wearing a wildly expensive outfit. On paper, the model looks strong: it scores 51 on the Artificial Analysis Intelligence Index, which puts it far above the average in its class, it blasts along at 112 tokens per second, and it can handle a massive 1 million-token context window — basically a huge amount of text at once. But the community? Oh, the community immediately turned this into a messy group chat about trust, bragging rights, and whether this thing ever stops talking.

The biggest applause came from people cheering open models getting closer to the top. One commenter joked that with “one or two more releases” it could hit Fable level, which is basically the AI equivalent of saying, “Cute now, but future champion.” Others were more impressed by a benchmark that rewards models for admitting when they don’t know something instead of confidently making stuff up. That got real love, with one fan practically applauding GLM-5.2 for not trying to “bullshit” its way through hard questions.

But not everyone was buying the party balloons. One camp side-eyed the benchmark rankings altogether, saying results like Muse Spark beating GPT-5.5 made them hesitate. Another recurring gripe: this model is chatty. At 140 million output tokens during testing, some users basically called it the friend who answers a simple question with a podcast. So the verdict is deliciously divided: impressive, fast, and open — but expensive, wordy, and still on trial in the court of comment-section opinion.

Key Points

  • GLM-5.2 (max) is an open-weights reasoning model released in June 2026 with text input/output support and a 1 million-token context window.
  • The model scores 51 on the Artificial Analysis Intelligence Index, above the comparable-model average of 24.
  • Artificial Analysis reports that GLM-5.2 (max) generated 140 million tokens during evaluation, compared with an average of 110 million, indicating higher verbosity.
  • Pricing is listed at $1.40 per 1 million input tokens and $4.40 per 1 million output tokens, with a total Intelligence Index evaluation cost of $867.88.
  • Technical specifications include 753 billion total parameters, 40 billion active parameters, MIT licensing, and model weights available on Hugging Face.

Hottest takes

"reach Fable level" — DeathArrow
"punishes them for trying to bullshit" — wongarsu
"gives me pause" — theturtletalks
Made with <3 by @siedrix and @shesho from CDMX. Powered by Forge&Hive.