GLM-5.1
Zhipu AI's 744B MIT-licensed model that beats Claude and GPT on SWE-Bench
The Panel's Take
GLM-5.1 is Zhipu AI's latest open-weights language model — a 744B parameter mixture-of-experts (MoE) architecture that activates 40B parameters per forward pass. Released under the MIT license with a 200,000-token context window, it has quietly topped the SWE-Bench Pro leaderboard, surpassing both Claude Opus 4.6 and GPT-5.4 on expert-level software engineering tasks. The MoE architecture means GLM-5.1 is significantly cheaper to run per token than a dense 744B model, with inference costs approaching dense 40B models for most workloads. Zhipu AI (a Tsinghua University spin-out) has steadily iterated on the GLM family to produce a text-focused reasoning model that holds its own against proprietary frontier models — now, for the first time, reportedly exceeding them on coding benchmarks. The MIT license is the headline for enterprise and research users: full commercial use, no usage restrictions, no API dependency. This puts GLM-5.1 in direct competition with Qwen3.5 for the "best open-weights model you can actually use for anything" crown, with a differentiating edge in software engineering tasks specifically.
Share this verdict
GLM-5.1 verdict: SKIP ⏭️ 2 ships · 2 skips from the expert panel Full review: shiporskip.io/tool/glm-51-zhipu-ai-744b-moe-mit-open-weights-swe-bench-2026
Weekly AI Tool Verdicts
Get the next verdict in your inbox
7 critics review a new AI tool every day. Weekly digest — free.
Compare GLM-5.1 with Others
Embed this verdict
Tool makers can add a live ShipOrSkip badge to their site. Badge loads track impressions; clicks route back to this review.
<a href="https://shiporskip.io/api/badge-click/glm-51-zhipu-ai-744b-moe-mit-open-weights-swe-bench-2026" target="_blank" rel="noopener"><img src="https://shiporskip.io/api/badge/glm-51-zhipu-ai-744b-moe-mit-open-weights-swe-bench-2026" alt="GLM-5.1 Skip verdict on ShipOrSkip" width="360" height="90" /></a>[](https://shiporskip.io/api/badge-click/glm-51-zhipu-ai-744b-moe-mit-open-weights-swe-bench-2026)<iframe src="https://shiporskip.io/embed/glm-51-zhipu-ai-744b-moe-mit-open-weights-swe-bench-2026" title="GLM-5.1 ShipOrSkip verdict" width="360" height="260" style="border:0;border-radius:16px;max-width:100%;" loading="lazy"></iframe>The reviews
“SWE-Bench Pro beating Claude and GPT-5.4 is the real signal here. For coding automation workflows, having an MIT-licensed 200K context model at that quality tier changes the build-vs-buy calculus significantly. Deploying this on dedicated hardware is now a serious option for engineering teams.”
“744B total parameters still requires serious infrastructure — you're looking at 8x H100s at minimum for comfortable inference. The 40B active parameters help with cost but not with deployment complexity. This is 'open source' for well-funded teams, not indie builders.”
“The open-weights ecosystem has now fully caught up to proprietary models on the most demanding software engineering benchmarks. This is the moment the 'open vs closed' debate definitively changes — the argument that proprietary models are categorically better no longer holds.”
“Unless you're a creative tech team with serious infrastructure, this isn't practical for most creative workflows. The quality is undeniably impressive but the deployment story doesn't fit solo creators or small studios.”