G

GLM-5.1

Zhipu AI's 744B MIT-licensed model that beats Claude and GPT on SWE-Bench

PriceOpen Source (MIT)Reviewed2026-04-20
Verdict — Skip
2 Ships2 Skips
Visit huggingface.co

The Panel's Take

GLM-5.1 is Zhipu AI's latest open-weights language model — a 744B parameter mixture-of-experts (MoE) architecture that activates 40B parameters per forward pass. Released under the MIT license with a 200,000-token context window, it has quietly topped the SWE-Bench Pro leaderboard, surpassing both Claude Opus 4.6 and GPT-5.4 on expert-level software engineering tasks. The MoE architecture means GLM-5.1 is significantly cheaper to run per token than a dense 744B model, with inference costs approaching dense 40B models for most workloads. Zhipu AI (a Tsinghua University spin-out) has steadily iterated on the GLM family to produce a text-focused reasoning model that holds its own against proprietary frontier models — now, for the first time, reportedly exceeding them on coding benchmarks. The MIT license is the headline for enterprise and research users: full commercial use, no usage restrictions, no API dependency. This puts GLM-5.1 in direct competition with Qwen3.5 for the "best open-weights model you can actually use for anything" crown, with a differentiating edge in software engineering tasks specifically.

Share this verdict

GLM-5.1 verdict: SKIP ⏭️

2 ships · 2 skips from the expert panel

Full review: shiporskip.io/tool/glm-51-zhipu-ai-744b-moe-mit-open-weights-swe-bench-2026

Weekly AI Tool Verdicts

Get the next verdict in your inbox

7 critics review a new AI tool every day. Weekly digest — free.

Embed this verdict

Tool makers can add a live ShipOrSkip badge to their site. Badge loads track impressions; clicks route back to this review.

Skip · 5.0/10
HTML badge
<a href="https://shiporskip.io/api/badge-click/glm-51-zhipu-ai-744b-moe-mit-open-weights-swe-bench-2026" target="_blank" rel="noopener"><img src="https://shiporskip.io/api/badge/glm-51-zhipu-ai-744b-moe-mit-open-weights-swe-bench-2026" alt="GLM-5.1 Skip verdict on ShipOrSkip" width="360" height="90" /></a>
Markdown badge
[![GLM-5.1 Skip verdict on ShipOrSkip](https://shiporskip.io/api/badge/glm-51-zhipu-ai-744b-moe-mit-open-weights-swe-bench-2026)](https://shiporskip.io/api/badge-click/glm-51-zhipu-ai-744b-moe-mit-open-weights-swe-bench-2026)
Iframe widget
<iframe src="https://shiporskip.io/embed/glm-51-zhipu-ai-744b-moe-mit-open-weights-swe-bench-2026" title="GLM-5.1 ShipOrSkip verdict" width="360" height="260" style="border:0;border-radius:16px;max-width:100%;" loading="lazy"></iframe>

The reviews

SWE-Bench Pro beating Claude and GPT-5.4 is the real signal here. For coding automation workflows, having an MIT-licensed 200K context model at that quality tier changes the build-vs-buy calculus significantly. Deploying this on dedicated hardware is now a serious option for engineering teams.

Helpful?

744B total parameters still requires serious infrastructure — you're looking at 8x H100s at minimum for comfortable inference. The 40B active parameters help with cost but not with deployment complexity. This is 'open source' for well-funded teams, not indie builders.

Helpful?

The open-weights ecosystem has now fully caught up to proprietary models on the most demanding software engineering benchmarks. This is the moment the 'open vs closed' debate definitively changes — the argument that proprietary models are categorically better no longer holds.

Helpful?

Unless you're a creative tech team with serious infrastructure, this isn't practical for most creative workflows. The quality is undeniably impressive but the deployment story doesn't fit solo creators or small studios.

Helpful?

Bookmarks

Loading bookmarks...

No bookmarks yet

Bookmark tools to save them for later