Qwen3 Family
Alibaba's full model family: 0.6B to 235B with thinking modes
The Panel's Take
Alibaba's Qwen team released the full Qwen3 model family this week — 8 models ranging from 0.6B to 235B parameters, spanning both dense and Mixture-of-Experts (MoE) architectures. The headline model is Qwen3-235B-A22B, a 235B MoE that activates 22B parameters per token and matches GPT-4.1 on coding and math benchmarks while running at a fraction of the cost. All Qwen3 models feature switchable "thinking modes" — a built-in chain-of-thought toggle that can be enabled or disabled per request. This eliminates the need for separate reasoning vs. instruct variants, letting developers trade latency for accuracy dynamically. All models are released under Apache 2.0, with weights available on Hugging Face and ModelScope. The smaller models are competitive at their size class: Qwen3-4B reportedly matches Qwen2.5-72B-Instruct on several benchmarks, and the 0.6B model is designed to run efficiently on embedded and edge devices. The release also introduces a new multilingual benchmark covering 119 languages, on which the Qwen3 family sets new state-of-the-art scores for open-weights models.
Share this verdict
Qwen3 Family verdict: SHIP 🚀 3 ships · 1 skip from the expert panel Full review: shiporskip.io/tool/qwen3-alibaba-235b-moe-dense-family-8models-thinking-0-2026
Weekly AI Tool Verdicts
Get the next verdict in your inbox
7 critics review a new AI tool every day. Weekly digest — free.
Compare Qwen3 Family with Others
Embed this verdict
Tool makers can add a live ShipOrSkip badge to their site. Badge loads track impressions; clicks route back to this review.
<a href="https://shiporskip.io/api/badge-click/qwen3-alibaba-235b-moe-dense-family-8models-thinking-0-2026" target="_blank" rel="noopener"><img src="https://shiporskip.io/api/badge/qwen3-alibaba-235b-moe-dense-family-8models-thinking-0-2026" alt="Qwen3 Family Ship verdict on ShipOrSkip" width="360" height="90" /></a>[](https://shiporskip.io/api/badge-click/qwen3-alibaba-235b-moe-dense-family-8models-thinking-0-2026)<iframe src="https://shiporskip.io/embed/qwen3-alibaba-235b-moe-dense-family-8models-thinking-0-2026" title="Qwen3 Family ShipOrSkip verdict" width="360" height="260" style="border:0;border-radius:16px;max-width:100%;" loading="lazy"></iframe>The reviews
“Apache 2.0 on a 235B model that matches GPT-4.1 is the most impactful open-source release of the quarter. The dynamic thinking mode toggle is exactly what production systems need — you don't always want a 30-second reasoning chain on every request.”
“Alibaba's benchmark methodology has been questioned before. The 'matches GPT-4.1' claim needs independent validation on real tasks. Also, while Apache 2.0 is permissive, enterprise legal teams will still scrutinize models from Chinese companies for compliance reasons.”
“Eight models with consistent APIs, multilingual coverage, and open weights — this is what a real AI platform looks like. Alibaba is building a global alternative to OpenAI's stack, and the quality gap is closing faster than anyone expected two years ago.”
“The multilingual benchmark improvements are huge for global content teams. I tested Qwen3-7B on Japanese marketing copy and it handled tone and register better than anything at this size class. For small teams creating content in non-English markets, this is a serious unlock.”