Qwen3.6-35B-A3B
35B total, 3B active: Alibaba's lean MoE coding beast goes fully open source
The Panel's Take
Alibaba's Qwen team open-sourced Qwen3.6-35B-A3B on April 16, 2026. It is a sparse Mixture-of-Experts (MoE) model with 35 billion total parameters, of which only about 3 billion are active per forward pass. That architectural trick is the whole story: you get near-frontier performance at a per-token compute cost comparable to a 3B dense model, although all 35 billion parameters still have to be stored. It's available under Apache 2.0 on Hugging Face and ModelScope.
The model supports a 262K-token context window (extensible to 1M with YaRN), accepts multimodal inputs including text, images, and video, and is purpose-built for agentic coding workflows. On SWE-bench and Terminal-Bench it outperforms the dense Qwen3.5-27B, which activates roughly nine times as many parameters per token, and it matches Gemma4-31B on several benchmarks. Its RefCOCO visual grounding score is 92.0, and some multimodal metrics reach Claude Sonnet 4.5 territory.
Community reaction has been immediate: r/LocalLLaMA lit up with benchmarks showing it solving coding tasks that models with 10x the active parameters couldn't handle. The FP8 quantized variant is said to run comfortably on a single 24GB consumer GPU; since FP8 weights alone occupy roughly 35 GB, that presumably relies on partial CPU offload or a lower-bit quantization. Either way, this is the most capable locally runnable coding agent most developers have ever had access to.
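The parameter counts above translate into a quick back-of-envelope check. The sketch below is only an estimate: it counts weights alone (no KV cache, activations, or runtime overhead), and the bytes-per-parameter figures for each precision are assumed nominal values, not measurements of any released checkpoint.

```python
# Back-of-envelope figures derived from the parameter counts quoted in the review.
TOTAL_PARAMS = 35e9    # total parameters
ACTIVE_PARAMS = 3e9    # approximate active parameters per token
GPU_MEMORY_GB = 24     # the consumer GPU size mentioned in the review

# Assumed nominal storage cost per parameter (weights only).
BYTES_PER_PARAM = {"bf16": 2.0, "fp8": 1.0, "int4": 0.5}


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Weights-only footprint in decimal gigabytes."""
    return params * bytes_per_param / 1e9


def active_fraction(total: float, active: float) -> float:
    """Share of the parameters used for any single token."""
    return active / total


def fits_on_gpu(params: float, precision: str, gpu_gb: float = GPU_MEMORY_GB) -> bool:
    """True if the weights alone fit in GPU memory at the given precision."""
    return weight_memory_gb(params, BYTES_PER_PARAM[precision]) <= gpu_gb


if __name__ == "__main__":
    print(f"active share per token: {active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS):.1%}")
    for precision, nbytes in BYTES_PER_PARAM.items():
        size = weight_memory_gb(TOTAL_PARAMS, nbytes)
        verdict = "fits" if fits_on_gpu(TOTAL_PARAMS, precision) else "does not fit"
        print(f"{precision}: {size:.1f} GB of weights, {verdict} in {GPU_MEMORY_GB} GB")
```

Sparsity cuts per-token compute, not storage: every expert still has to live somewhere, which is why the memory figure tracks the 35B total rather than the 3B active count.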
Verdict: SHIP 🚀 (3 ships · 1 skip from the expert panel)
The reviews
“3B active parameters with 35B parameter breadth is engineering magic. I'm getting near-frontier coding results in Cline and running it locally on a 3090 — the refusals are lower than Claude for security research too. Apache 2.0 means I can fine-tune it on my codebase. This is the best open-source coding model I've used.”
“MoE models have notoriously bad batching throughput — if you're serving this at scale, the economics don't work out. And Alibaba's track record on long-term model support and safety filtering is shakier than Google or Anthropic. It's impressive in isolation, but enterprise teams should pressure-test it before replacing frontier APIs.”
“The gap between open and closed models is closing faster than anyone predicted. When a freely downloadable model matches Claude Sonnet on multimodal benchmarks, the frontier lab pricing power evaporates. Qwen3.6-35B-A3B is another milestone in the commoditization of intelligence — and commoditization always accelerates adoption.”
“I don't often care about coding models, but this one handles image + video understanding for design briefs surprisingly well. I used it to analyze a competitor's UI and generate a full redesign spec. The 262K context means I can feed entire brand guidelines without chunking.”
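The batching concern raised in the skeptical review can be made a little more concrete. As a rough sketch, assume each token picks its top-k experts independently and uniformly at random; the expected fraction of experts touched in one MoE layer by a batch of B tokens is then 1 - (1 - k/E)^B. The expert count (128) and top-k (8) used below are hypothetical placeholders chosen for illustration, not published figures for this model, and real routers are not uniform.

```python
def expected_expert_fraction(num_experts: int, top_k: int, batch_tokens: int) -> float:
    """Expected share of experts hit at least once in one MoE layer.

    Assumes uniform, independent routing per token (a simplification).
    """
    if not 0 < top_k <= num_experts:
        raise ValueError("top_k must be between 1 and num_experts")
    if batch_tokens < 0:
        raise ValueError("batch_tokens must be non-negative")
    # Probability that one token does NOT select a given expert.
    miss_probability = 1.0 - top_k / num_experts
    return 1.0 - miss_probability ** batch_tokens


if __name__ == "__main__":
    # Hypothetical layer shape: 128 experts, 8 routed per token.
    for batch in (1, 8, 64, 512):
        share = expected_expert_fraction(128, 8, batch)
        print(f"batch={batch:4d} tokens -> {share:.1%} of experts touched")
```

Under these assumptions a single token touches only a small slice of the expert weights, while a moderately sized batch touches nearly all of them, so per-step weight traffic grows toward that of the full model. Whether that makes serving uneconomical depends on hardware and the serving stack, which this sketch does not model.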