AI tool comparison
Qwen3.6-35B-A3B vs Tencent Hy3 Preview
Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.
Open Source Models
Qwen3.6-35B-A3B
35B total, 3B active: Alibaba's lean MoE coding beast goes fully open source
75%
Panel ship
—
Community
Free
Entry
Alibaba's Qwen team open-sourced Qwen3.6-35B-A3B on April 16, 2026 — a sparse Mixture-of-Experts model with 35 billion total parameters but only ~3 billion active per forward pass. That architectural trick is the whole story: you get near-frontier performance while consuming compute comparable to a 3B dense model. It's available under Apache 2.0 on Hugging Face and ModelScope. The model supports a 262K token context window (extensible to 1M with YaRN), multimodal inputs including text, images, and video, and is purpose-built for agentic coding workflows. On SWE-bench and Terminal-Bench it outperforms the much larger dense Qwen3.5-27B, matching Gemma4-31B on several benchmarks. RefCOCO visual grounding score hits 92.0 — some multimodal metrics reach Claude Sonnet 4.5 territory. Community reaction has been immediate: r/LocalLLaMA lit up with benchmarks showing it solving coding tasks that models with 10x the active parameters couldn't handle. The FP8 quantized variant runs comfortably on a single 24GB consumer GPU, making this the most capable locally-runnable coding agent most developers have ever had access to.
AI Models
Tencent Hy3 Preview
295B MoE open weights — China's most efficient frontier model yet
75%
Panel ship
—
Community
Paid
Entry
Tencent open-sourced Hy3 Preview on April 23, 2026 — the first model to emerge from the company's rebuilt AI infrastructure, and its most credible challenge to frontier closed models to date. With 295 billion total parameters but only 21 billion active at inference time (plus 3.8B MTP layer parameters), it's a Mixture-of-Experts architecture that punches far above its compute weight. The model supports up to 256K context and is available via Hugging Face, ModelScope, and GitCode under the Tencent Hy Community License. On coding benchmarks, Hy3 scores 74.4% on SWE-bench Verified, 54.4% on Terminal-Bench 2.0, and 67.1% on BrowseComp — placing it firmly in the same tier as top models from Anthropic and OpenAI. Tencent claims a 40% efficiency improvement over its predecessor Hunyuan models, and pricing through Tencent Cloud TokenHub is aggressive: RMB 1.2 per million input tokens. A free two-week window at launch via OpenRouter made it widely accessible immediately. The model was led by a team that includes former OpenAI researchers and has already been deployed across Tencent's core products — WeChat, Yuanbao, and QQ. That production integration is a meaningful signal: this isn't a benchmark vanity release. For developers who need a powerful, cost-efficient reasoning and agentic model with actual open weights, Hy3 Preview is one of the most interesting drops of April 2026.
Reviewer scorecard
“3B active parameters with 35B parameter breadth is engineering magic. I'm getting near-frontier coding results in Cline and running it locally on a 3090 — the refusals are lower than Claude for security research too. Apache 2.0 means I can fine-tune it on my codebase. This is the best open-source coding model I've used.”
“21B active params with 295B total — this is genuinely practical to deploy on reasonable hardware while matching models 10x the inference cost. The 256K context and strong SWE-bench score make it a legitimate option for agentic coding pipelines. I'd use this today.”
“MoE models have notoriously bad batching throughput — if you're serving this at scale, the economics don't work out. And Alibaba's track record on long-term model support and safety filtering is shakier than Google or Anthropic. It's impressive in isolation, but enterprise teams should pressure-test it before replacing frontier APIs.”
“The Tencent Hy Community License is not Apache 2.0 or MIT — read it carefully before using this in production. There are usage restrictions that could bite commercial deployments. Also, benchmark scores look great, but independent evals of Chinese labs' models have historically diverged from self-reported numbers.”
“The gap between open and closed models is closing faster than anyone predicted. When a freely downloadable model matches Claude Sonnet on multimodal benchmarks, the frontier lab pricing power evaporates. Qwen3.6-35B-A3B is another milestone in the commoditization of intelligence — and commoditization always accelerates adoption.”
“The MoE efficiency race is the actual story here — we're getting frontier-class capability at a fraction of the activation cost. Hy3 is proof that the compute-vs-capability Pareto frontier keeps moving. Open weights with real deployment signals (WeChat at scale) is a combination that matters.”
“I don't often care about coding models, but this one handles image + video understanding for design briefs surprisingly well. I used it to analyze a competitor's UI and generate a full redesign spec. The 262K context means I can feed entire brand guidelines without chunking.”
“Strong visual coding capabilities and multimodal understanding make this genuinely useful for design-to-code workflows. The health image analysis and product comparison use cases already deployed in Yuanbao show real-world creative utility beyond pure benchmark games.”
Weekly AI Tool Verdicts
Get the next comparison in your inbox
New AI tools ship daily. We compare them before you waste an afternoon.