Question 1

Which is better: Qwen3.6-Plus or Qwen3 Family?

Accepted Answer

Based on our expert panel, Qwen3.6-Plus has a stronger verdict with a 75% Ship rate. Qwen3.6-Plus received a panel verdict of Ship and Qwen3 Family received Ship.

Question 2

Is Qwen3.6-Plus free?

Accepted Answer

Qwen3.6-Plus pricing: Free (preview) / Paid API

Question 3

Is Qwen3 Family free?

Accepted Answer

Qwen3 Family pricing: Open Source (Apache 2.0) / API via Alibaba Cloud

Question 4

What do experts say about Qwen3.6-Plus vs Qwen3 Family?

Accepted Answer

Qwen3.6-Plus: Qwen3.6-Plus is Alibaba's latest frontier model, built specifically for agentic real-world tasks with a particular emphasis on software engineering. Released in preview on OpenRouter as a free tier, it scores 61.6 on Terminal-Bench 2.0, edging past Claude Opus 4.5 (59.3), while running at roughly 3x the speed. It supports a 1M token context window with 65K output tokens — larger than most competitors.

Under the hood, Qwen3.6-Plus is a sparse mixture-of-experts architecture, activating a fraction of its parameters per forward pass for efficiency. It supports both text and multimodal inputs, and the API supports tool use natively — making it well-suited for agent loops. The free preview is positioned as a direct challenge to OpenAI and Anthropic in the agentic coding space.

The timing is notable: released the same week as Google Gemma 4 and Cursor 3, signaling an industry-wide pivot from autocomplete to full autonomous agents. With free preview access already expiring, Alibaba is clearly using the buzz from benchmark dominance to drive early adoption at the API tier. Qwen3 Family: Alibaba's Qwen team released the full Qwen3 model family this week — 8 models ranging from 0.6B to 235B parameters, spanning both dense and Mixture-of-Experts (MoE) architectures. The headline model is Qwen3-235B-A22B, a 235B MoE that activates 22B parameters per token and matches GPT-4.1 on coding and math benchmarks while running at a fraction of the cost.

All Qwen3 models feature switchable "thinking modes" — a built-in chain-of-thought toggle that can be enabled or disabled per request. This eliminates the need for separate reasoning vs. instruct variants, letting developers trade latency for accuracy dynamically. All models are released under Apache 2.0, with weights available on Hugging Face and ModelScope.

The smaller models are competitive at their size class: Qwen3-4B reportedly matches Qwen2.5-72B-Instruct on several benchmarks, and the 0.6B model is designed to run efficiently on embedded and edge devices. The release also introduces a new multilingual benchmark covering 119 languages, on which the Qwen3 family sets new state-of-the-art scores for open-weights models.

Qwen3.6-Plus vs Qwen3 Family

Qwen3.6-Plus

Qwen3 Family

Bookmarks