Question 1

Which is better: Qwen3.5-Omni or Qwen3.6-27B?

Accepted Answer

Based on our expert panel, Qwen3.5-Omni has a stronger verdict with a 75% Ship rate. Qwen3.5-Omni received a panel verdict of Ship and Qwen3.6-27B received Ship.

Question 2

Is Qwen3.5-Omni free?

Accepted Answer

Qwen3.5-Omni pricing: Proprietary / API (Alibaba Cloud)

Question 3

Is Qwen3.6-27B free?

Accepted Answer

Qwen3.6-27B pricing: Open Source

Question 4

What do experts say about Qwen3.5-Omni vs Qwen3.6-27B?

Accepted Answer

Qwen3.5-Omni: Qwen3.5-Omni is Alibaba's most advanced multimodal model yet — a native Thinker-Talker architecture that processes and generates text, audio, and video in a single unified system. Released in three variants (Plus, Flash, Light), it supports a 256k context window, 10+ hours of audio, and 400 seconds of 720p video at 1 FPS, with speech recognition across 113 languages and dialects.

The headline capability is what Alibaba is calling "Audio-Visual Vibe Coding" — an emergent behavior where the model writes functional code based solely on watching a video and listening to spoken instructions. In demos, it takes a hand-drawn sketch held up to a camera and converts it into a working React webpage in real time. This wasn't an explicitly trained capability; it emerged from the model's unified multimodal architecture.

The model uses semantic interruption and turn-taking intent recognition for real-time interaction, and TMRoPE for temporal multimodal position encoding. The catch: Alibaba broke from its open-source streak and kept Qwen3.5-Omni proprietary, accessible only through their chatbot interface and Alibaba Cloud. The open-source community has noticed — and is not pleased. Qwen3.6-27B: Qwen3.6-27B is a 27-billion-parameter dense language model from Alibaba's Qwen team, released today under an open license. The headline claim is striking: it outperforms the much larger Qwen3.5-397B on major coding benchmarks, achieving what the team calls 'flagship-level coding performance' at a fraction of the parameter count. This follows the broader MoE-to-dense efficiency trend playing out across the open-weights ecosystem.

The model targets software engineering tasks specifically — code generation, debugging, repository-level reasoning, and multi-file editing. It's available in full precision and quantized formats on Hugging Face, with community Q4 and Q8 builds already appearing within hours of the release. At 27B parameters in Q4, it fits comfortably on a single consumer GPU, making it practically accessible without enterprise hardware.

This release is significant for the local LLM community. Qwen has been one of the most competitive open-weights families for coding tasks, and a 27B dense model that competes with models several times its size changes the cost calculus for self-hosted coding agents, development tooling, and any application where inference cost matters. Expect rapid adoption in tools like Jan, LM Studio, and Ollama.

Qwen3.5-Omni vs Qwen3.6-27B

Qwen3.5-Omni

Qwen3.6-27B

Bookmarks