AI tool comparison
Qwen3.5-Omni vs Tencent Hy3-preview
Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.
AI Models
Qwen3.5-Omni
Show it a sketch, get a React app — Alibaba's native omnimodal AI
75%
Panel ship
—
Community
Paid
Entry
Qwen3.5-Omni is Alibaba's most advanced multimodal model yet — a native Thinker-Talker architecture that processes and generates text, audio, and video in a single unified system. Released in three variants (Plus, Flash, Light), it supports a 256k context window, 10+ hours of audio, and 400 seconds of 720p video at 1 FPS, with speech recognition across 113 languages and dialects. The headline capability is what Alibaba is calling "Audio-Visual Vibe Coding" — an emergent behavior where the model writes functional code based solely on watching a video and listening to spoken instructions. In demos, it takes a hand-drawn sketch held up to a camera and converts it into a working React webpage in real time. This wasn't an explicitly trained capability; it emerged from the model's unified multimodal architecture. The model uses semantic interruption and turn-taking intent recognition for real-time interaction, and TMRoPE for temporal multimodal position encoding. The catch: Alibaba broke from its open-source streak and kept Qwen3.5-Omni proprietary, accessible only through their chatbot interface and Alibaba Cloud. The open-source community has noticed — and is not pleased.
AI Models
Tencent Hy3-preview
Tencent's first open-source frontier MoE — 295B params, 21B active, free on HuggingFace
75%
Panel ship
—
Community
Free
Entry
Tencent's Hy3-preview is the company's first public frontier-class language model, released April 23 as open weights on Hugging Face. The model is a 295B parameter Mixture-of-Experts architecture with only 21B parameters active per token — keeping inference costs comparable to much smaller dense models while reaching capabilities that compete with leading proprietary systems. The release comes under new leadership: Yao Shunyu, a former OpenAI researcher, joined Tencent in early 2026 to build out its frontier AI effort. The team claims to have gone from project start to public release in under three months — an unusually fast timeline for a model of this scale. The 256K context window and strong performance on agentic and coding benchmarks position it directly against GLM-5.1 and Qwen3.6 in the open-source frontier race. Free inference is available on OpenRouter's free tier at launch, with the model also appearing on Hugging Face's Inference API. The architecture uses 192 routed experts in a hybrid dense-MoE configuration. For teams needing a capable open-weights model for agentic workflows without paying proprietary API rates, Hy3-preview arrives as a credible option at a remarkable cost-to-capability ratio.
Reviewer scorecard
“Audio-Visual Vibe Coding is the most interesting emergent capability I've seen in months — show it a sketch, get a React app. If they open the API with reasonable pricing, this becomes my go-to for multimodal prototyping immediately.”
“295B MoE with 21B active per token is a sweet spot for production use — you get frontier-quality outputs at a fraction of the compute cost. The 256K context and agent-optimized design make this immediately useful for complex workflow automation. Worth running evals against your specific use case.”
“Alibaba broke their open-source streak and didn't provide any API access outside Alibaba Cloud. The 'emergent' vibe coding demos look impressive in controlled settings but we have zero third-party validation. Wait for independent benchmarks and an actual API before getting excited.”
“Tencent hasn't published a full technical report yet, so benchmark claims are hard to independently verify. The 'three months to frontier' narrative sounds impressive but raises questions about training data sourcing and evaluation rigor. Preview releases from large Chinese labs have historically required patience before production stability.”
“Native audio-visual-to-code generation is a paradigm shift. The fact it emerged without explicit training suggests we're still in the early stages of understanding what multimodal models can do. This points toward agents that watch, listen, and build — simultaneously.”
“The pace of open-source frontier models from Chinese labs is accelerating faster than anyone predicted — we now have credible open-weight competition from Alibaba, Zhipu, Xiaomi, and Tencent simultaneously. This is geopolitically significant and means the open-source ecosystem will stay competitive with proprietary models for years.”
“Sketching on paper and getting a working webpage is every designer's dream workflow. The semantic interruption and turn-taking features make it feel like a genuine conversation partner rather than a query machine. Huge potential for creative applications.”
“For multilingual creative work — especially for Chinese market content — having a frontier-quality open-source model from a Chinese lab is meaningful. The free OpenRouter tier means creators can experiment without API budgets.”
Weekly AI Tool Verdicts
Get the next comparison in your inbox
New AI tools ship daily. We compare them before you waste an afternoon.