AI tool comparison
LFM2.5-VL vs Qwen3.6-Plus
Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.
AI Models
LFM2.5-VL
450M vision-language model that runs in under 250ms on edge hardware
75%
Panel ship
—
Community
Paid
Entry
Liquid AI just shipped LFM2.5-VL, a 450M-parameter vision-language model engineered from the ground up for edge deployment. Unlike most VLMs that require a beefy GPU in the cloud, LFM2.5-VL targets devices like the Snapdragon 8 Elite, NVIDIA Jetson Orin, and AMD Ryzen AI — hitting sub-250ms latency on-device without any cloud round-trip. This model builds significantly on its predecessor with four new capabilities: bounding box prediction (81.28 on RefCOCO-M), multilingual support across 8 languages, function calling, and improved instruction following. Those aren't just benchmark checkboxes — bounding box prediction means you can run visual grounding and object detection pipelines on a phone or robot without any server involvement. Liquid AI is the MIT-spun startup behind Liquid Foundation Models (LFMs), a non-Transformer architecture that delivers competitive performance at a fraction of the memory footprint. LFM2.5-VL is available free on HuggingFace and through Liquid's LEAP inference platform. For builders targeting on-device AI — robotics, mobile, embedded — this is one of the most practical releases of the month.
AI Models
Qwen3.6-Plus
The agentic coding model beating Claude Opus 4.5 — free on OpenRouter
75%
Panel ship
—
Community
Free
Entry
Qwen3.6-Plus is Alibaba's latest frontier model, built specifically for agentic real-world tasks with a particular emphasis on software engineering. Released in preview on OpenRouter as a free tier, it scores 61.6 on Terminal-Bench 2.0, edging past Claude Opus 4.5 (59.3), while running at roughly 3x the speed. It supports a 1M token context window with 65K output tokens — larger than most competitors. Under the hood, Qwen3.6-Plus is a sparse mixture-of-experts architecture, activating a fraction of its parameters per forward pass for efficiency. It supports both text and multimodal inputs, and the API supports tool use natively — making it well-suited for agent loops. The free preview is positioned as a direct challenge to OpenAI and Anthropic in the agentic coding space. The timing is notable: released the same week as Google Gemma 4 and Cursor 3, signaling an industry-wide pivot from autocomplete to full autonomous agents. With free preview access already expiring, Alibaba is clearly using the buzz from benchmark dominance to drive early adoption at the API tier.
Reviewer scorecard
“Sub-250ms on-device vision with function calling is the unlock for a huge class of apps that couldn't tolerate cloud latency — real-time AR overlays, offline field inspection, privacy-sensitive medical imaging. The bounding box support is icing; ship this.”
“The Terminal-Bench numbers don't lie — this thing completes agentic coding tasks better than Opus at a fraction of the cost. The 1M context window means I can throw an entire monorepo at it. Free preview while it lasts is a no-brainer for any dev working on agent pipelines.”
“450M parameters with 8-language support and benchmark-leading vision grounding sounds great until you try to fine-tune it for a domain-specific task. The LEAP platform is still invite-only and the open weights lack fine-tuning docs. Worth watching but not shipping to prod yet.”
“Benchmark performance on Terminal-Bench doesn't always translate to real-world reliability. Alibaba's track record on model longevity and API uptime is spottier than Anthropic's or OpenAI's. The free preview ending today is also a classic bait-and-switch move — the real question is what the paid tier costs.”
“The race to run capable VLMs on-device is the precursor to AI-native hardware. Liquid's non-Transformer architecture is showing that efficiency gains don't require the same trade-offs as quantization. This is what AI hardware of 2028 will be built around.”
“We're seeing the first real multi-model agent race, and Qwen3.6-Plus is the opening shot from China. The combination of 1M context, agentic optimization, and benchmark-beating performance signals that the era of Western AI dominance in coding agents may be over. This reshapes the market.”
“On-device vision that can call functions means camera-native apps that don't phone home. Think real-time style transfer, offline image tagging, or AR creative tools that actually work on a plane. The creator tooling implications are underrated.”
“For automation-heavy creative workflows — building tools, scraping, image pipelines — having a faster, cheaper frontier model with giant context is genuinely useful. I can run whole project contexts through it without hitting limits. The free preview makes it a zero-cost experiment.”
Weekly AI Tool Verdicts
Get the next comparison in your inbox
New AI tools ship daily. We compare them before you waste an afternoon.