Question 1

Which is better: Darwin-4B-David or Tencent Hy3 Preview?

Accepted Answer

Based on our expert panel, Darwin-4B-David has a stronger verdict with a 75% Ship rate. Darwin-4B-David received a panel verdict of Ship and Tencent Hy3 Preview received Ship.

Question 2

Is Darwin-4B-David free?

Accepted Answer

Darwin-4B-David pricing: Open Source

Question 3

Is Tencent Hy3 Preview free?

Accepted Answer

Tencent Hy3 Preview pricing: Open Weights (Tencent Hy Community License); API from RMB 1.2/M tokens

Question 4

What do experts say about Darwin-4B-David vs Tencent Hy3 Preview?

Accepted Answer

Darwin-4B-David: Darwin-4B-David is a 4.5-billion-parameter model that achieves 85.0% on GPQA Diamond — outperforming Google's Gemma-4-31B (84.3%) at roughly 1/7th the parameter count. The kicker: it required no training whatsoever. It was built in 45 minutes on a single H100 using MRI-guided DARE-TIES model merging, a novel variant of the merge-and-trim technique.

The MRI-guided approach uses activation analysis to identify which parameters in each source model are most critical, then applies DARE-TIES merging only to the high-value weight regions. This avoids the catastrophic interference that usually degrades merged models. The result is a small model that inherits the strengths of multiple larger predecessors without any of the compute cost of fine-tuning.

For the AI community, this is a meaningful data point: model merging continues to close the gap with expensive training runs. Darwin-4B-David demonstrates that thoughtful merge strategies can extract benchmark-level performance from models that are a fraction of the size, making capable AI more accessible on consumer hardware. Tencent Hy3 Preview: Tencent open-sourced Hy3 Preview on April 23, 2026 — the first model to emerge from the company's rebuilt AI infrastructure, and its most credible challenge to frontier closed models to date. With 295 billion total parameters but only 21 billion active at inference time (plus 3.8B MTP layer parameters), it's a Mixture-of-Experts architecture that punches far above its compute weight. The model supports up to 256K context and is available via Hugging Face, ModelScope, and GitCode under the Tencent Hy Community License.

On coding benchmarks, Hy3 scores 74.4% on SWE-bench Verified, 54.4% on Terminal-Bench 2.0, and 67.1% on BrowseComp — placing it firmly in the same tier as top models from Anthropic and OpenAI. Tencent claims a 40% efficiency improvement over its predecessor Hunyuan models, and pricing through Tencent Cloud TokenHub is aggressive: RMB 1.2 per million input tokens. A free two-week window at launch via OpenRouter made it widely accessible immediately.

The model was led by a team that includes former OpenAI researchers and has already been deployed across Tencent's core products — WeChat, Yuanbao, and QQ. That production integration is a meaningful signal: this isn't a benchmark vanity release. For developers who need a powerful, cost-efficient reasoning and agentic model with actual open weights, Hy3 Preview is one of the most interesting drops of April 2026.

Darwin-4B-David vs Tencent Hy3 Preview

Darwin-4B-David

Tencent Hy3 Preview

Bookmarks