Compare/Claude Opus 4.7 vs Tencent Hy3-preview

AI tool comparison

Claude Opus 4.7 vs Tencent Hy3-preview

Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.

C

Foundation Models

Claude Opus 4.7

Anthropic's new flagship — 87.6% SWE-bench, 1M context

Ship

75%

Panel ship

Community

Paid

Entry

Claude Opus 4.7 is Anthropic's latest flagship model, released April 16. It scores 87.6% on SWE-bench Verified — a 13-point improvement over Claude Opus 4.6 — and 94.2% on GPQA, making it competitive with the top frontier models on coding and scientific reasoning benchmarks. The context window extends to 1 million tokens with substantially improved retrieval accuracy at the far end of the window. The release introduces "Routines" — a first-party feature for defining persistent agentic workflows that Claude can execute autonomously across multiple sessions. Routines are defined in structured YAML and can include tool calls, conditional logic, and human-in-the-loop checkpoints. Anthropic positions this as a more reliable alternative to custom agent frameworks for common use cases. Pricing remains unchanged from Opus 4.6: $5/M input tokens, $25/M output tokens. The vision input resolution has been increased by 3.3x, which meaningfully improves performance on documents, diagrams, and UI screenshots. Available via API immediately and rolling out to Claude.ai Pro and Team plans over the next week.

T

AI Models

Tencent Hy3-preview

Tencent's first open-source frontier MoE — 295B params, 21B active, free on HuggingFace

Ship

75%

Panel ship

Community

Free

Entry

Tencent's Hy3-preview is the company's first public frontier-class language model, released April 23 as open weights on Hugging Face. The model is a 295B parameter Mixture-of-Experts architecture with only 21B parameters active per token — keeping inference costs comparable to much smaller dense models while reaching capabilities that compete with leading proprietary systems. The release comes under new leadership: Yao Shunyu, a former OpenAI researcher, joined Tencent in early 2026 to build out its frontier AI effort. The team claims to have gone from project start to public release in under three months — an unusually fast timeline for a model of this scale. The 256K context window and strong performance on agentic and coding benchmarks position it directly against GLM-5.1 and Qwen3.6 in the open-source frontier race. Free inference is available on OpenRouter's free tier at launch, with the model also appearing on Hugging Face's Inference API. The architecture uses 192 routed experts in a hybrid dense-MoE configuration. For teams needing a capable open-weights model for agentic workflows without paying proprietary API rates, Hy3-preview arrives as a credible option at a remarkable cost-to-capability ratio.

Decision
Claude Opus 4.7
Tencent Hy3-preview
Panel verdict
Ship · 3 ship / 1 skip
Ship · 3 ship / 1 skip
Community
No community votes yet
No community votes yet
Pricing
$5/M input · $25/M output (same as Opus 4.6)
Open Source (free on HuggingFace, free tier on OpenRouter)
Best for
Anthropic's new flagship — 87.6% SWE-bench, 1M context
Tencent's first open-source frontier MoE — 295B params, 21B active, free on HuggingFace
Category
Foundation Models
AI Models

Reviewer scorecard

Builder
80/100 · ship

87.6% on SWE-bench isn't a small improvement — that's a meaningful jump for real-world coding tasks. The Routines feature addresses the biggest pain point with Claude in production: reliable multi-step agent behavior without building a custom framework.

80/100 · ship

295B MoE with 21B active per token is a sweet spot for production use — you get frontier-quality outputs at a fraction of the compute cost. The 256K context and agent-optimized design make this immediately useful for complex workflow automation. Worth running evals against your specific use case.

Skeptic
45/100 · skip

Benchmarks look great but the 1M context window performance hasn't been independently validated at the limits. Routines sound powerful but the YAML spec is still in beta with known edge cases. If you're running stable Opus 4.6 workflows, wait a week for the community to stress-test this before migrating.

45/100 · skip

Tencent hasn't published a full technical report yet, so benchmark claims are hard to independently verify. The 'three months to frontier' narrative sounds impressive but raises questions about training data sourcing and evaluation rigor. Preview releases from large Chinese labs have historically required patience before production stability.

Futurist
80/100 · ship

Anthropic is quietly winning the enterprise coding agent race. The combination of top SWE-bench scores with the Routines feature is a moat — developers don't switch orchestration frameworks easily once workflows are deployed. This release deepens that lock-in strategically.

80/100 · ship

The pace of open-source frontier models from Chinese labs is accelerating faster than anyone predicted — we now have credible open-weight competition from Alibaba, Zhipu, Xiaomi, and Tencent simultaneously. This is geopolitically significant and means the open-source ecosystem will stay competitive with proprietary models for years.

Creator
80/100 · ship

The 3.3x vision resolution upgrade is underrated for design work. Document analysis, layout review, and iterating on visual mockups are all dramatically better. I can finally paste a full Figma export and get coherent feedback on the entire design rather than just the top half.

80/100 · ship

For multilingual creative work — especially for Chinese market content — having a frontier-quality open-source model from a Chinese lab is meaningful. The free OpenRouter tier means creators can experiment without API budgets.

Weekly AI Tool Verdicts

Get the next comparison in your inbox

New AI tools ship daily. We compare them before you waste an afternoon.

Bookmarks

Loading bookmarks...

No bookmarks yet

Bookmark tools to save them for later