Question 1

Which is better: Claude Opus 4.7 or Darwin-4B-David?

Accepted Answer

Based on our expert panel, Claude Opus 4.7 has the stronger verdict, with a 75% Ship rate. Both models received an overall panel verdict of Ship.

Question 2

Is Claude Opus 4.7 free?

Accepted Answer

Claude Opus 4.7 is a paid model. Pricing is $5 per million input tokens and $25 per million output tokens, the same as Opus 4.6.
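
To make the per-token rates concrete, here is a minimal sketch of the arithmetic. The function name and the example token counts are illustrative assumptions; only the two prices come from the answer above.

```python
# Listed rates in USD per million tokens (from the answer above).
INPUT_USD_PER_MILLION = 5.0
OUTPUT_USD_PER_MILLION = 25.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed rates."""
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


# Hypothetical request: 200,000 input tokens and 10,000 output tokens.
print(f"${estimate_cost_usd(200_000, 10_000):.2f}")  # prints $1.25
```

Output tokens cost five times as much as input tokens at these rates, so long generations tend to dominate the bill even when the prompt is large.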

Question 3

Is Darwin-4B-David free?

Accepted Answer

Darwin-4B-David is listed as open source, with no per-token price.

Question 4

What do experts say about Claude Opus 4.7 vs Darwin-4B-David?

Accepted Answer

Claude Opus 4.7: Claude Opus 4.7 is Anthropic's latest flagship model, released April 16. It scores 87.6% on SWE-bench Verified — a 13-point improvement over Claude Opus 4.6 — and 94.2% on GPQA, making it competitive with the top frontier models on coding and scientific reasoning benchmarks. The context window extends to 1 million tokens with substantially improved retrieval accuracy at the far end of the window.

The release introduces "Routines" — a first-party feature for defining persistent agentic workflows that Claude can execute autonomously across multiple sessions. Routines are defined in structured YAML and can include tool calls, conditional logic, and human-in-the-loop checkpoints. Anthropic positions this as a more reliable alternative to custom agent frameworks for common use cases.

Pricing remains unchanged from Opus 4.6: $5/M input tokens, $25/M output tokens. The vision input resolution has been increased by 3.3x, which meaningfully improves performance on documents, diagrams, and UI screenshots. Available via API immediately and rolling out to Claude.ai Pro and Team plans over the next week.

Darwin-4B-David: Darwin-4B-David is a 4.5-billion-parameter model that achieves 85.0% on GPQA Diamond, outperforming Google's Gemma-4-31B (84.3%) at roughly 1/7th the parameter count. The kicker: it required no training whatsoever. It was built in 45 minutes on a single H100 using MRI-guided DARE-TIES model merging, a novel variant of the merge-and-trim technique.

The MRI-guided approach uses activation analysis to identify which parameters in each source model are most critical, then applies DARE-TIES merging only to the high-value weight regions. This avoids the catastrophic interference that usually degrades merged models. The result is a small model that inherits the strengths of multiple larger predecessors without any of the compute cost of fine-tuning.
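
The general idea can be sketched on toy weight vectors. This is not the Darwin-4B-David pipeline: it is a minimal illustration of standard DARE (random drop and rescale of task vectors) followed by TIES-style sign election, restricted to positions flagged by an importance mask. The `important` mask, the function names, and the toy numbers are all assumptions made for illustration; how the real activation-based selection is computed is not described here.

```python
import random


def dare(delta, drop_prob, rng):
    """DARE step: zero each entry with probability drop_prob (must be < 1)
    and rescale the survivors by 1 / (1 - drop_prob)."""
    keep_scale = 1.0 / (1.0 - drop_prob)
    return [d * keep_scale if rng.random() >= drop_prob else 0.0 for d in delta]


def ties_merge(deltas):
    """TIES step: elect a sign per parameter from the summed deltas,
    then average only the entries that agree with that sign."""
    merged = []
    for column in zip(*deltas):
        sign = 1.0 if sum(column) >= 0 else -1.0
        agreeing = [v for v in column if v * sign > 0]
        merged.append(sum(agreeing) / len(agreeing) if agreeing else 0.0)
    return merged


def masked_dare_ties(base, models, important, drop_prob=0.5, seed=0):
    """Merge `models` into `base`, changing only positions where `important`
    is True. The mask is a hypothetical stand-in for activation analysis."""
    rng = random.Random(seed)
    deltas = [
        dare([m - b for m, b in zip(model, base)], drop_prob, rng)
        for model in models
    ]
    merged_delta = ties_merge(deltas)
    return [b + d if keep else b
            for b, d, keep in zip(base, merged_delta, important)]


# Toy example: two fine-tuned variants of a four-parameter base model.
base = [0.0, 1.0, -1.0, 0.5]
model_a = [0.2, 1.5, -1.0, 0.1]
model_b = [0.4, 0.5, -0.8, 0.9]
mask = [True, True, True, False]  # last parameter is left untouched
print(masked_dare_ties(base, [model_a, model_b], mask, drop_prob=0.0))
```

In the second parameter the two source models pull in opposite directions; sign election keeps one direction instead of averaging them to zero, which is the kind of interference the paragraph above refers to. The masked-out last parameter keeps its base value regardless of what the source models contain.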

For the AI community, this is a meaningful data point: model merging continues to close the gap with expensive training runs. Darwin-4B-David demonstrates that thoughtful merge strategies can extract benchmark-level performance from models that are a fraction of the size, making capable AI more accessible on consumer hardware.
