Claude Opus 4.7
Anthropic's new flagship — 87.6% SWE-bench, 1M context
The Panel's Take
Claude Opus 4.7 is Anthropic's latest flagship model, released April 16. It scores 87.6% on SWE-bench Verified — a 13-point improvement over Claude Opus 4.6 — and 94.2% on GPQA, making it competitive with the top frontier models on coding and scientific reasoning benchmarks. The context window extends to 1 million tokens with substantially improved retrieval accuracy at the far end of the window. The release introduces "Routines" — a first-party feature for defining persistent agentic workflows that Claude can execute autonomously across multiple sessions. Routines are defined in structured YAML and can include tool calls, conditional logic, and human-in-the-loop checkpoints. Anthropic positions this as a more reliable alternative to custom agent frameworks for common use cases. Pricing remains unchanged from Opus 4.6: $5/M input tokens, $25/M output tokens. The vision input resolution has been increased by 3.3x, which meaningfully improves performance on documents, diagrams, and UI screenshots. Available via API immediately and rolling out to Claude.ai Pro and Team plans over the next week.
Share this verdict
Claude Opus 4.7 verdict: SHIP 🚀 3 ships · 1 skip from the expert panel Full review: shiporskip.io/tool/claude-opus-47-anthropic-87-swe-bench-1m-context-routines-2026
Weekly AI Tool Verdicts
Get the next verdict in your inbox
7 critics review a new AI tool every day. Weekly digest — free.
Compare Claude Opus 4.7 with Others
Embed this verdict
Tool makers can add a live ShipOrSkip badge to their site. Badge loads track impressions; clicks route back to this review.
<a href="https://shiporskip.io/api/badge-click/claude-opus-47-anthropic-87-swe-bench-1m-context-routines-2026" target="_blank" rel="noopener"><img src="https://shiporskip.io/api/badge/claude-opus-47-anthropic-87-swe-bench-1m-context-routines-2026" alt="Claude Opus 4.7 Ship verdict on ShipOrSkip" width="360" height="90" /></a>[](https://shiporskip.io/api/badge-click/claude-opus-47-anthropic-87-swe-bench-1m-context-routines-2026)<iframe src="https://shiporskip.io/embed/claude-opus-47-anthropic-87-swe-bench-1m-context-routines-2026" title="Claude Opus 4.7 ShipOrSkip verdict" width="360" height="260" style="border:0;border-radius:16px;max-width:100%;" loading="lazy"></iframe>The reviews
“87.6% on SWE-bench isn't a small improvement — that's a meaningful jump for real-world coding tasks. The Routines feature addresses the biggest pain point with Claude in production: reliable multi-step agent behavior without building a custom framework.”
“Benchmarks look great but the 1M context window performance hasn't been independently validated at the limits. Routines sound powerful but the YAML spec is still in beta with known edge cases. If you're running stable Opus 4.6 workflows, wait a week for the community to stress-test this before migrating.”
“Anthropic is quietly winning the enterprise coding agent race. The combination of top SWE-bench scores with the Routines feature is a moat — developers don't switch orchestration frameworks easily once workflows are deployed. This release deepens that lock-in strategically.”
“The 3.3x vision resolution upgrade is underrated for design work. Document analysis, layout review, and iterating on visual mockups are all dramatically better. I can finally paste a full Figma export and get coherent feedback on the entire design rather than just the top half.”