Claude Opus 4.7 leads on SWE-bench and agentic reasoning, beating GPT-5.4 and Gemini 3.1 Pro
Airfind news item
By Alina Maria Stan
Published on April 16, 2026.
Anthropic has released Claude Opus 4.7, its most capable generally available model, with benchmark-leading scores on SWE-bench Pro (64.3% vs GPT-5.4’s 57.7%), multi-agent coordination for hours-long workflows, 3x higher image resolution, and a 14% improvement in multi-step agentic reasoning with a third of the tool errors. The model is priced at $5/25 per million tokens and available across Claude plans and through Amazon Bedrock, Vertex AI, and Microsoft Foundry. The release comes at a time when Anthropic's commercial momentum is high, with a $30 billion annualised revenue rate and investor offers at roughly $800 billion. The most significant improvements may not be captured by any single benchmark, but by being the model that enterprises and developers choose to build on.
Read Original Article