3MinuteRead | Claude Opus 4.7 leads on SWE-bench and agentic reasoning, beating GPT-5.4 and Gemini 3.1 Pro

Claude Opus 4.7 leads on SWE-bench and agentic reasoning, beating GPT-5.4 and Gemini 3.1 Pro

By Alina Maria Stan

Published on April 16, 2026.

Anthropic has released Claude Opus 4.7, its most capable generally available model, with benchmark-leading scores on SWE-bench Pro (64.3% vs GPT-5.4’s 57.7%), multi-agent coordination for hours-long workflows, 3x higher image resolution, and a 14% improvement in multi-step agentic reasoning with a third of the tool errors. The model is priced at $5/25 per million tokens and available across Claude plans and through Amazon Bedrock, Vertex AI, and Microsoft Foundry. The release comes at a time when Anthropic's commercial momentum is high, with a $30 billion annualised revenue rate and investor offers at roughly $800 billion. The most significant improvements may not be captured by any single benchmark, but by being the model that enterprises and developers choose to build on.

Read Original Article

5 Graphics Cards That Could Outperform PlayStation 5 Pro

The PS5 Pro's price increases and the PS5's graphics upgrade make competing cards more compelling, with AMD's AMD 9060 XT and NVIDIA's RTX 5060 Ti offering better value and upscaling capabilities.

Microsoft and Stellantis want to use AI to help car owners

Stellantis and Microsoft are partnering to develop AI-based digital services for cars, aiming to improve user interaction and cybersecurity, despite current advancements in touchscreens and driver assistance systems.

New Apple Intelligence features for iOS 27 found in hidden code

Hidden code reveals new features for Apple Intelligence, including Visual Intelligence, nutrition tracking, and creating digital versions of physical cards and other items, pending release in iOS 27.