New Delhi, Nov. 25 -- Soon after the GPT-5.1 and Gemini 3 launch, Anthropic has launched its Claude Opus 4.5 model. The AI startup claims that its new model is the best in the world for coding, agents, and computer use related tasks.

Where does it rank?

Claude Opus 4.5 achieves 80.9% score on SWE-bench Verified, a real-world software engineering benchmark. Notably, Opus 4.5 is the first ever model to breach the 80% mark on SWE-bench Verified. In comparison, Google's newly released Gemini 3 Pro got a score of 76.2% while OpenAI's GPT-5.1 Codex Max got a score of 77.9%.

The new model also ranks higher than any human candidate on Anthropic's 2-hour time limit which is given to prospective performance engineering candidates.

"The take-hom...