Claude Opus 4 tackles complex, multi-step reasoning with patience and thoroughness: it tends to think through problems carefully rather than jump to quick answers. It handles nuanced writing, intricate analysis, and difficult conceptual questions with notable depth, though this deliberateness makes it slower and more expensive to run than lighter models. Its tone is measured and thoughtful, favoring precision over brevity.

| Benchmark | Score | Type | Recorded |
|---|---|---|---|
| TerminalBench | 39.2 | accuracy | 5d ago |
| SWE-Bench | 72.5 | accuracy | 5d ago |
| TAU2 | 59.6 | accuracy | 5d ago |
| Aider Polyglot | 72.0 | accuracy | 5d ago |
| AIME 2025 | 75.5 | accuracy | 5d ago |