o3

Name: o3
Author: OpenAI

API: Available through a hosted API; pay per token, no self-hosting required

Released: April 2025

Context: 200K tokens (≈ 150,000 words)
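The context figure above implies a ratio of roughly 0.75 words per token (150,000 words / 200,000 tokens). As a rough sketch of that arithmetic, the snippet below estimates whether a document of a given word count would fit in the window. The helper names `estimate_tokens` and `fits_in_context` and the `reserved_tokens` parameter are hypothetical, introduced here for illustration only. Real token counts depend on the tokenizer and the text, so treat the result as an approximation.

```python
import math

# Figures taken from the card: 200K-token context, about 150,000 words.
CONTEXT_TOKENS = 200_000
APPROX_WORDS = 150_000

# Implied ratio (an approximation for English prose, not a tokenizer guarantee).
WORDS_PER_TOKEN = APPROX_WORDS / CONTEXT_TOKENS  # 0.75


def estimate_tokens(word_count: int) -> int:
    """Rough token estimate for a text of `word_count` words (hypothetical helper)."""
    return math.ceil(word_count / WORDS_PER_TOKEN)


def fits_in_context(word_count: int, reserved_tokens: int = 0) -> bool:
    """Check whether the text plus a reserved token budget fits in the window.

    `reserved_tokens` is an assumed allowance for instructions and the reply.
    """
    return estimate_tokens(word_count) + reserved_tokens <= CONTEXT_TOKENS


if __name__ == "__main__":
    print(estimate_tokens(3_000))
    print(fits_in_context(90_000, reserved_tokens=100_000))
```

Under this estimate a 90,000-word document needs about 120,000 tokens, so reserving a further 100,000 tokens would exceed the window.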

o3 thinks before it speaks — literally. It runs extended internal reasoning chains before producing a response, which makes it noticeably slower but significantly more reliable on problems requiring multi-step logic, mathematics, or careful deduction. It handles ambiguous or hard problems by working through them rather than pattern-matching to a quick answer, though that deliberation comes at a higher compute cost.

Capabilities

Capability scores are AI-generated based on model documentation, benchmarks, and technical specifications.

Reasoning & Logic

Exceptional

Instruction Following

Benchmark Scores

Benchmark	Score	Type	Recorded
IFBench	69.3	prompt_level_loose_accuracy	1mo ago
LCR	69.3	pass@1_accuracy	1mo ago
SWE-Bench	69.1	accuracy	1mo ago
AIME 2024	91.6	accuracy	1mo ago
AIME 2025	88.9	accuracy	1mo ago
Aider Polyglot	76.9	accuracy	1mo ago

Glossary

Multi-Step Logic: The ability to break down complex problems into sequential reasoning steps and correctly combine them to reach a solution.

Reasoning: The model's ability to work through multi-step logical problems and provide justified answers rather than just pattern-matching.

Reasoning Chains: A sequence of logical steps a model follows to work through a problem methodically rather than jumping directly to an answer.
