Claude 3.5 Sonnet

Name: Claude 3.5 Sonnet
Author: Anthropic

by AnthropicClaude

APIAvailable through a hosted API — pay per token, no self-hosting required

Released June 2024200K context≈ 150,000 words

Claude 3.5 Sonnet operates like a sharp generalist who rarely needs to be asked twice — it follows nuanced instructions carefully, writes with clarity and natural tone, and handles complex reasoning without losing the thread. It sits in a practical middle ground: more capable than lighter models on multi-step tasks, without the latency of heavier ones. Occasionally cautious in sensitive areas, but consistent and reliable across a wide range of everyday tasks.

Capabilities

Capability scores are AI-generated based on model documentation, benchmarks, and technical specifications. Learn more

Reasoning & Logic

Exceptional

Coding

Claude 3.5 Sonnet

by AnthropicClaude

APIAvailable through a hosted API — pay per token, no self-hosting required

Released June 2024200K context≈ 150,000 words

Claude 3.5 Sonnet operates like a sharp generalist who rarely needs to be asked twice — it follows nuanced instructions carefully, writes with clarity and natural tone, and handles complex reasoning without losing the thread. It sits in a practical middle ground: more capable than lighter models on multi-step tasks, without the latency of heavier ones. Occasionally cautious in sensitive areas, but consistent and reliable across a wide range of everyday tasks.

Capabilities

Capability scores are AI-generated based on model documentation, benchmarks, and technical specifications. Learn more

Reasoning & Logic

Exceptional

Coding

Benchmark Scores

Benchmark	Score	Type	Recorded
SciCode	4.6	main_problem_pass@1	1mo ago
SWE-Bench	49.0	accuracy	1mo ago
GSM8K	96.4	accuracy	1mo ago
Aider Polyglot	51.6	accuracy	1mo ago
TAU2	46.0	accuracy	1mo ago

Glossary

Complex ReasoningThe ability to work through multi-step problems, analyze nuanced information, and draw logical conclusions.LatencyThe time delay between sending a request and receiving the first response token from a model.Multi-Step TasksProblems or workflows that require a model to perform multiple sequential operations or reasoning steps to reach a final answer.ReasoningThe model's ability to work through multi-step logical problems and provide justified answers rather than just pattern-matching.

Capabilities

Capabilities

Benchmark Scores

Use Case Fit

Glossary