DeepSeek R1

Name: DeepSeek R1
Author: DeepSeek

Open Weight: Model weights are publicly available and can be downloaded and self-hosted.

Released: January 2025
Context: 164K (≈ 122,880 words)
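The word figure is consistent with a common rule of thumb rather than an exact count. As a rough sketch, assuming the 164K context means 163,840 tokens (160 × 1024) and that one token averages about 0.75 English words, the arithmetic works out as follows. Both the token count and the ratio are assumptions made for illustration; the page itself states only "164K" and "122,880 words".

```python
def estimate_words(context_tokens: int, words_per_token: float = 0.75) -> int:
    """Estimate how many English words fit in a context window.

    The 0.75 words-per-token ratio is an assumed rule of thumb,
    not a property of any particular tokenizer.
    """
    return round(context_tokens * words_per_token)


# Assumption: "164K" is 160 * 1024 = 163,840 tokens.
CONTEXT_TOKENS = 160 * 1024

print(estimate_words(CONTEXT_TOKENS))  # 122880
```

Real token-to-word ratios vary with language and content (code and non-English text usually tokenize less efficiently), so treat the result as an upper-level estimate only.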

R1 thinks out loud — it works through problems step by step, showing its reasoning chain before arriving at an answer. This makes it particularly transparent on math, logic, and coding tasks, where you can follow (and verify) its work. The trade-off is verbosity: responses are often long, and the model can over-deliberate on simple questions.
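Because the reasoning chain precedes the answer, applications often want to separate the two, for example to log the reasoning but display only the final result. The sketch below assumes the raw output wraps its reasoning in `<think>...</think>` tags, as the open-weight R1 releases typically do when self-hosted; hosted APIs may instead return the reasoning in a separate field. The function name `split_reasoning` and the sample string are hypothetical.

```python
import re

# Assumption: reasoning is wrapped in a single <think>...</think> block.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(raw: str) -> tuple[str, str]:
    """Return (reasoning, answer) from a raw model response.

    If no <think> block is found, the reasoning is empty and the
    whole response is treated as the answer.
    """
    match = THINK_BLOCK.search(raw)
    if match is None:
        return "", raw.strip()
    reasoning = match.group(1).strip()
    # Everything outside the think block is treated as the answer.
    answer = (raw[:match.start()] + raw[match.end():]).strip()
    return reasoning, answer


sample = "<think>2 + 2 is 4.</think>\nThe answer is 4."
reasoning, answer = split_reasoning(sample)
print(reasoning)  # 2 + 2 is 4.
print(answer)     # The answer is 4.
```

Keeping the reasoning out of the displayed answer is one way to manage the verbosity noted above while still retaining the chain for verification.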

Capabilities

Capability scores are AI-generated based on model documentation, benchmarks, and technical specifications.

Reasoning & Logic

Exceptional

Coding

Benchmark Scores

Benchmark	Score	Type	Recorded
AIME 2024	79.8	accuracy	1mo ago
IFBench	38.0	prompt_level_loose_accuracy	1mo ago
SciCode	4.6	main_problem_pass@1	1mo ago
LiveCodeBench	65.9	accuracy	1mo ago
AIME 2025	87.5	accuracy	1mo ago
SWE-Bench	49.2	accuracy	1mo ago
Aider Polyglot	56.9	accuracy	1mo ago

Glossary

Reasoning: The model's ability to work through multi-step logical problems and provide justified answers rather than just pattern-matching.

Reasoning Chain: A step-by-step explanation of how a model arrives at an answer, showing its intermediate thinking before the final result.
