Grok 3

Name: Grok 3
Author: xAI

by xAI · Grok

API: Available through a hosted API; pay per token, no self-hosting required

131K context (≈ 98,304 words)
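The word figure above appears to follow from a simple conversion. The sketch below reproduces it under two assumptions that the page does not state: that "131K" means 131,072 tokens (128 × 1024), and that one token averages about 0.75 English words. The helper names are illustrative only and are not part of any xAI API.

```python
# Assumptions (not stated in the source): "131K" = 131,072 tokens,
# and one token averages roughly 0.75 English words.
CONTEXT_TOKENS = 128 * 1024
WORDS_PER_TOKEN = 0.75


def tokens_to_words(tokens: int, words_per_token: float = WORDS_PER_TOKEN) -> int:
    """Estimate how many English words fit in a given token budget."""
    return round(tokens * words_per_token)


def fits_in_context(word_count: int, reserved_output_tokens: int = 0) -> bool:
    """Rough check that a text of `word_count` words fits in the window,
    after setting aside some tokens for the model's reply."""
    available = CONTEXT_TOKENS - reserved_output_tokens
    return word_count <= tokens_to_words(available)


print(tokens_to_words(CONTEXT_TOKENS))  # 98304
```

The ratio is only a rule of thumb: code, non-English text, and unusual formatting typically consume more tokens per word, so the practical capacity may be lower than the headline figure.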

Grok 3 has a reputation for directness and a willingness to engage with edgy or unconventional questions that other models tend to sidestep. It leans into reasoning-heavy tasks and handles complex analytical problems with notable depth. The trade-off is that its personality can feel more unfiltered than polished, which suits some workflows and jars others.

Capabilities

Capability scores are AI-generated based on model documentation, benchmarks, and technical specifications.

Reasoning & Logic: Exceptional

Tool Use: Strong

Long Context

Benchmark Scores

Benchmark	Score	Type	Recorded
AIME 2025	86.7	accuracy	1mo ago
Aider Polyglot	53.3	accuracy	1mo ago
AIME 2024	93.3	accuracy	1mo ago

Use Case Fit

Fit scores are AI-generated based on model capabilities, intended use, and technical specifications.

Glossary