Llama 3.1 405B Instruct

Name: Llama 3.1 405B Instruct
Author: Meta AI

by Meta AILlama 3.1

Open WeightModel weights are publicly available — can be downloaded and self-hosted

Released July 2024131K context≈ 98,304 words405B params

The heavyweight of the open-weight world, this model brings frontier-level reasoning and instruction-following to teams willing to run their own infrastructure. It handles complex multi-step problems, nuanced writing, and long-context tasks with notable coherence, though its sheer size means deployment requires serious compute resources. Think of it as the colleague who can do almost anything but needs a large desk to work at.

Capabilities

Capability scores are AI-generated based on model documentation, benchmarks, and technical specifications. Learn more

Reasoning & Logic

Exceptional

Instruction Following

Llama 3.1 405B Instruct

by Meta AILlama 3.1

Open WeightModel weights are publicly available — can be downloaded and self-hosted

Released July 2024131K context≈ 98,304 words405B params

The heavyweight of the open-weight world, this model brings frontier-level reasoning and instruction-following to teams willing to run their own infrastructure. It handles complex multi-step problems, nuanced writing, and long-context tasks with notable coherence, though its sheer size means deployment requires serious compute resources. Think of it as the colleague who can do almost anything but needs a large desk to work at.

Capabilities

Capability scores are AI-generated based on model documentation, benchmarks, and technical specifications. Learn more

Reasoning & Logic

Exceptional

Instruction Following

Benchmark Scores

Benchmark	Score	Type	Recorded
SciCode	1.5	main_problem_pass@1	1mo ago
WinoGrande	87.2	accuracy	1mo ago
HellaSwag	88.5	accuracy	1mo ago
ARC-Challenge	95.0	accuracy	1mo ago
GSM8K	96.0	accuracy	1mo ago
TruthfulQA	65.3	accuracy	1mo ago

Glossary

CoherenceThe quality of maintaining consistent meaning and logical flow across multiple sentences or exchanges in a conversation.Instruction-FollowingThe ability of a model to understand and execute specific tasks or commands given in natural language prompts.Long-ContextThe ability of a model to process and understand very long sequences of text while maintaining coherence across distant parts of the input.ReasoningThe model's ability to work through multi-step logical problems and provide justified answers rather than just pattern-matching.

Capabilities

Capabilities

Benchmark Scores

Use Case Fit

Glossary