ThinkLLM

Sparse Mixture of Experts

architecture

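An architecture in which a learned router (gating network) activates only a small subset of the model's specialized sub-networks (experts) for each input, typically each token. Per-input computation therefore scales with the number of active experts rather than with the total parameter count, reducing computation while maintaining capability.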
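The routing idea can be illustrated with a minimal sketch in plain Python. It is an illustration under stated assumptions, not any particular model's implementation. Top-k selection with a softmax over the selected router scores is one common choice, the single-matrix tanh "experts" are stand-ins for real feed-forward blocks, and the names `top_k_route`, `sparse_moe_token`, `router_w`, and `experts` are hypothetical.

```python
import math
import random


def top_k_route(logits, k):
    """Pick the k highest-scoring experts and softmax-normalise their scores."""
    chosen = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    peak = max(logits[i] for i in chosen)  # subtract the max for numerical stability
    exps = [math.exp(logits[i] - peak) for i in chosen]
    total = sum(exps)
    return chosen, [e / total for e in exps]


def matvec(w, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in w]


def sparse_moe_token(x, router_w, experts, k=2):
    """Process one token vector x, running only k of the experts.

    router_w: one row of d_model weights per expert (hypothetical layout).
    experts:  list of d_model x d_model matrices, one per expert (toy stand-ins).
    """
    logits = matvec(router_w, x)           # one router score per expert
    chosen, gates = top_k_route(logits, k)
    out = [0.0] * len(x)
    for idx, gate in zip(chosen, gates):   # experts that were not chosen are never evaluated
        y = [math.tanh(v) for v in matvec(experts[idx], x)]
        out = [o + gate * y_i for o, y_i in zip(out, y)]
    return out, chosen, gates


if __name__ == "__main__":
    rng = random.Random(0)
    d_model, n_experts = 4, 8
    router_w = [[rng.gauss(0, 1) for _ in range(d_model)] for _ in range(n_experts)]
    experts = [[[rng.gauss(0, 1) for _ in range(d_model)] for _ in range(d_model)]
               for _ in range(n_experts)]
    token = [rng.gauss(0, 1) for _ in range(d_model)]
    out, chosen, gates = sparse_moe_token(token, router_w, experts, k=2)
    print("active experts:", chosen, "gate weights:", gates)
```

In this sketch only 2 of the 8 expert matrices are multiplied for the token, while all 8 still count toward the model's stored parameters. That gap between stored and active parameters is the source of the compute saving.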
