Models Capabilities Use Cases Benchmarks Papers Glossary

Models Capabilities Use Cases Benchmarks Papers Glossary

About Privacy Terms RSS

ThinkLLM

Spot an error in our data? Let us know.

Glossary/Attention

Attention

architecture

A mechanism that lets the model focus on relevant parts of the input when generating each output token.

Learn more on Wikipedia

Related Capabilities

Performance retention over long documents and conversations

Attention — Glossary — ThinkLLM