Models Capabilities Use Cases Benchmarks Papers Glossary

Models Capabilities Use Cases Benchmarks Papers Glossary

About Privacy Terms RSS

ThinkLLM

Spot an error in our data? Let us know.

Glossary/Latency

Latency

performance

The time delay between sending a request and receiving the first response token from a model.

Learn more on Wikipedia

Latency — Glossary — ThinkLLM