Think
LLM
Models
Capabilities
Use Cases
Benchmarks
Papers
Glossary
Search
/
Glossary
/
Latency
Latency
performance
The time delay between sending a request and receiving the first response token from a model.
Learn more on Wikipedia
Latency — Glossary — ThinkLLM