Inference Efficiency — Glossary — ThinkLLM