Inference Serving — Glossary — ThinkLLM