Inference Latency — Glossary — ThinkLLM