Multi-Token Prediction — Glossary — ThinkLLM