Parallel Decoding
techniques
Generating multiple output tokens per decoding step, rather than one token at a time as in standard autoregressive decoding, to speed up inference.
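As an illustration only, the sketch below shows one way this idea can be realized: a Jacobi-style fixed-point iteration, in which every position of a block is re-predicted at once from the previous guess until the guess stops changing. The entry does not name a specific method, so the choice of technique is an assumption. The names `toy_next_token`, `sequential_decode`, and `jacobi_decode` are hypothetical, and the toy function stands in for a real greedy language model.

```python
from typing import Callable, List, Tuple

NextToken = Callable[[List[int]], int]


def toy_next_token(context: List[int]) -> int:
    """Hypothetical stand-in for a greedy model: a deterministic next token."""
    return (sum(context) * 31 + len(context)) % 50


def sequential_decode(next_token: NextToken, prompt: List[int], n: int) -> List[int]:
    """Baseline: produce n tokens one at a time, each conditioned on all earlier ones."""
    out = list(prompt)
    for _ in range(n):
        out.append(next_token(out))
    return out[len(prompt):]


def jacobi_decode(next_token: NextToken, prompt: List[int], n: int) -> Tuple[List[int], int]:
    """Refine a guess for all n tokens at once; return (tokens, passes used)."""
    guess = [0] * n  # arbitrary initial guess for the whole block
    for passes in range(1, n + 1):
        # Each position is re-predicted from the prompt plus the PREVIOUS guess.
        # With a real model, these n predictions would come from one batched
        # forward pass; the list comprehension here only simulates that.
        new_guess = [next_token(list(prompt) + guess[:i]) for i in range(n)]
        if new_guess == guess:
            return guess, passes  # fixed point reached: matches greedy output
        guess = new_guess
    # After k passes the first k tokens are final, so n passes always suffice.
    return guess, n


prompt = [3, 1, 4]
tokens, passes = jacobi_decode(toy_next_token, prompt, 8)
print(tokens == sequential_decode(toy_next_token, prompt, 8), passes)
```

Under these assumptions the parallel result equals the sequential one. Any speedup depends on how quickly the guesses stabilize: with this toy function the iteration may need about as many passes as there are tokens, so the sketch shows the mechanism and not a performance gain.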