Models Capabilities Use Cases Benchmarks Papers Glossary



ThinkLLM



Draft Head

architecture

A small neural network component used in speculative decoding to quickly propose several candidate tokens, which the main model then verifies, keeping the candidates it agrees with and discarding the rest.
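The draft-then-verify loop can be illustrated with a minimal toy sketch. It is not any particular library's implementation: `target_next`, `draft_next`, `speculative_decode`, and `greedy_decode` are hypothetical names, both "models" are simple arithmetic rules standing in for neural networks, and verification uses greedy exact matching. Real systems verify all candidates in one batched forward pass of the main model and often use a probabilistic acceptance rule instead.

```python
from typing import List

Token = int


def target_next(context: List[Token]) -> Token:
    # Hypothetical "main model": a deterministic toy rule standing in
    # for an expensive forward pass.
    return (sum(context) * 3 + len(context)) % 10


def draft_next(context: List[Token]) -> Token:
    # Hypothetical "draft head": simulated as a cheap guess that agrees
    # with the main model except when the context length is a multiple of 4.
    guess = target_next(context)
    return (guess + 1) % 10 if len(context) % 4 == 0 else guess


def speculative_decode(prompt: List[Token], num_new_tokens: int, k: int = 3) -> List[Token]:
    out = list(prompt)
    goal = len(prompt) + num_new_tokens
    while len(out) < goal:
        # 1) Draft: propose k candidate tokens, each conditioned on the
        #    previous candidates.
        draft: List[Token] = []
        for _ in range(k):
            draft.append(draft_next(out + draft))

        # 2) Verify: accept candidates until the first disagreement.
        #    (Written as a loop here; a real main model would score all
        #    k positions in a single forward pass.)
        accepted: List[Token] = []
        for tok in draft:
            expected = target_next(out + accepted)
            if tok != expected:
                accepted.append(expected)  # replace the rejected candidate
                break
            accepted.append(tok)
        else:
            # Every candidate matched, so the main model adds one more token.
            accepted.append(target_next(out + accepted))

        out.extend(accepted)
    return out[:goal]


def greedy_decode(prompt: List[Token], num_new_tokens: int) -> List[Token]:
    # Baseline: one main-model call per generated token.
    out = list(prompt)
    for _ in range(num_new_tokens):
        out.append(target_next(out))
    return out


if __name__ == "__main__":
    print(speculative_decode([1, 2], 12, k=3))
    print(greedy_decode([1, 2], 12))
```

Under greedy verification, the speculative output matches what the main model would have produced on its own. The draft head affects only how many tokens are accepted per verification step, not which tokens are emitted.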
