Think
LLM
Models
Capabilities
Use Cases
Benchmarks
Papers
Glossary
Search
/
Glossary
/
Inference-Time Reward Model
Inference-Time Reward Model
techniques
A model used during generation to score outputs without requiring retraining of the main system.
Inference-Time Reward Model — Glossary — ThinkLLM