Models Capabilities Use Cases Benchmarks Papers Glossary

Models Capabilities Use Cases Benchmarks Papers Glossary

About Privacy Terms RSS

ThinkLLM

Spot an error in our data? Let us know.

Glossary/Self-Play

Self-Play

techniques

Training method where a model plays against itself or generates both solutions and evaluations, risking the model learning to exploit itself.

Self-Play — Glossary — ThinkLLM