Preference-Based Fine-tuning — Glossary — ThinkLLM