Reinforcement Learning with Verifiable Rewards (RLVR) — Glossary — ThinkLLM