Sparse Reward — Glossary — ThinkLLM