Unified Policy Value Decomposition for Rapid Adaptation — ThinkLLM