Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training — ThinkLLM