Stepwise Credit Assignment for GRPO on Flow-Matching Models — ThinkLLM