VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models — ThinkLLM