Do Metrics for Counterfactual Explanations Align with User Perception? — ThinkLLM