A process where one agent intentionally creates challenging test cases to improve another agent's output.