A controlled experiment measuring how much an AI system improves human performance compared to working without it.