A technique that compresses a large, complex model into a smaller one by training the smaller model to mimic the larger model's behavior.