A training technique where a model learns to maintain performance even when its weights are compressed to use less memory and compute.