Scaling gradient values to maintain consistent learning rates across different parameter groups or layers.