Using optimal hyperparameters found at small scale to train larger models without expensive retuning.