The learned numerical values in a model, such as its weights and biases. More parameters generally mean more capacity but higher compute cost.
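As a rough illustration of how parameter counts grow, the sketch below counts the weights and biases in a small fully connected network. The function name `count_dense_parameters` and the layer sizes are hypothetical, chosen only for this example, and it assumes every layer is a dense layer with a bias term.

```python
def count_dense_parameters(layer_sizes):
    """Count weights and biases in a fully connected network.

    layer_sizes lists the width of each layer, input first
    (hypothetical helper, assumes dense layers with biases).
    """
    total = 0
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        total += n_in * n_out  # one weight per input-output pair
        total += n_out         # one bias per output unit
    return total


# Illustrative sizes: widening the hidden layer from 128 to 512 units
small = count_dense_parameters([784, 128, 10])
large = count_dense_parameters([784, 512, 10])
print(small, large)
```

Under these assumptions, widening the hidden layer fourfold raises the parameter count roughly fourfold, and the multiply-accumulate work per forward pass grows with it.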