A model that activates only a subset of its parameters for each input rather than all of them, reducing the computation needed per input.
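As a rough illustration, one common way to get this behaviour is top-k expert routing: a small router scores several expert sub-networks and only the best-scoring few are evaluated for a given input. The definition does not name a specific mechanism, so this is a minimal sketch under that assumption; `make_expert`, `sparse_forward`, and the sizes `DIM`, `NUM_EXPERTS`, and `K` are hypothetical names chosen for the example.

```python
import math
import random


def make_expert(rng, dim):
    """Build a hypothetical expert: a dim x dim weight matrix with random entries."""
    return [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(dim)]


def matvec(matrix, vec):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


def sparse_forward(x, experts, router, k):
    """Evaluate only the k highest-scoring experts for x and mix their outputs."""
    scores = matvec(router, x)  # one routing score per expert
    top = sorted(range(len(experts)), key=lambda i: scores[i], reverse=True)[:k]

    # Softmax over the selected scores only, so the gate weights sum to 1.
    peak = max(scores[i] for i in top)
    exps = [math.exp(scores[i] - peak) for i in top]
    total = sum(exps)
    gates = [e / total for e in exps]

    # Experts outside `top` are never evaluated; their parameters stay inactive.
    out = [0.0] * len(x)
    for gate, i in zip(gates, top):
        y = matvec(experts[i], x)
        out = [o + gate * yi for o, yi in zip(out, y)]
    return out, top


rng = random.Random(0)  # fixed seed so the run is repeatable
DIM, NUM_EXPERTS, K = 4, 8, 2  # assumed sizes for illustration

experts = [make_expert(rng, DIM) for _ in range(NUM_EXPERTS)]
router = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]

x = [0.5, -1.0, 0.25, 2.0]
output, used = sparse_forward(x, experts, router, K)

active_fraction = K / NUM_EXPERTS
print(f"experts used: {used}; fraction of expert parameters active: {active_fraction}")
```

With these assumed sizes, each input touches 2 of 8 experts, so only a quarter of the expert parameters take part in the computation, while all of them still have to be stored.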