Storing intermediate features computed during inference so that later steps can reuse them instead of recomputing them, reducing redundant computation.
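
As a rough illustration of the idea, the sketch below memoizes features by key so that a repeated input in a later step reuses the stored result. The names `FeatureCache`, `expensive_features`, and `misses` are hypothetical and chosen only for this example; the feature function is a stand-in for a costly model computation, not any particular library's API.

```python
from typing import Callable, Dict, Hashable, List


class FeatureCache:
    """Hypothetical helper: stores computed features keyed by input."""

    def __init__(self, compute_fn: Callable[[Hashable], List[float]]) -> None:
        self._compute_fn = compute_fn
        self._store: Dict[Hashable, List[float]] = {}
        self.misses = 0  # how many times features were actually computed

    def get(self, key: Hashable) -> List[float]:
        # Compute only on the first request for this key; reuse afterwards.
        if key not in self._store:
            self.misses += 1
            self._store[key] = self._compute_fn(key)
        return self._store[key]


def expensive_features(token: str) -> List[float]:
    # Stand-in for a costly computation (assumption for illustration only).
    return [float(len(token)), float(sum(map(ord, token)) % 7)]


cache = FeatureCache(expensive_features)
steps = ["the", "cat", "the", "mat", "cat"]  # inputs seen over successive steps
outputs = [cache.get(tok) for tok in steps]

# Five steps, but only three distinct inputs were computed.
print(cache.misses)
```

In this sketch the cache grows without bound; a real system would likely need an eviction policy and a key that captures everything the features depend on.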