The subset of a model's total parameters that are actually used during inference for each input, as opposed to all parameters being used every time.