Modifying a model's output probability distribution at inference time to satisfy constraints without changing the model's weights.