Controlling model behavior during generation without retraining, by modifying inputs or intermediate computations.
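
As a minimal sketch of both routes, assume a toy two-layer network with fixed weights standing in for a trained model. The names `forward`, `steer`, `alpha`, `W_IN`, and `W_OUT` are hypothetical and chosen only for illustration. The weights are never updated; the output changes either because the input is edited or because a vector is added to the hidden activations during the forward pass.

```python
import math

def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def softmax(z):
    """Numerically stable softmax over a list of logits."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]

# Fixed toy weights standing in for a trained model (assumed values).
W_IN = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # 2 inputs -> 3 hidden units
W_OUT = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]]    # 3 hidden units -> 2 logits

def forward(x, steer=None, alpha=0.0):
    """Run the toy model; optionally add alpha * steer to the hidden layer."""
    h = [max(0.0, v) for v in matvec(W_IN, x)]  # ReLU hidden activations
    if steer is not None:
        # Intervention on an intermediate computation, no weight change.
        h = [hi + alpha * si for hi, si in zip(h, steer)]
    return softmax(matvec(W_OUT, h))

x = [1.0, 0.5]

# Unmodified run: the model prefers class 0.
baseline = forward(x)

# Route 1: modify the input before it reaches the model.
input_modified = forward([x[0], x[1] + 2.0])

# Route 2: modify the hidden activations with a steering vector.
steered = forward(x, steer=[0.0, 1.0, 0.0], alpha=2.0)

print("baseline:      ", baseline)
print("input modified:", input_modified)
print("steered:       ", steered)
```

In this toy setup both interventions flip the preferred class from 0 to 1 while `W_IN` and `W_OUT` stay untouched. In a real network the same idea is usually applied through prompt edits or hooks on intermediate layers, and the effect is typically less clean than here.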