A safety training approach that guides a model to behave according to a set of principles or rules, helping it generate more helpful and harmless responses.