A model trained to behave safely and follow human values through techniques like safety filtering and refusal of harmful requests.