A specialized model or component that evaluates user inputs and model outputs and filters them, so that harmful requests are blocked before generation and harmful content is stopped before it reaches users.
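
The definition above describes two checkpoints, one on the input and one on the output. The following is a minimal sketch of that arrangement, not a real safety system: the blocklist `BLOCKED_TERMS`, the `check_text` evaluator, the `Verdict` type, and the `guarded_generate` wrapper are all hypothetical names introduced for illustration. In practice a trained classifier would likely stand in for the keyword match.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical blocklist standing in for a real safety classifier.
BLOCKED_TERMS = ("build a bomb", "steal a password")

REFUSAL = "Sorry, I can't help with that."


@dataclass
class Verdict:
    allowed: bool
    reason: str = ""


def check_text(text: str) -> Verdict:
    """Toy evaluator: flag text containing any blocked term (case-insensitive)."""
    lowered = text.lower()
    for term in BLOCKED_TERMS:
        if term in lowered:
            return Verdict(False, f"matched blocked term: {term!r}")
    return Verdict(True)


def guarded_generate(prompt: str, generate: Callable[[str], str]) -> str:
    """Check the input, call the model, then check the output."""
    if not check_text(prompt).allowed:
        return REFUSAL  # input filter: the model is never called
    reply = generate(prompt)
    if not check_text(reply).allowed:
        return REFUSAL  # output filter: the reply never reaches the user
    return reply
```

Here `generate` is any callable that maps a prompt to a reply, so the same wrapper could sit in front of different models. Checking both sides matters because a harmless-looking prompt can still produce a harmful reply.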