Models Capabilities Use Cases Benchmarks Papers Glossary



ThinkLLM



Risk Categories

behavior

Predefined groups of harmful content types (such as violence, hate speech, or misinformation) that a safety model is trained to recognize and flag.
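As a rough illustration, a risk-category taxonomy can be thought of as a fixed set of labels, with the safety model producing a score per label and a threshold deciding what gets flagged. The sketch below is a minimal example under stated assumptions: the `RiskCategory` enum, the `flag_categories` helper, the 0.5 threshold, and the example scores are all hypothetical and not taken from any particular safety model.

```python
from enum import Enum


class RiskCategory(Enum):
    # Hypothetical taxonomy; real safety models define their own category lists.
    VIOLENCE = "violence"
    HATE_SPEECH = "hate_speech"
    MISINFORMATION = "misinformation"


def flag_categories(scores, threshold=0.5):
    """Return the categories whose score meets the threshold, in enum order."""
    return [c for c in RiskCategory if scores.get(c, 0.0) >= threshold]


# Made-up scores standing in for the per-category output of a safety model.
example_scores = {
    RiskCategory.VIOLENCE: 0.91,
    RiskCategory.HATE_SPEECH: 0.12,
    RiskCategory.MISINFORMATION: 0.58,
}

flagged = flag_categories(example_scores)
print([c.value for c in flagged])  # ['violence', 'misinformation']
```

Keeping the categories in an enum means the set of labels is fixed in one place, which mirrors the "predefined" part of the definition; the threshold is an assumed tuning knob, and a real system might use a different one per category.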
