
ThinkLLM


Refusal Detection

behavior

The ability to identify when a model declines to answer a request. A refusal can indicate that the model recognized the prompt as harmful or unsafe.
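As a rough illustration of the definition above, the sketch below flags a response as a refusal when it contains a common declining phrase. The phrase list and the function name `is_refusal` are assumptions made for this example, not a method described by this glossary; practical detectors often rely on a trained classifier rather than fixed patterns.

```python
import re

# Hypothetical phrase patterns, chosen for illustration only.
REFUSAL_PATTERNS = [
    r"\bi(?: can(?:no|')t| won't| am unable to|'m unable to) "
    r"(?:help|assist|comply|provide|answer)\b",
    r"\bi'm sorry, but\b",
    r"\bi (?:must|have to) decline\b",
]

# Combine the patterns into one case-insensitive regular expression.
_REFUSAL_RE = re.compile("|".join(REFUSAL_PATTERNS), re.IGNORECASE)


def is_refusal(response: str) -> bool:
    """Return True if the response contains one of the refusal phrases."""
    # Normalize curly apostrophes so "can’t" and "can't" match alike.
    text = response.replace("\u2019", "'")
    return _REFUSAL_RE.search(text) is not None


if __name__ == "__main__":
    for reply in ["I can't help with that request.",
                  "Paris is the capital of France."]:
        print(reply, "->", is_refusal(reply))
```

Phrase matching of this kind is cheap but brittle: it misses refusals worded differently and can misfire on ordinary sentences such as "I can't help but smile", so its output is best treated as a first-pass signal.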
