Think
LLM
Models
Capabilities
Use Cases
Benchmarks
Papers
Glossary
Search
/
Glossary
/
Misalignment
Misalignment
techniques
When an AI model's goals or behaviors diverge from the intended goals of its creators or users.
Misalignment — Glossary — ThinkLLM