Systematic evaluation of changes in an agent's behavior over time, to detect shifts toward undesired traits or capabilities.
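
One way to make this concrete is to compare per-trait rates between a baseline evaluation run and a later one. The following is a minimal sketch, not a definitive method: the function name `detect_shifts`, the trait labels, the example rates, and the 0.05 threshold are all hypothetical, and it assumes each rate is the fraction of evaluation prompts on which the trait was observed.

```python
from typing import Dict, List


def detect_shifts(
    baseline: Dict[str, float],
    current: Dict[str, float],
    threshold: float = 0.05,  # assumed tolerance, chosen for illustration only
) -> List[str]:
    """Return the traits whose observed rate rose by more than `threshold`."""
    flagged = []
    for trait, base_rate in baseline.items():
        # A trait missing from the current run is treated as unchanged.
        delta = current.get(trait, base_rate) - base_rate
        if delta > threshold:
            flagged.append(trait)
    return sorted(flagged)


# Hypothetical rates: fraction of evaluation prompts showing each trait.
baseline = {"deception": 0.02, "power_seeking": 0.01, "sycophancy": 0.10}
current = {"deception": 0.03, "power_seeking": 0.09, "sycophancy": 0.12}

print(detect_shifts(baseline, current))
```

A fixed threshold on the raw difference is the simplest possible choice; a real evaluation would likely also account for sample size and run-to-run noise before flagging a shift.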