A collection of test scenarios for evaluating model safety, tailored to a specific language, sector, or regulatory regime.