Using natural language descriptions to control non-content aspects of speech like tone, emotion, or prosody.