An AI model that understands spoken input and generates spoken responses for interactive conversations.