A compact text-to-speech focused model from Boson AI, weighing in at 4 billion parameters. It operates on text input and produces text output, suggesting it may handle TTS-related scripting, phoneme processing, or speech synthesis pipelines rather than raw audio directly. Details about its specific performance characteristics and language coverage are limited.