Emotions, Previews, and Natural Conversations
Control speaker emotions with tags, preview voices instantly, and enjoy more natural conversation pacing.
What's new?
We've introduced several updates to give you more control over performance and improve the natural flow of your podcasts.
- Emotion Tags: You can now direct the speaker's tone. Add tags like [laughter], [sighs], [breaths], or specific emotions like (angry), (whispering), or (excited) directly in the script.
- Voice Previews: Listen to immediate samples of any voice in the selector before generating full audio.
- Natural Pauses: We've tuned the audio generation to include more natural, dynamic pauses between speakers, eliminating robotic silence.
- New Default Voices: Meet Kyle and Tessa, our new default hosts optimized for realistic, engaging delivery.
- Multilingual Support: All voices are now multilingual. You can write scripts in any supported language using any voice you choose.
Why it matters
- Better Performance Control: Emotion tags allow you to add nuance to key moments, preventing flat delivery in dramatic or humorous lines.
- Faster Workflow: Voice previews save you time and credits by letting you audition voices instantly without generating a full line.
- More Human Audio: The improved pacing and natural pauses make conversations feel less like a readout and more like a real interaction.
Getting started
To use emotion tags, simply type them into your script text (e.g., "(excited) I can't believe it!"). For voice previews, look for the play button in the voice selector.
— Stan