Gemini 3.1 Flash TTS now supports granular audio tags that give precise control over expressive speech generation. Developers can use these tags to direct specific vocal nuances and emotional tones. Although the feature is an incremental improvement, it gives Google DeepMind users more direct control over synthetic voice output in specialized applications.
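
To make the idea of tagging concrete, here is a minimal sketch of how a developer might assemble a transcript with inline audio tags before sending it to a speech model. The announcement does not specify the tag syntax, so the square-bracket format, the tag names (`whispers`, `excited`), and the helper `build_tagged_transcript` are all assumptions made for illustration. No API call is shown, because the request format is not described here.

```python
from typing import Iterable, Optional, Tuple

# A segment pairs an optional audio tag with the text it applies to.
Segment = Tuple[Optional[str], str]


def build_tagged_transcript(segments: Iterable[Segment]) -> str:
    """Join text segments, prefixing each tagged one with "[tag]".

    The bracketed syntax is a hypothetical placeholder, not a
    documented format; adjust it to whatever the model expects.
    """
    parts = []
    for tag, text in segments:
        text = text.strip()
        if not text:
            continue  # skip empty segments
        if tag:
            parts.append(f"[{tag.strip()}] {text}")
        else:
            parts.append(text)
    return " ".join(parts)


# Hypothetical tag names, chosen only to illustrate the idea.
transcript = build_tagged_transcript([
    (None, "Welcome back."),
    ("whispers", "I have a secret to share."),
    ("excited", "We just shipped the update!"),
])
print(transcript)
```

Keeping tags paired with their segments as data, rather than hand-writing them into one long string, makes it easier to swap the tag syntax later or to validate tag names against whatever list the model actually supports.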