Granular audio tags now allow precise control over expressive speech generation in Gemini 3.1 Flash TTS. This update targets specific vocal nuances rather than relying on generic prompts. Developers can now direct AI inflection with higher accuracy. It is an incremental improvement for Google DeepMind's audio toolkit, focusing on fine-grained user control.