Google Launches Gemini 3.1 Flash TTS with Enhanced Emotional Expression and Multi-Speaker Capabilities

Gate News message, April 17 — Google unveiled Gemini 3.1 Flash TTS, an advanced text-to-speech model with enhanced emotional expression and control features, on April 15. The new model will be rolled out progressively through developer APIs, enterprise Vertex AI, and collaboration tools.

The model’s core capabilities include natural language-based audio tags for fine-tuning speed, intonation, and emotion, plus a “Director Mode” for specifying scenes and character roles to generate more nuanced voice outputs. A multi-speaker feature enables simultaneous dialogue generation, allowing more natural conversation flows suitable for podcasts, audio content, and AI assistants. The model supports over 70 languages and dialects, reflecting regional accents and expressions for localized voice experiences globally.

Google emphasized performance and cost efficiency, achieving high scores on blind human evaluation benchmarks while reducing computational costs through its Flash architecture—designed for large-scale enterprise adoption. Generated audio includes SynthID watermarking to identify AI-generated content and combat misinformation.

The move reflects intensifying competition in voice interfaces. OpenAI is combining real-time voice features with conversational AI for human-like interactions, while Meta is expanding investments in AI characters with voice-based social experiences. Industry observers note that while high-level acting and creative work may remain human-driven for now, repetitive and large-scale production markets could see gradual AI adoption in dubbing, advertising, and audiobook sectors.

Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.

Related Articles

Data reveals that “Claude got less intelligent” isn’t an urban legend; an AI model’s instability can become a corporate risk.

The article discusses the phenomenon of unstable performance of LLMs (large language models) in real-world applications by AI companies, calling it “de-intelligence,” and provides examples to illustrate its real impact on business workflows. Data shows that most mainstream models are currently in a degraded state, affecting companies’ productivity and stability. Companies need to start treating model stability as a new standard; otherwise, they will face infrastructure risks.

ChainNewsAbmedia19m ago

OpenAI Updates Codex to AI Agent That Controls Desktop, Automating Development Workflows

OpenAI's upgraded Codex evolves from a coding assistant to an autonomous agent for desktop environments, capable of managing applications, automating workflows, and integrating with over 100 apps. This shift enhances task continuity and workflow automation, reflecting a competitive landscape in AI coding tools.

GateNews22m ago

Google Integrates AI Search into Chrome, Enabling Conversational Web Browsing

Google is enhancing Chrome with AI-powered search, allowing conversational browsing and context-aware responses. The new functionality also features multi-tab integration, improving user experience for various tasks by consolidating open tabs and providing tailored information.

GateNews52m ago

Shinsegae Group Abandons OpenAI Collaboration for Reflection AI Partnership, Shifts Retail Strategy

Shinsegae Group has halted its partnership with OpenAI, opting for an expanded collaboration with Reflection AI to enhance AI in retail operations. This decision aims to streamline efforts and address concerns about AI commerce effectiveness.

GateNews1h ago

OpenAI and Google Add Support for HWP Format, Hancom Seeks Valuation Rebound

OpenAI's ChatGPT now supports HWP and HWPX file formats, enabling Korean users to upload documents directly for analysis without conversion. This enhances usability for local businesses and could boost Hancom's stock recovery amidst recent declines.

GateNews1h ago

Google Removes 175.5M Ads in South Korea Using AI Enforcement, Suspends 326K Advertiser Accounts

In 2025, Google removed 175.5 million violating ads in South Korea using AI, suspended 326k accounts, and faced a $50 million fine for privacy violations, highlighting a trend of increasing enforcement and AI's role in combating ad fraud.

GateNews1h ago
Comment
0/400
No comments