Building next-gen audio-first experiences: Lessons from PlayStation

190
Следующее
Популярные
56 дней – 4 3683:59
Welcome to Bigtable
136 дней – 27 0753:41
Build with AI: A guide to Gemini CLI
Опубликовано 25 июня 2026, 16:22
Speech is the next frontier of multimodal interaction, transforming how users engage with technology. Move beyond basic voice commands and discover what it takes to deploy low-latency, audio-first agents at global scale. In this session, we’ll break down how PlayStation partnered with Google Cloud to redefine their player experience, evolving from standard voice tech to an intelligent system that achieved a massive reduction in operational costs while significantly increasing caption accuracy. We’ll dive deep into the technical architecture behind these results, demonstrating how Gemini Audio on Gemini Enterprise Agent Platform enables real-time multilingual speech-to-text (STT) support and hyper-natural text-to-speech (TTS) cues that improve accessibility for everyone. Whether you’re solving for immersive gaming or enterprise support, join us to get the production-ready blueprint for building voice interfaces that are faster, smarter, and more cost-effective.

Watch more: 100+ sessions from Google Cloud Next 26 → googlecloudevents.com/next-veg...
SSubscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

Speakers: Haris Ioannou, Golda James

BRK2-088
#googlecloudnext
автотехномузыкадетское