The agent-quality flywheel: Using Gemini Enterprise Agent Platform evaluations to optimize agents

242
Опубликовано 25 июня 2026, 16:24
Treating agent quality as a rigorous engineering discipline is the only way to scale. Stop guessing and start measuring. Join this session for a deep dive into the state-of-the-art techniques Google uses to build our own agents, and learn how to make them a part of your process. We’ll demonstrate how you can adopt the “Quality Flywheel” methodology, which includes bootstrapping effective offline evaluation with synthetic test generation, using LLM-as-a-judge autoraters and trajectory evaluations, performing user and environment simulation, identifying systemic failures in production with multi-turn autoraters and loss-pattern clustering, aligning test coverage with actual usage, and using automated optimization capabilities to scientifically refine performance. Ramp up with confidence.

Watch more: 100+ sessions from Google Cloud Next 26 → googlecloudevents.com/next-veg...
Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

Speakers: Dima Melnyk, Alex Martin, Daniel Lewis

BRK3-023
#GoogleCloudNext
Случайные видео
204 дня – 10 1310:11
You Never Know! 🤞
260 дней – 871 6411:20
Xiaomi 15T Series | Xiaomi HyperAI
26.04.25 – 498 2110:32
PC Watercool with a Firetruck
автотехномузыкадетское