The agent-quality flywheel: Using Gemini Enterprise Agent Platform evaluations to optimize agents

242

Google Cloud Platform1.38 млн

Следующее

4 часа – 20044:03

Beyond the hype: Orchestrating end-to-end developer workflows with agents

Популярные

7 дней – 5 1258:50

7 tips for using Antigravity 2.0 on enterprise codebases, coding phase

6 дней – 2 9854:27

Governed agents: Looker + MCP explained

Опубликовано 25 июня 2026, 16:24

Treating agent quality as a rigorous engineering discipline is the only way to scale. Stop guessing and start measuring. Join this session for a deep dive into the state-of-the-art techniques Google uses to build our own agents, and learn how to make them a part of your process. We’ll demonstrate how you can adopt the “Quality Flywheel” methodology, which includes bootstrapping effective offline evaluation with synthetic test generation, using LLM-as-a-judge autoraters and trajectory evaluations, performing user and environment simulation, identifying systemic failures in production with multi-turn autoraters and loss-pattern clustering, aligning test coverage with actual usage, and using automated optimization capabilities to scientifically refine performance. Ramp up with confidence.

Watch more: 100+ sessions from Google Cloud Next 26 → googlecloudevents.com/next-veg...
Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

Speakers: Dima Melnyk, Alex Martin, Daniel Lewis

BRK3-023
#GoogleCloudNext

Свежие видео