How to evaluate agents in practice

11 121
10.2
Следующее
Популярные
21 день – 11 7611:41
Build AI employees with Gemini CLI
Опубликовано 12 декабря 2025, 17:01
Evaluating Agents with ADK → goo.gle/testagent

This video applies the theory of AI agent evaluation from our previous episode, guiding you through practical testing with Google's ADK. Learn about the 3-Tier Testing Pyramid, covering component level unit tests, trajectory level integration tests, and human review. Discover how to use ADK to design and run reliable, automated checks for your agents, ensuring they perform as expected in real world scenarios.

Chapters:
0:00 - Introduction to practical agent evaluation
1:05 - The 3-tier testing pyramid explained
1:15 - Tier 1: Component level unit tests
1:55 - Tier 2: Trajectory level integration tests
2:22 - Tier 3: End to end human review
3:14 - Agent Development Kit (ADK) in action
4:48 - [Demo] The 3-tier testing pyramid
9:23 - Summary and next steps

Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

#GoogleCloud #AIAgents #ADK

Speakers: Annie Wang
Products Mentioned: Agent Development Kit
автотехномузыкадетское