последние час день неделя месяц всё

How to build an accuracy pipeline for your AI app

4 459

11

Google Cloud Platform1.32 млн

Следующее

5 дней – 4 36315:58

How to build a data agent with BigQuery and CloudSQL

Популярные

59 дней – 11 2474:50

How to build context systems for AI agents

93 дня – 13 7487:39

Connecting ADK Agents to MCP Servers

Опубликовано 2 февраля 2026, 17:00

Evaluating agents with ADK code lab → goo.gle/3NRVhSB
Evaluating single LLM outputs With Vertex AI evaluation code lab → goo.gle/4jYfYZ0

Large language models (LLMs) are great but sometimes they go rogue. How can developers gain real confidence in their AI systems while in production? Join Aja and Jason as they demonstrate how to implement an 'accuracy pipeline' using LLMs as your ultimate grading rubric, treating each evaluation prompt like a shiny new unit test.

Chapters:
0:00 - Intro
0:48 - What is hallucination?
2:35 - Testing AI answers for accuracy
5:43 - Offline evaluation
8:53 - Summary

More resources:
Agent evaluation in Vertex AI Gen AI evaluation service → goo.gle/3M0l3Dw

Watch more Real Terms for AI → goo.gle/AIwordsExplained
🔔 Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

#GoogleCloud #AIInfrastructure

Speaker: Aja Hammerly, Jason Davenport
Products Mentioned: AI Infrastructure

Свежие видео

12 дней – 37 8060:15

Display | What’s next?: Wow | Samsung

12 дней – 14 385 0600:53

I Spilled Liquid on my PC...now what?

12 дней – 10 1211:38

From Discovery to Delivery | New Ways to Shop with Google

16 дней – 14759:07

Breaking the migration deadlock: What to do with your VMware workloads | Amazon Web Services

17 дней – 8162:56

Resolve crises and break technical deadlocks using Gemini in the Gmail side panel

17 дней – 56 9791:08

If You Own a Pool, Watch This

Случайные видео

25 дней – 48 7200:15

Cover screen preview | Galaxy Z Fold7 | Samsung

129 дней – 7752:56

Making VIVE AI work seamlessly with Gemini

02.07.24 – 16 5553:24

How to import and use custom shapes in Microsoft Visio for the web

16.11.23 – 5 5411:45

AMD Radeon™ PRO W7600 up to 43% faster in SOLIDWORKS® than NVIDIA

18.12.22 – 7 3875:41

PC Build Guide – How to Choose Graphics Cards and Drives – DIY in 5 Ep 187

08.03.07 – 55 56751:48

Eric Schmidt at the Morgan Stanley Technology Conference

8 часов – 4440:48

Agent Mode in Excel: Turn Sales Data Into an Exec Dashboard

1 день – 53 13415:12

If You’re Going To Buy It, MAKE It Your Own

6 дней – 1925:28

Going Deep: AWS + NFL Next Gen Stats | Ep.5: The Stack Behind the Stats | Amazon Web Services

11 дней – 3481:33

BMW Group transforms digital driving experience on AWS | Amazon Web Services

17 дней – 3 325 65312:05

Xiaomi 17 Ultra - More Camera than Phone!

17 дней – 18 0030:36

Play Fallout 76: Burning Springs on Samsung Gaming Hub

23 дня – 3 2181:45

Accelerate Copilot Adoption with the Copilot Adoption Community in Viva Engage

2 дня – 75 2220:28

Turn your AirPods into a camera remote for iPhone. #Shorts

9 дней – 1 0472:15

Supporting endangered languages

авто техно музыка детское

Последние техно видео О рейтинге Добавить канал English