
Autoscaling your AI agent under load


Google Cloud Platform (1.37M)


Published October 21, 2025, 22:54

This video demonstrates how to effectively autoscale your AI agent under heavy user load. We simulate a stress test on a decoupled architecture, combining a GPU-powered Gemma LLM with a lightweight ADK agent on Google Cloud Run. Discover how Cloud Run intelligently provisions resources to handle high demand, ensuring graceful scaling and cost efficiency by only scaling the bottleneck component.
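
To make the "only scale the bottleneck" idea concrete, here is a minimal sketch of how instance counts might diverge between the two services under the same load. It is not taken from the video or the codelab: the `Service` class, the `instances_needed` helper, the service names, and every concurrency and max-instance value are hypothetical, and the model looks only at request concurrency, whereas the real Cloud Run autoscaler also weighs signals such as CPU utilization.

```python
import math
from dataclasses import dataclass


@dataclass
class Service:
    """Hypothetical description of one Cloud Run service (values are assumptions)."""
    name: str
    concurrency: int    # max simultaneous requests a single instance accepts
    max_instances: int  # upper bound on the number of instances


def instances_needed(service: Service, in_flight: int) -> int:
    """Estimate instance count from in-flight requests (concurrency-only model)."""
    if in_flight <= 0:
        return 0  # assumes minimum instances is 0, so the service scales to zero
    wanted = math.ceil(in_flight / service.concurrency)
    return min(service.max_instances, wanted)


# Assumed settings: the lightweight agent handles many requests per instance,
# while the GPU-backed model service handles only a few.
agent = Service("adk-agent", concurrency=80, max_instances=10)
gemma = Service("gemma-gpu", concurrency=4, max_instances=3)

for in_flight in (1, 8, 12):
    print(
        f"in-flight={in_flight:>3}  "
        f"{agent.name}: {instances_needed(agent, in_flight)}  "
        f"{gemma.name}: {instances_needed(gemma, in_flight)}"
    )
```

Under these assumed numbers, the agent stays on one instance while the GPU service grows until it reaches its cap, which is the cost argument for decoupling the two components.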

Chapters:
0:00 - Introduction: The Challenge of Load
0:19 - Load Testing with Locust
1:31 - Observing Autoscaling in Cloud Run
2:02 - Key Learnings: Decoupling and Cost Efficiency
2:31 - Conclusion

Resources:
Codelab → goo.gle/475sUpV
GitHub Repository → goo.gle/3KJVc1Y
Google Cloud Run GPU → goo.gle/48sn3NV
ADK Documentation → goo.gle/3LauFL8

Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

#GoogleCloud #LLM #Gemma #ADK #CloudRun

Speakers: Amit Maraj
Products Mentioned: Cloud Run, Gemma, AI Infrastructure, Cloud GPUs
