This AI agent runs on Cloud Run + NVIDIA GPUs

Published November 13, 2025, 17:01
Source code for the smart health agent → goo.gle/4nJsFax

Have you ever wondered how to build a real AI agent application on a serverless NVIDIA GPU? In this video, Martin Omander (Google) sits down with Jay Rodge (NVIDIA) to walk through a complete setup. Jay demonstrates a smart health agent that runs on Cloud Run with an NVIDIA L4 GPU. Watch along as the duo dive right into the code and architecture.

See how Martin and Jay run open-source models like Gemma with Ollama on Cloud Run, use LangGraph to build a multi-agent workflow (RAG + tools), split the app into a CPU frontend (built with Gradio) and a GPU backend, and explain why a developer would host their own model instead of calling a managed API.
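
For readers who want a feel for the frontend/backend split before watching, here is a minimal sketch of how a CPU frontend might call a GPU backend that serves a model through Ollama. It is not taken from the video's source code. It assumes the backend exposes Ollama's standard /api/generate endpoint and accepts the request without extra authentication. The service URL and the model tag "gemma3" are hypothetical placeholders.

```python
import json
import urllib.request


def build_generate_request(base_url: str, model: str, prompt: str) -> urllib.request.Request:
    """Build a POST request for Ollama's /api/generate endpoint."""
    # "stream": False asks Ollama for one JSON object instead of a chunked stream.
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask_backend(base_url: str, model: str, prompt: str, timeout: float = 120.0) -> str:
    """Send the prompt to the GPU backend and return the generated text."""
    request = build_generate_request(base_url, model, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    # Non-streaming replies carry the generated text in the "response" field.
    return body["response"]


if __name__ == "__main__":
    # Hypothetical URL and model tag; replace with your own deployment's values.
    backend_url = "https://my-ollama-backend.example.run.app"
    print(ask_backend(backend_url, "gemma3", "Suggest a balanced breakfast."))
```

Keeping the frontend to plain HTTP calls like this is one way to let it run on a CPU-only service while only the backend needs the GPU.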

Chapters:
0:00 - Intro
0:40 - Demo of the smart health app
2:25 - How the app was built
5:00 - Code for multi-agent
5:33 - LangGraph vs ADK
5:50 - Hosting an LLM vs calling Gemini API
6:40 - Developer experience
7:06 - Wrap up

Watch more Serverless Expeditions → goo.gle/ServerlessExpeditions
🔔 Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

#GoogleCloud #CloudRun #Serverless

Speakers: Martin Omander, Jay Rodge
Products Mentioned: Cloud Run, Agent Development Kit