How Anthropic uses Google Kubernetes Engine to run inference for Claude

768
25.6
Опубликовано 1 июля 2024, 15:39
Google Kubernetes Engine (GKE) provides cost efficiency and high performance to run AI inference on Google tensor processing units (TPUs) and NVIDIA graphics processing units. Join us to learn how Anthropic runs its inference workload for Claude on GKE, and how Anthropic achieved better price-perf on TPU v5e on GKE. We’ll also learn how GKE advanced management capabilities simplify Day-2 maintenance, and how Google Cloud Customer Support makes the entire experience a blast.

Speakers:

Watch more:
All sessions from Google Cloud Next → goo.gle/next24

#GoogleCloudNext

Event: Google Cloud Next 2024
Случайные видео
195 дней – 3 8090:38
How Do I use Loop with Callie
15.05.23 – 9 868 0410:31
A New World | every day better
автотехномузыкадетское