How Anthropic uses Google Kubernetes Engine to run inference for Claude

840
23.3
Опубликовано 1 июля 2024, 15:39
Google Kubernetes Engine (GKE) provides cost efficiency and high performance to run AI inference on Google tensor processing units (TPUs) and NVIDIA graphics processing units. Join us to learn how Anthropic runs its inference workload for Claude on GKE, and how Anthropic achieved better price-perf on TPU v5e on GKE. We’ll also learn how GKE advanced management capabilities simplify Day-2 maintenance, and how Google Cloud Customer Support makes the entire experience a blast.

Speakers:

Watch more:
All sessions from Google Cloud Next → goo.gle/next24

#GoogleCloudNext

Event: Google Cloud Next 2024
Случайные видео
248 дней – 6 84712:57
Modern Edge AI experiences
09.07.22 – 507 2689:28
The Ipad Is Changing!
автотехномузыкадетское