How Anthropic uses Google Kubernetes Engine to run inference for Claude

Published July 1, 2024, 15:39
Google Kubernetes Engine (GKE) provides cost efficiency and high performance for running AI inference on Google Tensor Processing Units (TPUs) and NVIDIA graphics processing units (GPUs). Join us to learn how Anthropic runs its inference workload for Claude on GKE, and how Anthropic achieved better price-performance with TPU v5e on GKE. We’ll also learn how GKE's advanced management capabilities simplify Day-2 maintenance, and how Google Cloud Customer Support makes the entire experience a blast.

Speakers:

Watch more:
All sessions from Google Cloud Next → goo.gle/next24

#GoogleCloudNext

Event: Google Cloud Next 2024