Optimize model serving with GKE Inference Gateway

Published June 12, 2025, 23:00
Learn more → goo.gle/gke-inference-gateway

GKE Inference Gateway is an extension to the GKE Gateway that provides optimized routing and load balancing for serving generative artificial intelligence (AI) workloads. It simplifies the deployment, management, and observability of AI inference workloads.

Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

#GoogleCloud

Speakers: Mofi Rahman, Vaibhav Katkade
Products Mentioned: Google Kubernetes Engine (GKE), AI Infrastructure