Optimize model serving with GKE Inference Gateway

2 625
13.7
Следующее
296 дней – 2 0713:55
The real talk on agent evaluation
Популярные
48 дней – 3 9586:00
Pros and cons of on-device AI
Опубликовано 12 июня 2025, 23:00
Learn More →goo.gle/gke-inference-gateway

GKE Inference Gateway is an extension to the GKE Gateway that provides optimized routing and load balancing for serving generative Artificial Intelligence (AI) workloads. It simplifies the deployment, management, and observability of AI inference workloads.

Subscribe to Google Cloud Tech→ goo.gle/GoogleCloudTech

#GoogleCloud

Speakers: Mofi Rahman, Vaibhav Katkade
Products Mentioned: Google Kubernetes Engine (GKE), AI Infrastructure
автотехномузыкадетское