Cloud Load Balancer, the secret to uptime for AI inference

1 010
8.6
Опубликовано 24 марта 2025, 23:35
See the detailed reference architecture → goo.gle/4bLQdap

Learn how to pair new cloud load balancing capabilities like custom metrics and service extensions with GKE Autopilot, which includes features like node auto-repair to automatically replace unhealthy nodes, and horizontal pod autoscaling to adjust resources based on application demand.

Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

Speaker: Don McCasland
Products Mentioned: AI Infrastructure
автотехномузыкадетское