24/7 resiliency (Google Cloud Next '17)

2 928
42.4
Опубликовано 9 марта 2017, 3:34
From Site Reliability Engineering (SRE) to Customer Reliability Engineering (CRE) and Cloud Ops, there's a lot involved in keeping the Google cloud running, scaling and performing, across our organization and by extension for our customers. In this video, Mahesh Kallahalla, Luke Stone, and William Bonnell give you a close look into the internal procedures we use to continually improve reliability. They also discuss best practices for interacting with Google in order to reduce mean-time-to-detect and mean-time-to-recovery.

Missed the conference? Watch all the talks here: goo.gl/c1Vs3h
Watch more talks about Infrastructure & Operations here: goo.gl/k2LOYG
Свежие видео
9 дней – 23 5855:04
AMD Advancing AI 2024 Highlights
14 дней – 3280:19
Going AFK in VRChat be like...
автотехномузыкадетское