AWS re:Invent 2018: How AWS Minimizes the Blast Radius of Failures (ARC338)

Published November 27, 2018, 19:20
At AWS, we obsess over operational excellence. We have a deep understanding of system availability, informed by over a decade of experience operating the cloud and by our roots in operating Amazon.com for nearly a quarter-century. One thing we've learned is that failures come in many forms, some expected and some unexpected. It's vital to build from the ground up and embrace failure. A core consideration is how to minimize the "blast radius" of any failure. In this talk, we discuss a range of blast radius reduction design techniques that we employ, including cell-based architecture, shuffle-sharding, Availability Zone independence, and Region isolation. We also discuss how blast radius reduction infuses our operational practices.