Building Blocks for Site Reliability

International Industry-Academia Workshop on Cloud Reliability and Resilience, EIT Digital, Berlin, Germany (2016)

Abstract

How does Google run reliable systems? At the heart of Site Reliability Engineering is the idea of treating reliability as a software problem and asking software engineers to design an operations function. This talk will examine the organizational, conceptual and technological building blocks that together comprise the concept of Site Reliability Engineering at Google.