The overwhelming majority of a software system’s lifespan is spent in use, not in design or implementation. So, why does conventional wisdom insist that software engineers focus primarily on the design and development of large-scale computing systems?
In this collection of essays and articles, key members of Google’s Site Reliability Team explain how and why their commitment to the entire lifecycle has enabled the company to successfully build, deploy, monitor, and maintain some of the largest software systems in the world. You’ll learn the principles and practices that enable Google engineers to make systems more scalable, reliable, and efficient—lessons directly applicable to your organization.
This book is divided into four sections:
- Introduction—Learn what site reliability engineering is and why it differs from conventional IT industry practices
- Principles—Examine the patterns, behaviors, and areas of concern that influence the work of a site reliability engineer (SRE)
- Practices—Understand the theory and practice of an SRE’s day-to-day work: building and operating large distributed computing systems
- Management—Explore Google’s best practices for training, communication, and meetings that your organization can use
Site Reliability Engineering: How Google Runs Production Systems (Chinese edition: 网站可靠性工作手册)