SRE實戰（影印版）

內容簡介

《SRE實戰（影印版英文版）》是軟體開發人員在網站災難性故障中的生存指南。隨著企業力求實現正常運行時間的大化，站點可靠性工程（Site Reli ability Engineering，SRE）首當其衝。當你的站點出現問題，修復故障已經迫在眉睫的時候，《SRE實戰（影印版英文版）》可以作為一個手把手的操作指南。

　　Nat Welch在可靠性工程方面豐富的實戰經驗源自於某些對於系統中斷事件極為敏感的網際網路大公司。他用於監控現代Web服務、設定警報和評估事件回響的方法都經過了實踐的考驗，學會這些必將助你一臂之力。

　　《SRE實戰（影印版英文版）》可不僅僅是教你如何應對災難，它還為你揭示了安全測試和發布軟體所需的工具和策略、長期增長計畫以及預見未來的瓶頸。通過《SRE實戰（影印版英文版）》，你將學會如何制定自己的強健行動計畫，以便在全公司的網站危機中凸顯你的價值。

圖書目錄

Preface

Chapter 1： Introduction

A brief history

What is SRE？

What is in the book？

SRE as a framework for new projects

Summary

References

Chapter 2： Monitoring

Why monitoring？

Instrumenting an application

What should we measure？

A short introduction to SLIs， SLOs， and error budgets

Service levels

Error budgets

Collecting and saving monitoring data

Polling applications

Nagios

Prometheus

Cacti

Sensu

Push applications

StatsD

Telegraf

ELK

Displaying monitoring information

Arbitrary queries

Graphs

Dashboards

Chatbots

Managing and maintaining monitoring data

Communicating about monitoring

SRE實戰（影印版）

基本介紹

內容簡介

圖書目錄

相關詞條

熱門詞條