×

微信扫一扫,快捷登录!

谷歌运维解密翻译作者讲解SRE

标签: 暂无标签

• 生产线管理员
• Ensure user-visible uptime and service quality
• Authority over production environment.
• 跟网站一起成长
• Steep learning curve, mostly due to complexity
• Continuous retraining, sites always being improved
• 基础架构设施
• Specializations for shared infrastructure
• Ensure those components have good reliability


it just works
• Service Level Objective (SLO)
• Monitoring/Deployment
• Capacity Planning
• 以一敌百
• Team manages monitoring and develops automation
• Implies use of scripting and data analysis tools
• Most failures need automated recoveries in place
• 救火队员和纵火犯合体
• Elevated risk during convenient working hours
• Learn of age mortality risk during preceding workday
• Infant mortality ideally also avoids meals


码农
• Not administration
• 报警系统重度(中毒)用户
• Holes may cause outage before notification occurs
• Routinely use multiple layers, levels and viewpoints
• Design the manual and automatic escalation paths
• 对未来负责
• Responsible for enabling growth and scaling
• Plan for requirements, identify inefficiencies
• File bugs and, where appropriate, fix them too






本帖子中包含更多资源

您需要 登录 才可以下载或查看,没有账号?立即注册

x




上一篇:SRE基础讲义一览
下一篇:SRE 工作职责金字塔
admin

写了 864 篇文章,拥有财富 29590,被 26 人关注

您需要登录后才可以回帖 登录 | 立即注册
B Color Link Quote Code Smilies

成为第一个吐槽的人

Powered by IT 运维管理
返回顶部