Back to webinars and podcasts
60 min

5 Technical Lessons Learned from Outages at AWS, Google and Microsoft

Every outage has a story and a lesson. InfoQ hosts this webinar featuring experts sharing both.
Listen to the podcast
Overview

We can learn something from every outage, regardless of whether it comes from a start-up or a hyperscaler like Amazon or Google.  

In this webinar, we hear from two reliability experts, Niall Murphy (former head of SRE at Microsoft and Google) and Anurag Gupta (former VP of AWS database and analytic services).  This session includes technical lessons learned large outages at Amazon and Google.

60 min

5 Technical Lessons Learned from Outages at AWS, Google and Microsoft

Every outage has a story and a lesson. InfoQ hosts this webinar featuring experts sharing both.
Register now
Overview

We can learn something from every outage, regardless of whether it comes from a start-up or a hyperscaler like Amazon or Google.  

In this webinar, we hear from two reliability experts, Niall Murphy (former head of SRE at Microsoft and Google) and Anurag Gupta (former VP of AWS database and analytic services).  This session includes technical lessons learned large outages at Amazon and Google.

What you'll learn

The importance of automation and how to build circuit breakers to mitigate risk

Scaling automation: what works at 1,000 customers may not work at 1 million customers.

Botched rollouts: don’t forget to check the failure rate of distributed jobs

The anti-80/20 rule. Avoiding your biggest potential Achilles heel by building redundancy into the systems you use the least.

How to conduct blameless post-mortems that maximize the lessons learned from any outage

Featured speakers
Anurag Gupta
Former VP of AWS Database and Analytic Services. Founder and CEO of Shoreline.io
Niall Murphy
SRE and engineering leader at Amazon, Google, and Azure. Author Google SRE Book
Eric Costlow
InfoQ Editor

Find more Shoreline resources

Looking for more information? Visit our other resource sections