Back to videos

How to Safely Fix Issues Without Escalation

The only real solution is incident automation.
1 min
play_arrow
Summary

Unpopular opinion on fixing issues:

Investing in DevOps tools to detect incidents and hiring qualified engineers to fix them is not the right way to do it.

Why do I see that as a problem?

I believe that simply detecting issues and hiring qualified people is necessary but not sufficient.

Let’s think about a four-nines SLO – that’s 4.4 minutes a month.

Every  issue will break your SLO, as it might take:
- 10-15 minutes on average to get an issue to somebody, and then
- up to an hour to fix it.

So there's no chance you can do anything except dissatisfy people whenever you have an issue.

The only real solution to that is automation.

That’s why I started Shoreline to build incident automation to help people:
- automatically fix issues in production
- grow the number of people who can safely fix things without escalation

For issues that require human judgment, the first person who looks at the issue should be able to resolve it rather than routing it through 6 other people.

And having a system take care of the mundane work is way better for me than dealing with something the 5th or 25th time.

Because, like everyone, I like sleeping at night.

Transcript

View more Shoreline videos

Looking for more? View our most recent videos
2 min
How Does Shoreline’s Incident Insights Work?
I know I should apply continuous improvement to operations. But where do I start? See how our free Incident Insights tool helps you remove noise and increase signal, making your team more productive and reducing costs by decreasing toil.
2 min
About Company Values
Part of the reason to create a company is to create the environment you want to be in.So it’s important that you reflect your values in your interview process. Otherwise, the sheer number of people joining will dilute things.
3 min
Shoreline Datadog Incident Repair Kit Demo
Create a library of best practice debugging tools and pre-built remediation actions so that everyone on-call is as good as your best SRE with Shoreline's Datadog Incident Repair Kit.