Back to videos

Risks of Automation vs. Human Errors

Automation is risky. Errors in the remediation code could worsen an outage. While that’s true, we also know that human error causes 5x more incidents than automation. You can fix code. You can't fix people.
2 min
play_arrow
Summary

“Automation is risky. Errors in the remediation code could worsen an outage.”

While that’s true, we also know that human error causes almost 5x more incidents than automation.

It’s because you can fix code, you can't fix people.

They come and go. Some have experience, some don't. And whoever happens to be on call is whoever happens to be on call.

People make mistakes. That's why when you're writing code, you don't just ship it.

It goes through testing, scripts, deployment, and all other processes.

But you don't have that opportunity when you're fixing something on call as you're dealing with it in the moment, under pressure.

That’s why the best way to reduce the risk in production ops is by doing more automation and leaving less in place for people.

Further, you can make automation less risky by using tools with circuit breakers that limit the number of times the automation runs, and that can deal with partial failures.

Basically, the tool must have the capability to understand the complexities of distributed systems.

So you can focus on automating just the individual issue that happens in the individual box.

Transcript

View more Shoreline videos

Looking for more? View our most recent videos
2 min
How to Bring Continuous Improvement in Operations
I deeply believe in making things 1% better each and every week by improving the performance of the software I've been responsible for and keeping my services up. Let’s talk about bringing continuous improvement to operations.
3 min
How to Reduce Alarm Noise
In any company, 50-80% of the alarms are noisy. Employees get trained to snooze these alarms – which isn’t always the right thing to do. Wouldn't it be better if you could easily see which are your top issues each week, and which alarms might be set incorrectly?
3 min
How to Setup Shoreline’s Incident Insights Tool
Learn step by step how to setup Shoreline's Incident Insights so that you can pinpoint the top causes of incidents, measure team health, and use trending data to drive continuous improvement. Get up and running in 2 minutes.