Why You Need Automation Today

A ton of tools help you observe your environment and maybe half a ton help you route things and deduplicate them. But there's hardly anything out there that actually fixes your environment. That's the reason we need automation in production ops today.
3 min
Summary

I talk a lot about bringing automation to production ops because it's the next problem to solve.

Let me explain.

- Almost everyone is moving to the cloud because it brings agility, allowing faster development and innovation.

- We have tools to automate other parts of the software development lifecycle, whether it’s running tests, building artifacts, configuration management, or deployments.

But sadly, managing the environment once it's in production is still an almost entirely manual job.

A ton of tools help you observe your environment and maybe half a ton help you route things and deduplicate them.

But there's hardly anything out there that actually fixes your environment.

And that's unfortunate for the poor SREs who have to deal with this vastly faster pace of innovation and a faster software development lifecycle.

They're getting code faster. It's more complex and even multi-cloud. It's got both Kubernetes and VMs alongside a bunch of microservices.

And they're responsible for plugging the holes in the dike wherever they happen, and the dikes are only getting bigger.

The only way to keep up with this challenging job is to ensure that:
- the things you do repetitively get automated away.
- the things that get escalated are turned into processes so that your first line of support can handle them without escalation.
- the things that happen for the first time can be debugged fleet-wide in parallel.

Personally, I never got excited when there was one more dashboard to look at or slightly better routing of an incident to somebody.

But I do get excited when some incident gets automated away forever because that reduces my labor, helping me keep up with an ever-growing, complex environment.

That's the reason we need automation in production ops today.
