Back to videos

Why You Need Automation Today

A ton of tools help you observe your environment and maybe half a ton help you route things and deduplicate them. But there's hardly anything out there that actually fixes your environment. That's the reason we need automation in production ops today.
3 min
play_arrow
Summary

I talk a lot about bringing automation to production ops because it's the next problem to solve.

Let me explain.

- Almost everyone is moving to the cloud because it brings agility, allowing faster development and innovation.

- We have tools to automate other parts of the software development lifecycle, whether it’s running tests, building artifacts, configuration management, or deployments.

But sadly, managing the environment once it's in production is still an almost entirely manual job.

A ton of tools help you observe your environment and maybe half a ton help you route things and deduplicate them.

But there's hardly anything out there that actually fixes your environment.

And that's unfortunate because I pity the poor SREs who deal with this vastly faster pace of innovation and software development lifecycle.

They're getting code faster. It's more complex and even multi-cloud. It's got both Kubernetes and VMs alongside a bunch of microservices.

And they're responsible for plugging the holes in the dike wherever they happen, and the dikes are only getting bigger.

The only way to keep up with this challenging job is by ensuring that:
- the things that you do repetitively get automated away.
- the things that get escalated can be moved into processes so that your 1st line can manage them without escalation.
- things that happen for the first time can be debugged fleet-wide in parallel.

Personally, I never got excited when there was one more dashboard to look at or slightly better routing of an incident to somebody.

But I do get excited when some incident gets automated away forever because that reduces my labor, helping me keep up with an ever-growing, complex environment.

That's the reason we need automation in production ops today.

Transcript

View more Shoreline videos

Looking for more? View our most recent videos
3 min
How to Reduce Waste for Unexpected Demands
Shoreline's back ends are low utilization most of the time. But once an hour, we pull telemetry data from all agents, resulting in a CPU, memory, and network utilization spike. See how we convert over-provisioned resources for demand spikes to waste and eliminate it.
1 min
Using Shoreline.io to root-cause transient issues (like JVM garbage collection)
Shoreline makes it easy to collect diagnostic information when you're doing a root-cause analysis of an issue. This example shows how to automatically capture debugging information for slow Java garbage collection and then automatically bounce the process to alleviate customer pain.
3 min
3 Hacks to Reduce Your Cloud Computing Bill
Things that enabled me to do more with lower cloud computing costs