Back to videos

About Shoreline’s Fleet-Wide Debugging and Repair

Shoreline enables highly targeted fleet-wide debugging and repair allowing you to debug across the fleet in about the same amount of time as an individual box.
2 min
play_arrow
Summary

At Shoreline, we enable highly targeted fleet-wide debugging and repair.

It allows you to:
- run a command across all your boxes in parallel
- decide whether to run a second command that gives you more detail, or
- go in a different direction

It’s similar to what you’d do to debug an individual box, but you're debugging across the fleet in about the same amount of time.

You can do many things in this model that you couldn't through dashboards.

For example:

At AWS, a large-scale event happened once due to a BIOS upgrade.

There's no way we could have a log file or a dashboard for it.

The only way out was to log into the boxes and find out what the heck was going on.

So I had ~20 people run this manual parallelization process (which is obviously ridiculous).

But that was the only way back then.

Today, you can use Shoreline to safely run individual commands across a lot of boxes simultaneously, all by yourself.

It is executed in a parallel distributed framework (like everything else we do at Shoreline).

That’s how our fleet-wide debugging and repair works.

Have you ever done fleetwide debugging? Could you use this capability?

Transcript

View more Shoreline videos

Looking for more? View our most recent videos
2 min
Shoreline Incident Insights
A quick overview video that shows automated categorization, filtering, and analysis of incidents.
3 min
How to Reduce Alarm Noise
In any company, 50-80% of the alarms are noisy. Employees get trained to snooze these alarms – which isn’t always the right thing to do. Wouldn't it be better if you could easily see which are your top issues each week, and which alarms might be set incorrectly?
2 min
How Notebooks Empower Your On-Call Teams
Some issues can't be automated. For things that require human judgment, we provide on-call teams with notebooks that are optimized for operations. That way you know what action to take and when.