Category: Software, Security, Kubernetes, apple

There is established tooling for alert routing: Something breaks and it gets directed to an on-call dev or ops person to fix it. Ninety-nine percent of the time, that’s sufficient.

However, when you are dealing with unforeseeable circumstances, you don’t often have the wits about you to quickly make these decisions.

Egan said that “In major outage world, you are really responding to a symptom.

You need systems to go in and to where those problems are instead of making it an HR problem.”

Related Articles