What I’ve noticed with most organizations that want “the SRE” is that they have a hard time articulating WHY they want it. They think SRE is the solution to this (spoiler alert: it is) but don’t know where to start.
SRE is a way to align your organization on a definition of reliability and what will happen if the organization fails to meet those standards.
To get a better idea on where we can start, let’s take a look at Mikey Dickerson’s pyramid of SRE: Notice that monitoring is at the base of the pyramid.
It provides a team with a path to adopt SRE and to mature over time based on their needs.