Should Site Reliability Engineering https://thenewstack.io/boost-sre-productivity-with-observability-driven-automation/ be “shifted left” into the Continuous Integration stage of software deployment, that is before the software is even deployed? A recent Usenix opinion piece, “https://www.usenix.org/publications/loginonline/ci-alerting,” written by https://www.linkedin.com/in/tituswinters/, Principal Software Engineer at Google, explains how this potential practice could be useful. As Winters points out, CI systems are systems for automating the build-and-test routine: build the code, and run the tests as often as is reasonable.

CI on large-scale integration tests are the equivalent to canary deployments and when using high-fidelity test data, reporting large-scale integration test failures in staging are basically the same failures seen in production alerts.

Here’s a unit test failure: Engineer 1: “We got a test failure from our CI system.

Related Articles