When Reddit went down on March 13, 2022, it was a rude reminder that the company needed to manage its infrastructure in a different way. The notorious “Pi Day” site-wide outage, which, coincidentally, lasted for 314 minutes, came from a cluster-wide upgrade from Kubernetes 1.23 to 1.

Related Articles