Humans are built to make mistakes. It’s how we learn. That’s true whether you work for a small startup or a global enterprise. The trick is to try to avoid making the same mistakes, over and over.
As Site Reliability Engineers, or SREs, we spend our days (and sometimes nights and weekends) making sure the platforms we oversee run smoothly. We also follow careful protocols for responding when so...