I recently spoke with the developers of Humanitec, a Continuous Delivery platform for Kubernetes. Humanitec is interesting because, contrary to recent trends, it’s not based on a GitOps deployment wor...
Humans are built to make mistakes. It’s how we learn. That’s true whether you work for a small startup or a global enterprise. The trick is to avoid making the same mistakes over and over.
‘Every great cause begins as a movement, becomes a business, and eventually degenerates into a racket’. —Eric Hoffer, moral and social philosopher
Java (and its other JDK-based siblings) is the most widely used programming language in large companies. Java developers are backend-focused and used to building complex distributed systems. Yet these...
For most people the word ‘chaos’ means complete disorder and confusion. So what does it mean to engineer chaos? The distributed systems we build are becoming more and more complex, thus their state ca...
Being a Site Reliability Engineer, or SRE, is a hot job, and SREs are expensive to keep on staff.
Site Reliability Engineering, or SRE, an engineering practice formalised and named by Google, has helped many organisations maintain their platforms and ensure application performance and reliability,...
As Site Reliability Engineers, or SREs, we spend our days (and sometimes nights and weekends) making sure the platforms we oversee run smoothly. We also follow careful protocols for responding when so...
This is the conclusion of a three-part blog series. For more information, request our free e-book, SRE: The Cloud Native Approach to Operations. If you’ve been following parts 1 and 2 of this blog ser...
