Ditch the Template: Incident Write-ups They Want to Read

    Why should we try to make incident reports engaging?

    Regulating the Cloud for the Financial Services Sector

    In my White Paper - Banking on the Cloud, written in the summer of 2022, I highlight the increasing direct oversight that regulators are starting to have over the Cloud Service Providers (CSP).

    WTF a Developer Platform is Not

    Goodness, have I read and written a lot about platform engineering so far this year. Platform engineering is the sociotechnical discipline of crafting, building and combining all the common tools need...

    #HugOps and Psychological Safety - Think Root Causes Not Root People

    Benjamin Franklin once said “The Only Two Certainties In Life Are Death And Taxes”. But if he was an engineer he’d probably add another to that list, outages. Engineers at Facebook would doubtless agr...

    Why You Need Chaos Engineering Now More Than Ever

    About a year ago, brick and mortars like restaurants and grocery stores were scrambling to set up delivery and curbside pickup. A lot of them used chaos engineering, in production, to hunt for failure...

    Fire Drills: a Guide to Preparing for Your Next Incident

    Supporting Cloud Native applications is no easy task. Through offering Customer Reliability Engineering (CRE) support—essentially, Site Reliability Engineering (SRE) as a service—for multiple customer...

    WTF Is Continuous Improvement?

    When you’re offered a Covid-19 vaccine this year, which sort would you like? One that’s been through animal and human trials, received government approval, is made on a standardised production line, a...

    Isn't SRE Just DevOps?

    The truly Cloud Native way to work in teams, according to the Maturity Matrix, means SRE and DevOps. But what does that mean? You might be wondering, Isn’t SRE basically just DevOps?

    Comparing Chaos Engineering Tools for Kubernetes Workloads

    For most people the word ‘chaos’ means complete disorder and confusion. So what does it mean to engineer chaos? The distributed systems we build are becoming more and more complex, thus their state ca...