Model: Blameless Postmortem

November 22nd, 2020

postmortem has 3 important jobs

  1. explain what happened

  2. apologize

  3. commit to improvement

 

anomalies and root cause

we look for causes, and any anomaly either get's labeled as a root cause or a contributing factor

 

many times these anomalies are present during "ordinary operations, too.

We give them more weight that they deserve

 

anomalies are present all the time

 

learn from "near misses"

eg. type an incorrect command, but catch it before executing

  • how was it caught?

  • what safety net could have helped

    • catch it

    • prevent it from doing harm

 

(src: Notes: Beyond the Phoenix Project

(src: Book: release it! - Michael Nygard)