This resource first appeared in issue #47 on 23 Oct 2020 and has tags Technical Leadership: Systems: Incident Handling
A List of Post-Mortems - Dan Luu
In research computing, when it comes to running systems we could be a lot closer to industry best practices than we are. We’ve talked about post-mortems more than once; here’s a list of postmortems from many companies collected by Luu. It’s nice to see that they don’t necessarily have to be long or complicated or intricate; like risk management, just simple documents for ongoing clarity can be a huge step forward.