What happened?
What was the impact on customers and your business?
What was the root cause?
What data do you have to support this?
especially metrics and graphs
What were the critical pillar implications, especially security?
When architecting workloads you make trade-offs between pillars based upon your business context. These business decisions can drive your engineering priorities. You might optimize to reduce cost at the expense of reliability in development environments, or, for mission-critical solutions, you might optimize reliability with increased costs. Security is always job zero, as you have to protect your customers.
What lessons did you learn?
What corrective actions are you taking?
Actions items
Related items (trouble tickets etc)
5 Whys
