Mooving To Stability.png

The role of catastrophic failure in software design with Jeff Atwood

Recorded on February 15th, 2022

Join seasoned veteran, Jeff Atwood (yes, that Jeff Atwood of Stack Overflow and Discourse) as he discusses the role of catastrophic failure in software design.

Users of modern apps require as close to 100% uptime as possible, which also means they require quick results. When these expectations aren't met, we need to learn from them to create better design. But what if your fault tolerance design ends up being the cause of your issues? Sean Molloy, and BJ Maldonado talk with Jeff about how you can learn from failure to improve your software.

Hear directly from Jeff:
  • The importance of resilient design
  • How resilient design can, at times, be the cause for actual failure
  • What recovering from failure looks like
  • And whatever else comes up!


Watch the recording

If you haven’t already, be sure to sign up for the Moogsoft newsletter to receive updates on future episodes.



Jeff Atwood

Jeff Atwood
Co-Founder, Stack Overflow and Discourse

BJ Maldonado

BJ Maldonado
Site Reliability Engineer, Moogsoft

Sean Molloy

Sean Molloy
Sr. Principal Architect, Moogsoft