This episode we speak with Michael Kehoe, a Staff Site Reliability Engineer at LinkedIn. Topics include: Site Reliability Engineering, building satellites at NASA, LinkedIn’s Chaos Engineering project called Waterbear, using Chaos Engineering to test autoscaling, running Chaos Engineering experiments as regression tests in a release pipeline, and tips for starting a Chaos Engineering practice at your company.
- Michael’s Twitter
- Michael’s blog
- The InfoQ eMag with Michael’s article describing Project Waterbear (email required)
Our music is by Komiku. For more of Komiku’s music visit loyaltyfreakmusic.com.