Break Things on Purpose

Break Things on Purpose

breakthingsonpurpose.transistor.fm
A monthly podcast about Chaos Engineering, presented by Gremlin, Inc.


Kelsey Hightower
Jan 21 • 43 min
This episode we speak with Kelsey Hightower. Kelsey is a Principal Developer Advocate at Google. Topics include: Promise Theory, is Kubernetes hard, running databases on Kubernetes, the meat cloud, empathy sessions, how Kubernetes has helped standardize…
Kolton Andrus
Dec 21, 2019 • 48 min
This episode we speak with Kolton Andrus, the CEO and co-founder of Gremlin. Topics include: The role of a Call Leader in incidents, using Chaos Engineering as runtime validation, FIT and application level fault injection, Jesse Robbins and early…
Haley Tucker
Nov 20, 2019 • 38 min
This episode we speak with Haley Tucker. Haley is a Senior Software Engineer on the Resilience Engineering team at Netflix. Topics include: Running Chaos Engineering experiments as A/B tests, testing dependencies, fallbacks, testing in production, and why…
Matthew Simons
Oct 20, 2019 • 40 min
This episode we speak with Matthew Simons. Matthew is a Senior Product Development Manager at Workiva and he leads the Quality Assessment team there. Topics include: Supporting and encouraging reliability at Workiva, why Workiva moved from App Engine to…
Subbu Allamaraju
Sep 20, 2019 • 39 min
This episode we speak with Subbu Allamaraju. Subbu is a Senior Technologist at the Expedia Group. Topics include: Learning from incidents, changing culture, Why Complex Systems Fail, drifting into failure, forming a hypothesis, showing value from your…
Adrian Hornsby
Aug 20, 2019 • 36 min
This episode we speak with Adrian Hornsby, a Senior Tech Evangelist at Amazon Web Services. Topics include: Curiosity and breaking things, the cost of downtime, Jesse Robbins and early failure injection at Amazon, making the case to management for Chaos…
Caroline Dickey
Jul 20, 2019 • 37 min
In this episode we speak with Caroline Dickey, a Site Reliability Engineer at Mailchimp. Topics include: Having customer empathy, rolling out a Chaos Engineering program, and some of the Game Days that the Mailchimp team has conducted, including…
Paul Osman
Jun 20, 2019 • 33 min
This episode we speak with Paul Osman, who leads the Site Reliability Engineering team at Under Armour. Topics include: Paul’s beginnings in Chaos Engineering at 500 Pixels, Reliability Engineering at Pager Duty, bootstrapping the Chaos Engineering…
Michael Kehoe
May 20, 2019 • 30 min
This episode we speak with Michael Kehoe, a Staff Site Reliability Engineer at LinkedIn. Topics include: Site Reliability Engineering, building satellites at NASA, LinkedIn’s Chaos Engineering project called Waterbear, using Chaos Engineering to test…
Tammy Butow and Ana Medina
Apr 21, 2019 • 35 min
Welcome to our first episode of Break Things On Purpose, a podcast about Chaos Engineering. Our guests are Tammy Butow, Principal SRE at Gremlin, and Ana Medina, a Chaos Engineer at Gremlin. Topics include: What is Chaos Engineering? Planning an…