Oliver Leaver-Smith - On how "just a monitoring change" took down the entire site and resilience engineering - #5

Oliver Leaver-Smith - On how "just a monitoring change" took down the entire site and resilience engineering - #5

Author: Ronak Nathani, Guang Yang February 19, 2021 Duration: 1:01:22
Oliver Leaver-Smith, better known as Ols, is a Senior Devops Engineer at Sky Betting and Gaming. In this episode, we discuss how a seemingly simple monitoring change ended up taking down the entire site. We also talk about chaos and resilience engineering. We discuss how the team at Sky Betting and Gaming conducts fire drills (chaos engineering exercises) where they not only test the resiliency of their software systems but also their people systems. We walk through a recent example of a fire drill, how they have evolved over the past few years and the lessons learned in the process.

Behind every line of code, there's a person with a story, and that's where Software Misadventures finds its pulse. Hosts Ronak Nathani and Guang Yang pull up a chair with engineers, founders, and investors, but the conversation rarely stays in the technical manual. Instead, it wanders into the human territory of career detours, hard-won insights, and those unpredictable stumbles that often teach the most. This podcast is built on the idea that the journey is just as important as the destination, especially in the fast-moving tech world. You'll hear guests recount the projects that went sideways, the decisions they'd rethink, and the moments of clarity that emerged from the chaos. It’s a refreshingly honest look at the industry, emphasizing that expertise isn't just about what you build, but what you learn when things don't go as planned. Tune in for conversations that are less about perfect solutions and more about the real, sometimes messy, process of creating with technology. Each episode offers a blend of professional wisdom and personal narrative, making it a compelling listen for anyone curious about the lives woven into our digital landscape.
Author: Language: English Episodes: 55

Software Misadventures
Podcast Episodes
From High School Suspension to US Chief Data Scientist | DJ Patil [not-audio_url] [/not-audio_url]

Duration: 1:05:08
Known for coining the term "Data Scientist", DJ is a renowned technologist with a diverse background spanning academia, industry, and government. Having led product teams at companies like RelateIQ and LinkedIn, DJ was a…
Building Diverse Engineering Teams | Erica Lockheimer [not-audio_url] [/not-audio_url]

Duration: 1:20:22
Erica is a former VP of Engineering at LinkedIn. Having almost dropped out of college, Erica's journey in tech is a testament to her perseverance and dedication. In addition to leading engineering teams at LinkedIn, Eric…
Stories behind building HashiCorp | Mitchell Hashimoto [not-audio_url] [/not-audio_url]

Duration: 1:17:01
Mitchell co-founded HashiCorp in 2012 and created many important infrastructure tools, such as Terraform, Vagrant, Packer, and Consul. In addition to being a prolific engineer, Mitchell grew HashiCorp into a multi-billio…
Open sourcing LinkedIn's Derived Data Platform | Felix GV (LinkedIn) [not-audio_url] [/not-audio_url]

Duration: 1:01:09
What's it like to open source an internal project at a big tech company like LinkedIn? When should a company open source a project and what are the benefits and challenges that come along with it? If you want to open sou…