Special: When the Cloud Has a Bad Day: Cloudflare, AWS us-east-1 & GitHub Outages

Special: When the Cloud Has a Bad Day: Cloudflare, AWS us-east-1 & GitHub Outages

Author: Teller's Tech - DevOps, SRE and Cloud Podcast November 21, 2025 Duration: 12:54

In this special kickoff episode of Ship It Weekly, Brian walks through three major outages from the last few weeks and what they actually mean for DevOps, SRE, and platform teams.

Instead of just reading status pages, we look at how each incident exposes assumptions in our own architectures and runbooks:

Topics in this episode:

• Cloudflare’s global outage and what happens when your CDN/WAF becomes a single point of failure

• The AWS us-east-1 incident and why “multi-AZ in one region” isn’t a full disaster recovery strategy

• GitHub’s Git operations / Codespaces outage and how fragile our CI/CD and GitOps flows can be

• Practical questions to ask about your own setup: CDN bypass, cross-region readiness, backups for Git and CI

This episode is more of a themed “special” to kick things off.

Going forward, most episodes will follow a lighter news format: a couple of main stories from the week in DevOps/SRE/platform engineering, a quick tools and releases segment, and one culture/on-call or burnout topic. Specials like this will pop up when there’s a big incident or theme worth unpacking.

If you’re the person people DM when production is acting weird, or you’re responsible for the platform everyone ships on, this one’s for you.

Links from this episode

Cloudflare outage – November 18, 2025

https://blog.cloudflare.com/18-november-2025-outage/

https://www.thousandeyes.com/blog/cloudflare-outage-analysis-november-18-2025

AWS us-east-1 outage – October 2025

https://aws.amazon.com/message/101925/

https://www.thousandeyes.com/blog/aws-outage-analysis-october-20-2025

GitHub outage – November 18, 2025

https://us.githubstatus.com/incidents/f3f7sg2d1m20

https://currently.att.yahoo.com/att/github-down-now-not-just-211700617.html


For anyone building or running modern systems, the sheer volume of news, tools, and incident reports can be overwhelming. Ship It Weekly cuts through that noise. This isn't a surface-level scan of headlines. Host Brian Teller digs into the latest significant outages, major software releases, and insightful post-mortems, focusing squarely on the practical implications for DevOps, SRE, and platform engineering work. Each episode of the podcast breaks down a couple of key stories, providing the crucial context often missing from tech news. You'll hear analysis that translates events into actionable insights, answering the "so what?" for your own infrastructure and processes. The show also includes a quick rundown of tools or updates actually worth your attention, saving you hours of browsing. The tone is direct and informed, favoring depth over breadth. It’s designed for engineers and technical leaders who need a concise, reliable filter for the week's most relevant developments. Listen to this podcast for a focused recap that prioritizes what actually matters, delivered without fluff. You get the news, plus the necessary interpretation to understand how it might affect your systems, your team, and your on-call rotation. It's a weekly briefing that respects your time while aiming to make you more effective.
Author: Language: English Episodes: 37

Ship It Weekly - DevOps, SRE, Platform and Cloud Engineering News
Podcast Episodes