Special: When the Cloud Has a Bad Day: Cloudflare, AWS us-east-1 & GitHub Outages

Ship It Weekly - DevOps, SRE, and Platform Engineering News por Teller's Tech - DevOps SRE Podcast

Notas del episodio

In this special kickoff episode of Ship It Weekly, Brian walks through three major outages from the last few weeks and what they actually mean for DevOps, SRE, and platform teams.

Instead of just reading status pages, we look at how each incident exposes assumptions in our own architectures and runbooks:

Topics in this episode:

• Cloudflare’s global outage and what happens when your CDN/WAF becomes a single point of failure

• The AWS us-east-1 incident and why “multi-AZ in one region” isn’t a full disaster recovery strategy

• GitHub’s Git operations / Codespaces outage and how fragile our CI/CD and GitOps flows can be

• Practical questions to ask about your own setup: CDN bypass, cross-region readiness, backups for Git and CI

This episode is more of a themed “special” to  ... 

 ...  Leer más
Palabras clave
devopscloudflareawsgithuboutageSREcloud infrastructureCDNplatform engineeringWAFIncident managementPostmortemTeller’s TechBrian Teller