Episode notes
In this special kickoff episode of Ship It Weekly, Brian walks through three major outages from the last few weeks and what they actually mean for DevOps, SRE, and platform teams.
Instead of just reading status pages, we look at how each incident exposes assumptions in our own architectures and runbooks.
Topics in this episode:
• Cloudflare’s global outage and what happens when your CDN/WAF becomes a single point of failure
• The AWS us-east-1 incident and why “multi-AZ in one region” isn’t a full disaster recovery strategy
• GitHub’s Git operations / Codespaces outage and how fragile our CI/CD and GitOps flows can be
• Practical questions to ask about your own setup: CDN bypass, cross-region readiness, backups for Git and CI
This episode is more of a themed “special” to ...