<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>Where&apos;s the Postmortem?</title><description>Curated incident stories from the best engineering teams.</description><link>https://wheresthepostmortem.com/</link><item><title>When Someone Else Hijacked 1.1.1.1</title><link>https://wheresthepostmortem.com/postmortems/cloudflare-1111-bgp-hijack-2024/</link><guid isPermaLink="true">https://wheresthepostmortem.com/postmortems/cloudflare-1111-bgp-hijack-2024/</guid><description>A Brazilian ISP announced a more-specific route for 1.1.1.1, hijacking DNS traffic across 300 networks in 70 countries.</description><pubDate>Thu, 27 Jun 2024 00:00:00 GMT</pubDate></item><item><title>When a Stolen Token in an Archived Repo Compromised an Entire Platform</title><link>https://wheresthepostmortem.com/postmortems/heroku-oauth-token-breach-2022/</link><guid isPermaLink="true">https://wheresthepostmortem.com/postmortems/heroku-oauth-token-breach-2022/</guid><description>An attacker found a machine account token in an archived private repo, used it to steal customer OAuth tokens, and accessed private repositories across dozens of organizations.</description><pubDate>Thu, 07 Apr 2022 00:00:00 GMT</pubDate></item><item><title>73 Hours Down: When Service Discovery Took Down an Entire Platform</title><link>https://wheresthepostmortem.com/postmortems/roblox-73-hour-outage-2021/</link><guid isPermaLink="true">https://wheresthepostmortem.com/postmortems/roblox-73-hour-outage-2021/</guid><description>A Consul upgrade and a BoltDB storage pathology combined to take Roblox offline for 73 hours.</description><pubDate>Thu, 28 Oct 2021 00:00:00 GMT</pubDate></item><item><title>The Day Facebook Disappeared from the Internet</title><link>https://wheresthepostmortem.com/postmortems/facebook-global-outage-2021/</link><guid isPermaLink="true">https://wheresthepostmortem.com/postmortems/facebook-global-outage-2021/</guid><description>A backbone configuration change accidentally disconnected all Facebook data centers from each other and the internet for six hours.</description><pubDate>Mon, 04 Oct 2021 00:00:00 GMT</pubDate></item><item><title>One Customer Setting Broke the Internet for an Hour</title><link>https://wheresthepostmortem.com/postmortems/fastly-global-cdn-outage-2021/</link><guid isPermaLink="true">https://wheresthepostmortem.com/postmortems/fastly-global-cdn-outage-2021/</guid><description>A single valid customer config change triggered a dormant bug that took down 85% of Fastly&apos;s CDN, knocking Amazon, Reddit, and the BBC offline.</description><pubDate>Tue, 08 Jun 2021 00:00:00 GMT</pubDate></item><item><title>When Everyone Came Back from Holiday and Slack Couldn&apos;t Handle It</title><link>https://wheresthepostmortem.com/postmortems/slack-first-monday-outage-2021/</link><guid isPermaLink="true">https://wheresthepostmortem.com/postmortems/slack-first-monday-outage-2021/</guid><description>Cold client caches after the holiday break overwhelmed AWS Transit Gateways, triggering auto-scaling chaos that made the outage worse.</description><pubDate>Mon, 04 Jan 2021 00:00:00 GMT</pubDate></item><item><title>43 Seconds of Network Partition, 24 Hours of Degraded Service</title><link>https://wheresthepostmortem.com/postmortems/github-database-incident-2018/</link><guid isPermaLink="true">https://wheresthepostmortem.com/postmortems/github-database-incident-2018/</guid><description>A 43-second network partition triggered an automated cross-region MySQL failover that created a split-brain, degrading GitHub for 24 hours.</description><pubDate>Sun, 21 Oct 2018 00:00:00 GMT</pubDate></item><item><title>A Typo That Took Down the Internet for Four Hours</title><link>https://wheresthepostmortem.com/postmortems/aws-s3-outage-2017/</link><guid isPermaLink="true">https://wheresthepostmortem.com/postmortems/aws-s3-outage-2017/</guid><description>An engineer mistyped a command while debugging, removing far more S3 servers than intended. The restart took four hours because S3 had never been fully restarted.</description><pubDate>Tue, 28 Feb 2017 00:00:00 GMT</pubDate></item><item><title>When a Database Engineer Accidentally Deleted the Production Database</title><link>https://wheresthepostmortem.com/postmortems/gitlab-database-deletion-2017/</link><guid isPermaLink="true">https://wheresthepostmortem.com/postmortems/gitlab-database-deletion-2017/</guid><description>An engineer ran rm -rf on production. Five backup strategies all failed.</description><pubDate>Tue, 31 Jan 2017 00:00:00 GMT</pubDate></item><item><title>How Dead Code Caused a $440 Million Loss in 45 Minutes</title><link>https://wheresthepostmortem.com/postmortems/knight-capital-440m-trading-loss-2012/</link><guid isPermaLink="true">https://wheresthepostmortem.com/postmortems/knight-capital-440m-trading-loss-2012/</guid><description>A botched deployment activated 9-year-old dead code that bought high and sold low for 45 minutes straight.</description><pubDate>Wed, 01 Aug 2012 00:00:00 GMT</pubDate></item></channel></rss>