The library. Every incident, structured.
A growing archive of public postmortems, broken down into a consistent shape: what broke, why it cascaded, and what to take from it. New incidents added regularly.
28+
incidents
11+
years
13
organizations
/
sort
2 results · filtered
Google Cloud
id
incident
org
date
duration
severity
tags
FM-023
Service Control quota update rejects APIsA quota policy update with blank fields crashed Google Service Control globally, causing 503s across Google Cloud, Workspace, and Security Operations APIs.
Google Cloud
2025-06-12
3h
SEV-1
configquotaspanner
FM-014
The Automation Bug That Took Google's Network Control Plane OfflineA bug in Google's datacenter maintenance automation descheduled the network control plane in multiple physical locations at once. BGP withdrew within minutes, and traffic flowed onto an oversubscribed fail-static path until engineers could rebuild the configuration.
Google Cloud
2019-06-02
4h 25m
SEV-1
networkingautomationbgp