Reliability and Uptime Availability: How Twitter Beat the “Fail Whale”
Jennifer Fraser, Head of Mission Critical Engineering, Twitter
Mazdak Hashemi, Sr. Director of Site Reliability Engineering & Infrastructure Operations, Twitter
Twitter today is an authentic global news source and a platform for important conversations. Twitter’s infrastructure has been significantly transformed since the first Tweet was sent over 9 years ago. Working at a scale that fits the definition of hyperscale, the Site Reliability Engineering team designed both a framework and a methodology for stress testing at scale, while the DC Mission Critical Engineering team improved uptime availability and reliability.