The January 2014 outage: a buggy upgrade script took Dropbox offline for days

Loading…

What happened

On Friday 10 January 2014, around 5:30 PM Pacific, Dropbox went down during what was supposed to be routine OS-upgrade maintenance. A subtle bug in the upgrade script caused it to reinstall the operating system on a small number of machines that were still actively serving production traffic, including some master-replica database pairs. With those databases knocked out, the service became unavailable.

Dropbox restored most functionality within about three hours, but full recovery dragged on because some of the affected MySQL databases were very large and slow to rebuild from backups. Core service was not fully restored until Sunday 12 January, roughly two days after the incident began. In a detailed engineering post-mortem, VP of Infrastructure Akhil Gupta explained the root cause and the fixes: a new verification step requiring machines to confirm their own state before executing destructive commands, and a tool to parallelize MySQL binary-log replay so future restores would be faster.

Dropbox stressed that no file data was lost — the affected databases held metadata, and users' files were never at risk.

Impact

The outage was one of the most visible reliability failures in Dropbox's history, taking down a service that tens of millions of people and businesses relied on for access to their files for the better part of a weekend. It became a widely cited case study in how a single automation bug can cascade into a multi-day outage, and in the value of public, technical post-mortems. The slow database restore also exposed how recovery time, not just failure prevention, is a core reliability concern for cloud storage.

Related issues

2024–2026 (ongoing)3 sources

Medium

Betting the company on Dash: the uncertain future of the core sync product

Dropbox has reorganized around Dash, an AI-powered search assistant, repeatedly describing its core file-sync product as 'mature' — leaving longtime users uncertain how much future investment the service they actually pay for will receive.

Reliability & Data LossProduct Changes & User BacklashCurrent / Ongoing Issues (2024–2026)

Read documentation

The January 2014 outage: a buggy upgrade script took Dropbox offline for days

What happened

Impact

Related issues

Betting the company on Dash: the uncertain future of the core sync product

Sources

Over-quota data loss: downgrade, lose write access, then watch files be deleted

Dropbox drops external-drive sync on macOS, stranding terabyte archives

Disabled accounts: losing every file at once, sometimes by automated flagging