Automated scanning of private files: PhotoDNA, CSAM hash-matching, and the 2015 silence

2015 (ongoing practice)

Medium | Status: ongoing | Product: Core sync

Dropbox runs every uploaded image and video through hash-matching systems such as Microsoft's PhotoDNA to detect known child sexual abuse material — automated scanning of users' private files that the company initially refused to explain.

What happened

Like other major cloud providers, Dropbox proactively scans content uploaded to and shared on its service against hash databases of known child sexual abuse material (CSAM). It uses technologies including Microsoft's PhotoDNA and Google's CSAI Match, alongside hash lists from the National Center for Missing & Exploited Children (NCMEC) and the Internet Watch Foundation. When a match is found, Dropbox removes access to the content, disables the account, and reports the match to NCMEC as required by U.S. law.
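
To make the idea of matching against a hash database concrete, here is a minimal sketch of fingerprint matching in Python. It is not PhotoDNA or CSAI Match, whose algorithms are proprietary; it substitutes a simple "average hash" as a stand-in, and the names `average_hash`, `hamming`, `find_match`, the 4x4 pixel grids, and the distance threshold are all hypothetical choices made for illustration.

```python
from typing import Iterable, List, Optional

# Hypothetical stand-in for a downscaled grayscale image (values 0-255).
Grid = List[List[int]]


def average_hash(pixels: Grid) -> int:
    """Pack one bit per pixel: 1 if the pixel is brighter than the grid mean."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for p in flat:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Count the bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")


def find_match(candidate: int, known: Iterable[int], max_distance: int) -> Optional[int]:
    """Return the first known fingerprint within max_distance bits, else None."""
    for h in known:
        if hamming(candidate, h) <= max_distance:
            return h
    return None


# Illustrative data only: a "known" image, a slightly brightened copy of it,
# and an unrelated gradient image.
original = [
    [10, 200, 30, 220],
    [15, 210, 25, 230],
    [12, 205, 35, 225],
    [240, 20, 250, 40],
]
brightened = [[min(255, p + 5) for p in row] for row in original]
unrelated = [[16 * (4 * r + c) for c in range(4)] for r in range(4)]

hash_list = [average_hash(original)]  # assumed in-memory stand-in for a hash list

copy_match = find_match(average_hash(brightened), hash_list, max_distance=2)
other_match = find_match(average_hash(unrelated), hash_list, max_distance=2)
```

In this sketch the brightened copy still matches because its fingerprint is unchanged, while the unrelated image does not. The threshold is the design choice that matters: a looser `max_distance` tolerates more re-encoding but also raises the chance that an unrelated file falls within range.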

The practice surfaced publicly in 2015, when a Dropbox user's shared link was blocked because it matched a flagged file. Reporting at the time (Gizmodo, 'Dropbox Refuses to Explain Its Mysterious Child Porn Detection Software') noted that Dropbox would not detail how its detection worked. The privacy tension is structural: detecting content by fingerprint requires Dropbox to inspect files server-side, which is possible only because the service does not use end-to-end encryption. Critics note that the same infrastructure that scans for CSAM could in principle be repurposed to detect other categories of content, and that hash-matching carries a small but non-zero false-positive risk.
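
The dependence on server-side access can be illustrated with a small sketch. It assumes an exact SHA-256 digest in place of a perceptual fingerprint, and a toy XOR stream cipher (`toy_encrypt`, a hypothetical helper that is not secure and not anything Dropbox uses) in place of real end-to-end encryption. The point is only that a server holding ciphertext cannot reproduce the fingerprint of the plaintext.

```python
import hashlib
import os


def digest(data: bytes) -> str:
    """Exact-match fingerprint: the SHA-256 digest of the bytes."""
    return hashlib.sha256(data).hexdigest()


def toy_encrypt(key: bytes, data: bytes) -> bytes:
    """Toy XOR stream cipher (SHA-256 in counter mode). Illustration only.

    Applying it twice with the same key returns the original bytes.
    """
    out = bytearray()
    for offset in range(0, len(data), 32):
        block = hashlib.sha256(key + offset.to_bytes(8, "big")).digest()
        chunk = data[offset:offset + 32]
        out.extend(b ^ k for b, k in zip(chunk, block))
    return bytes(out)


# Placeholder bytes standing in for a file whose digest is on a hash list.
flagged_file = b"bytes of a file whose digest is on a hash list"
hash_list = {digest(flagged_file)}

# Case 1: the server stores the file as uploaded and can fingerprint it.
plaintext_hit = digest(flagged_file) in hash_list

# Case 2: the client encrypts with a key the server never sees.
client_key = os.urandom(32)
ciphertext = toy_encrypt(client_key, flagged_file)
ciphertext_hit = digest(ciphertext) in hash_list
```

Here `plaintext_hit` is true and `ciphertext_hit` is false: the scan works only in the first case, where the provider can read the content it stores.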

Impact

The scanning is widely regarded as a justified child-safety measure, but it is also a concrete demonstration that 'your' files are routinely read and fingerprinted by automated systems the moment they touch Dropbox. It anchors the broader privacy argument that server-side access — necessary for this scanning — is what makes mass inspection, and by extension breach and surveillance, possible at all.

Related issues

2023–2026 (ongoing) | 3 sources

Medium

'Not training — today': lingering skepticism over Dropbox's AI data assurances

Dropbox repeatedly assures users that AI features do not train on their data and that content is deleted within 30 days — but because these are revocable policy promises layered over server-side access rather than technical guarantees, security commentators remain skeptical that the assurances will hold.

Dropbox Dash goes mainstream (2025): an AI assistant that indexes everything you connect

Dash and the third-party AI connectors: trusting Dropbox to broker your data to OpenAI, Google, and Microsoft

Discontinuing Dropbox Vault: the PIN-protected folder turned ordinary