Skip to the content.

Metadata and Data Deletion: Removing Digital Traces

*Status: Level 1 Audience: All members*

Every file you create, every photo you take, every document you edit generates invisible metadata — data about data. This metadata often reveals more than the content itself: who created a file, when, where, with what software, and on what device. This guide covers finding, stripping, and permanently destroying metadata and sensitive data.


1. What Is Metadata and Why It Matters

1.1 Types of Metadata

File metadata (EXIF, document properties):

System metadata:

Network metadata:

1.2 Real-World Consequences of Unmanaged Metadata


2. Stripping File Metadata

2.1 ExifTool (All Platforms) — The Essential Tool

ExifTool is the definitive open-source tool for reading and removing metadata from virtually every file type. It is command-line based but powerful.

Installation:

Essential commands:

# View all metadata in a file
exiftool photo.jpg
exiftool document.docx
exiftool video.mp4

# Remove ALL metadata from a file
exiftool -all= photo.jpg

# Remove all metadata from ALL files in a directory
exiftool -all= /path/to/folder/

# Remove only GPS coordinates, keep other data
exiftool -GPS= photo.jpg

# Batch remove metadata and save originals as _original files
exiftool -all= -overwrite_original_in_place photo.jpg

# Verify: view what remains after stripping
exiftool photo.jpg  # Should show minimal or no EXIF fields

2.2 Photo Metadata on Mobile

Android — Scrambled EXIF:

iOS — Image Metadata Viewer:

Signal: Signal automatically strips metadata from photos shared through the app. Images received through Signal also have metadata stripped. This is one reason to share photos via Signal rather than via email or cloud link.

2.3 Document Metadata

Microsoft Office / LibreOffice documents:

Built-in stripping:

ExifTool for documents:

exiftool -all= document.docx
exiftool -all= presentation.pptx
exiftool -all= spreadsheet.xlsx

Safer approach: Convert to PDF with Dangerzone

Dangerzone (dangerzone.rocks) is a free, open-source tool that:

  1. Opens your document in a sandboxed container (Docker)
  2. Renders it to a pixel-perfect image
  3. Converts the image back to a PDF
  4. This process strips ALL embedded metadata, macros, active content, and potential malware from the document

The resulting PDF contains only visual information — no author names, no edit history, no embedded code. This is the safest method for sharing documents from unknown sources or sharing documents where you want zero metadata leakage.

2.4 Printer Steganography (Machine Identification Codes)

Many color laser printers embed invisible yellow tracking dots in every printed page. These dots encode:

This was how Reality Winner was identified. EFF maintains a list of printers known to use this system (eff.org/issues/printers).

Countermeasures:


3. Secure Data Deletion

3.1 Why Normal Deletion Doesn’t Work

When you “delete” a file, the operating system marks the space as available for reuse — but the data remains on the disk until it is overwritten by new data. Forensic tools (Autopsy, EnCase, FTK) can recover “deleted” files from conventional hard drives and sometimes from SSDs and flash storage.

3.2 Securely Deleting Files

On traditional hard drives (HDDs): Overwriting with random data makes recovery computationally infeasible.

On SSDs, flash drives, and phones: SSDs use wear-leveling algorithms that spread data across the physical storage in ways that make individual file overwriting unreliable. The data you “shred” may not be the data that ends up overwritten.

Best practice for SSDs and phones:

3.3 Securely Wiping Entire Drives

When retiring a device:

macOS:

Linux:

# Wipe an entire drive (replace /dev/sdX with target drive — BE CAREFUL)
sudo shred -vzn 3 /dev/sdX

# Or with dd:
sudo dd if=/dev/urandom of=/dev/sdX bs=1M status=progress

Windows:

3.4 Tails for Ephemeral Operation

The most thorough approach to data deletion: never write sensitive data to persistent storage in the first place.

Tails OS runs entirely in RAM. When you power off a Tails computer, all RAM is cleared. No traces remain on the host machine. This is the gold standard for sensitive work.

See the Tails OS Guide for setup instructions.


4. Browser and Application Data

4.1 Browser Data Cleanup

Browsers store significant amounts of potentially sensitive data:

What to clear regularly:

Firefox: Settings → Privacy & Security → Clear Data → select all categories → Clear

Automation: Configure Firefox to clear all data on close (Settings → Privacy & Security → Cookies and Site Data → Delete cookies and site data when Firefox is closed)

For maximum privacy: Use the Tor Browser (clears all session data on close by design) or use Firefox in Private Browsing mode for sensitive sessions.

4.2 Application Logs and Caches

Operating systems and applications generate extensive logs:

macOS:

Linux:

Windows:

4.3 Signal Cleanup

Signal stores messages on your device. When you delete a conversation or message in Signal, it is deleted from Signal’s local database — but may not be immediately overwritten in storage.


5. Data Minimization: Generating Less in the First Place

The most effective metadata management is not generating it at all.

Principles:

Monthly data hygiene practice:


This guide does not constitute legal advice. Laws vary by jurisdiction.

← Back to Index