PDF GeniePDF Genie

Metadata (PDF)

The hidden information embedded in a PDF about the file itself — title, author, subject, keywords, creation date, and the software that produced it.

PDF metadata is the "data about the data" — information that describes the file rather than its content. Every PDF carries metadata in two forms: the traditional document info dictionary and the more modern XMP (Extensible Metadata Platform) stream.

Common metadata fields

  • Title — the document's display title (often different from the filename)
  • Author — the person who created the document
  • Subject — a short description of what the document is about
  • Keywords — comma-separated tags for classification and search
  • Creator — the application that created the original document (e.g. "Microsoft Word")
  • Producer — the application that wrote the PDF (e.g. "Adobe PDF Library")
  • Creation date and Modification date

Why metadata matters

  • Searchability. Document management systems index PDF metadata — good titles and keywords make files findable.
  • Privacy. Metadata can leak information you don't want to share. A legal document might show the attorney's name as author and the original Word file's modification history.
  • Archival. PDF/A files have strict metadata requirements for long-term preservation.

Cleaning metadata before sharing

Before sending a PDF externally — especially redacted documents, contracts, or anonymous submissions — review the metadata. The author field, in particular, often contains information that survives rounds of editing.

Tools

  • Edit Metadata lets you set or clear the title, author, subject, and keywords on a PDF.

Alat berkaitan