Metadata (PDF)
The hidden information embedded in a PDF about the file itself — title, author, subject, keywords, creation date, and the software that produced it.
PDF metadata is the "data about the data" — information that describes the file rather than its content. Every PDF carries metadata in two forms: the traditional document info dictionary and the more modern XMP (Extensible Metadata Platform) stream.
Common metadata fields
- Title — the document's display title (often different from the filename)
- Author — the person who created the document
- Subject — a short description of what the document is about
- Keywords — comma-separated tags for classification and search
- Creator — the application that created the original document (e.g. "Microsoft Word")
- Producer — the application that wrote the PDF (e.g. "Adobe PDF Library")
- Creation date and Modification date
Why metadata matters
- Searchability. Document management systems index PDF metadata — good titles and keywords make files findable.
- Privacy. Metadata can leak information you don't want to share. A legal document might show the attorney's name as author and the original Word file's modification history.
- Archival. PDF/A files have strict metadata requirements for long-term preservation.
Cleaning metadata before sharing
Before sending a PDF externally — especially redacted documents, contracts, or anonymous submissions — review the metadata. The author field, in particular, often contains information that survives rounds of editing.
Tools
- Edit Metadata lets you set or clear the title, author, subject, and keywords on a PDF.