PDF Tools for Medical Records
Redact PHI, merge charts, compress for sharing, and extract specific pages: handle medical PDFs with privacy built in.
Recommended tools
Redact PDF
Permanently black out sensitive text or regions from a PDF.
Merge PDF
Combine multiple PDFs into one file, in the order you want.
Extract Pages
Pick specific pages from a PDF and save them as a new file.
Compress PDF
Reduce file size while keeping the best quality possible.
Protect PDF
Add a password to encrypt and protect a PDF.
OCR PDF
Recognize text in scanned PDFs so you can search and copy it.
Medical PDFs need privacy-first tools
Medical records contain some of the most sensitive personal data that exists — names, dates of birth, SSNs, diagnoses, medications, test results. Any tool that processes these files server-side is a privacy risk, and any redaction that leaves recoverable text is a HIPAA violation waiting to happen. PDF Genie runs most of its tools entirely in your browser, and our redaction rasterises pages so the removed text is physically gone. For medical-record workflows, that combination is the whole point.
AI-powered PHI redaction
Our Redact PDF tool includes an Auto-detect PII button that uses AI to identify protected health information: patient names, DOBs, MRNs, SSNs, insurance IDs, and diagnoses where they appear in structured fields. It proposes redaction boxes for you to approve, saving 10-30 minutes per document compared with scanning manually. Detection runs on extracted text only: the text is sent to the AI service for analysis, and no page images leave the device.
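The propose-then-approve flow can be pictured with a small Python sketch. It is an illustration, not PDF Genie's implementation. A few regular expressions stand in for the AI detector, and the names `PATTERNS`, `Proposal`, `propose_redactions`, and `apply_approved` are hypothetical. It also masks characters in a string, whereas the real tool redacts regions of a page.

```python
from __future__ import annotations

import re
from dataclasses import dataclass

# Hypothetical patterns, for illustration only. Regexes are a stand-in for the
# AI detector and miss most PHI (names, addresses, free-text diagnoses).
PATTERNS = {
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "DOB": re.compile(r"\b\d{2}/\d{2}/\d{4}\b"),
    "MRN": re.compile(r"\bMRN[:\s]*\d{6,10}\b"),
}


@dataclass(frozen=True)
class Proposal:
    kind: str   # which pattern matched
    start: int  # offset of the first matched character
    end: int    # offset one past the last matched character
    text: str   # the matched text, shown to the reviewer


def propose_redactions(page_text: str) -> list[Proposal]:
    """Return candidate redactions in reading order; nothing is removed yet."""
    found = [
        Proposal(kind, m.start(), m.end(), m.group())
        for kind, pattern in PATTERNS.items()
        for m in pattern.finditer(page_text)
    ]
    return sorted(found, key=lambda p: p.start)


def apply_approved(page_text: str, approved: list[Proposal]) -> str:
    """Mask only the proposals the reviewer approved."""
    out = page_text
    # Work from the end of the text so earlier offsets stay valid.
    for p in sorted(approved, key=lambda p: p.start, reverse=True):
        out = out[:p.start] + "█" * (p.end - p.start) + out[p.end:]
    return out


if __name__ == "__main__":
    sample = "Patient DOB 04/12/1961, MRN: 00482913, SSN 123-45-6789."
    proposals = propose_redactions(sample)
    for p in proposals:
        print(p.kind, repr(p.text))
    # The reviewer approves everything except the date of birth.
    approved = [p for p in proposals if p.kind != "DOB"]
    print(apply_approved(sample, approved))
```

Keeping detection and masking as separate steps is what puts a reviewer in the loop: nothing is removed until a proposal has been approved.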
Pulling relevant pages only
Often you need to share just the lab results page or the discharge summary from a 40-page chart. Extract Pages pulls the pages you need into a new PDF; Delete Pages does the inverse. Both run browser-side.
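As a rough sketch of how the two operations mirror each other, the Python below parses a page selection such as "12, 30-32" and computes what each operation would keep. The selection syntax and the function names `parse_pages` and `pages_after_delete` are assumptions made for illustration. They are not PDF Genie's actual interface.

```python
def parse_pages(spec: str, page_count: int) -> list[int]:
    """Turn a 1-based selection like '12, 30-32' into sorted, unique page numbers."""
    pages = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        first, sep, last = part.partition("-")
        lo = int(first)
        hi = int(last) if sep else lo
        if lo < 1 or hi > page_count or lo > hi:
            raise ValueError(f"page range {part!r} is outside 1-{page_count}")
        pages.update(range(lo, hi + 1))
    return sorted(pages)


def pages_after_delete(spec: str, page_count: int) -> list[int]:
    """Pages that remain when the selected pages are deleted instead of extracted."""
    removed = set(parse_pages(spec, page_count))
    return [p for p in range(1, page_count + 1) if p not in removed]


if __name__ == "__main__":
    # Share the lab results (page 12) and discharge summary (pages 30-32)
    # from a 40-page chart.
    print(parse_pages("12, 30-32", 40))
    # Deleting pages 1-38 keeps the last two pages.
    print(pages_after_delete("1-38", 40))
```

For any selection, the extracted pages and the pages left after deletion together cover the whole document exactly once. When most of a chart should be dropped, extracting the few pages you need is usually the shorter selection to write.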
OCR for scanned charts
Older records are often scanned images. OCR PDF adds a text layer so that subsequent redaction, search, and copy operations work. OCR runs on our servers, so the file is uploaded; it is deleted immediately after processing.
FAQ
Is PDF Genie HIPAA-compliant?
We don't offer a HIPAA Business Associate Agreement (BAA) today; that's on the roadmap. For covered-entity workflows, the safest approach is to use the browser-only tools (merge, compress, manual redact, extract), which never upload files. Tools that require a server round trip (OCR, AI redact/summarize) should be used only on documents you've already de-identified.
Can someone recover redacted patient data?
No. Our redaction rasterises pages at 300 DPI before export; the underlying text stream is destroyed. Unlike PDF annotations that merely cover text, our output has no recoverable original.
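To give a sense of what 300 DPI means in practice, the Python sketch below converts a page size in PDF points (1/72 inch) to pixel dimensions. The US Letter page and the uncompressed 8-bit RGB estimate are assumptions chosen for illustration. The text above states only the 300 DPI figure, and exported files are normally compressed to far less than the raw size.

```python
POINTS_PER_INCH = 72  # default PDF user-space unit: 1 point = 1/72 inch


def raster_size(width_pt: float, height_pt: float, dpi: int = 300) -> tuple[int, int]:
    """Pixel dimensions of a page rendered at the given resolution."""
    return (
        round(width_pt * dpi / POINTS_PER_INCH),
        round(height_pt * dpi / POINTS_PER_INCH),
    )


def raw_rgb_bytes(width_px: int, height_px: int) -> int:
    """Uncompressed size at 3 bytes per pixel (8-bit RGB, an assumption)."""
    return width_px * height_px * 3


if __name__ == "__main__":
    # US Letter is 8.5 x 11 inches, i.e. 612 x 792 points.
    w, h = raster_size(612, 792)
    print(w, h)                 # 2550 3300
    print(raw_rgb_bytes(w, h))  # 25245000 bytes before any image compression
```

Once a page has been rendered this way, it is a grid of pixels. The blacked-out area is simply black pixels, with no text object left underneath to copy or search.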
Is patient data used to train the AI model?
No. Anthropic's Claude API, which powers our AI features, does not train on API inputs by default (per Anthropic's terms). We also don't store PDF text on our side; it's extracted, sent to Claude, and discarded.