PDF GeniePDF Genie

PDF Tools for Medical Records

Redact PHI, merge charts, compress for sharing, extract specific pages — handle medical PDFs with privacy built-in.

Recommended tools

Medical PDFs need privacy-first tools

Medical records contain some of the most sensitive personal data that exists — names, dates of birth, SSNs, diagnoses, medications, test results. Any tool that processes these files server-side is a privacy risk, and any redaction that leaves recoverable text is a HIPAA violation waiting to happen. PDF Genie runs most of its tools entirely in your browser, and our redaction rasterises pages so the removed text is physically gone. For medical-record workflows, that combination is the whole point.

AI-powered PHI redaction

Our Redact PDF tool includes an Auto-detect PII button that uses AI to identify protected health information — patient names, DOBs, MRNs, SSNs, diagnoses where they appear in structured fields, insurance IDs. It proposes redaction boxes you approve, saving 10-30 minutes per document vs manual scanning. All detection runs on text only; no page images leave the device.

Pulling relevant pages only

Often you need to share just the lab results page or the discharge summary from a 40-page chart. Extract Pages pulls the pages you need into a new PDF; Delete Pages does the inverse. Both run browser-side.

OCR for scanned charts

Older records are often scanned images. OCR PDF adds a text layer so subsequent redaction, search, and copy operations work. The OCR process runs on-server but files are deleted immediately after processing.

FAQ

Is PDF Genie HIPAA-compliant?

We don't hold a HIPAA BAA today — that's on the roadmap. For covered-entity workflows, the safest approach is to use browser-only tools (merge, compress, redact, extract) which never upload files. Tools that require a server round-trip (OCR, AI redact/summarize) should be used only on documents you've already de-identified.

Can someone recover redacted patient data?

No. Our redaction rasterises pages at 300 DPI before export; the underlying text stream is destroyed. Unlike PDF annotations that merely cover text, our output has no recoverable original.

Does the AI training model see patient data?

No. Anthropic's Claude API, which powers our AI features, does not train on API inputs by default (per Anthropic's terms). We also don't store PDF text on our side; it's extracted, sent to Claude, and discarded.

Other use cases