Medical records contain some of the most sensitive personal data that exists — names, dates of birth, SSNs, diagnoses, medications, test results. Any tool that processes these files server-side is a privacy risk, and any redaction that leaves recoverable text is a HIPAA violation waiting to happen. PDF Genie runs most of its tools entirely in your browser, and our redaction rasterises pages so the removed text is physically gone. For medical-record workflows, that combination is the whole point.

AI-powered PHI redaction

Our Redact PDF tool includes an Auto-detect PII button that uses AI to identify protected health information — patient names, DOBs, MRNs, SSNs, diagnoses where they appear in structured fields, insurance IDs. It proposes redaction boxes you approve, saving 10-30 minutes per document vs manual scanning. All detection runs on text only; no page images leave the device.

Pulling relevant pages only

Often you need to share just the lab results page or the discharge summary from a 40-page chart. Extract Pages pulls the pages you need into a new PDF; Delete Pages does the inverse. Both run browser-side.

OCR for scanned charts

Older records are often scanned images. OCR PDF adds a text layer so subsequent redaction, search, and copy operations work. The OCR process runs on-server but files are deleted immediately after processing.

FAQ

Is PDF Genie HIPAA-compliant?⌄

We don't hold a HIPAA BAA today — that's on the roadmap. For covered-entity workflows, the safest approach is to use browser-only tools (merge, compress, redact, extract) which never upload files. Tools that require a server round-trip (OCR, AI redact/summarize) should be used only on documents you've already de-identified.

Can someone recover redacted patient data?⌄

No. Our redaction rasterises pages at 300 DPI before export; the underlying text stream is destroyed. Unlike PDF annotations that merely cover text, our output has no recoverable original.

Does the AI training model see patient data?⌄

No. Anthropic's Claude API, which powers our AI features, does not train on API inputs by default (per Anthropic's terms). We also don't store PDF text on our side; it's extracted, sent to Claude, and discarded.

Other use cases

Invoices →Contracts →Tax Documents →Resumes →Bank Statements →Academic Papers →