PDF GeniePDF Genie
compressbenchmarkresearch

PDF Compression Benchmark: What Actually Changes at Each Quality Level

We compressed 200 real PDFs across five content types to see what each compression level actually does to file size, text quality, and image fidelity. Here's what we found.

P

By PDF Genie Editorial Team

·6 min read·1,061 words

Reviewed by the PDF Genie editorial team. See our editorial standards.

We spend a lot of time looking at other people's PDFs. Contracts, invoices, research papers, photo portfolios, scanned government forms — if it has a .pdf extension, we have probably run it through our Compress PDF tool at least once.

One question keeps coming up from users: "Which compression level should I actually pick?" The answer is not obvious. It depends entirely on what kind of content is inside your file. We wanted to give a more concrete answer than the generic 20-40% / 50-70% / 70-90% ranges we publish on the tool page, so we ran a small internal benchmark and wrote down what we observed.

The setup

We assembled 200 real PDFs from our own document archive and a set of public-domain sources, split roughly evenly across five content categories:

  • Text-heavy reports — annual reports, academic papers, specifications
  • Scanned documents — OCR-processed receipts, signed forms, handwritten notes
  • Image-heavy portfolios — product photography, real-estate listings, architecture drawings
  • Mixed business documents — slide decks, marketing one-pagers, invoices with logos
  • Form-based PDFs — tax forms, government filings, contract templates with fillable fields
Each file was compressed three times through our Compress PDF tool at each of the three quality levels (Less, Recommended, Extreme), and the output was compared against the original on three dimensions: file-size reduction, text searchability, and image quality at 100% zoom.

What we actually found

Text-heavy reports behave predictably

Documents that are mostly text with a few logos or charts compress the least in absolute percentage terms — because text is already efficiently encoded in the PDF stream. Typical results:

  • Less compression: 10-20% reduction, text and images both untouched
  • Recommended: 25-45% reduction, perceptibly softer logos but text still crisp
  • Extreme: 50-70% reduction, visible softness on any embedded image, text still perfectly readable
If your file is mostly text, Less is usually the right pick — you rarely save enough with the harder settings to justify the quality cost.

Scanned documents are where compression really pays off

Scanned PDFs are essentially a collection of rasterized page images, which means every byte in the file is compressible image data. This is where we saw the biggest reductions:

  • Less: 30-50%
  • Recommended: 60-80%
  • Extreme: 80-93%
If you are compressing a scanned contract or a multi-page receipt dump, Extreme often works well — the original scan was usually not high enough quality to notice the additional compression. The main thing to watch: if the document was scanned with an OCR text layer, aggressive compression may rasterize over it. Run it through our OCR PDF tool afterward if you need the text to remain searchable.

Image-heavy portfolios are where quality matters most

Design portfolios, product photography PDFs, and real-estate listings are the category where compression choice is the most visible. Our observations:

  • Less preserves colors and edges for print-quality use. Prefer this for anything you might present to a client.
  • Recommended is fine for email attachment and web review but you can see JPEG artifacts on edges if you look carefully.
  • Extreme is reasonable only for throwaway review copies. Artifacts are obvious.

Mixed business documents behave like their dominant content

A marketing one-pager with a photo hero, a chart, and three paragraphs of text behaves closer to the image side of the spectrum than the text side — its file size is dominated by the photo. Rule of thumb: if there's more than one big image in your PDF, the compression will feel more like a scanned document than like a text report.

Form-based PDFs need special handling

Filled tax forms, contracts, and government filings often embed form field metadata that doesn't respond well to rasterizing compression. We saw:

  • Less is safe for these — typically 10-20% reduction without touching form fields.
  • Recommended and Extreme sometimes flatten form fields into their rendered appearance, making the file no longer editable. This is fine if you want to lock the form (and you might — see our Flatten PDF tool for the dedicated version of that), but surprising if you just wanted a smaller file.

The unexpected finding: file size is a lousy proxy for compression quality

Before running this, we would have guessed that bigger compression gains correlate with worse output. That's mostly true — but not always. The best-case scenarios for Extreme compression are scanned documents that were originally saved at much higher DPI than necessary for on-screen reading. Running those through Extreme can produce a 90% reduction while visually identical on screen. The worst cases are image-heavy portfolios where the original images are already compressed to their minimum quality target — further compression there accumulates artifacts quickly.

Our general recommendation

Think about what's inside the file before you pick the level:

  • Text-dominant PDFs: Use Less. Reductions are modest but quality is untouched.
  • Scanned documents: Use Extreme. The original was usually not high-fidelity enough for the quality loss to matter.
  • Photo/design portfolios: Use Less for client-facing, Recommended for review copies.
  • Filled forms: Use Less. Avoid the harder settings if you need form fields to remain editable.
  • Mixed business documents: Try Recommended first; drop to Less if you notice quality issues.
If you want to test this yourself, run the same file through our Compress PDF tool at each level and compare. The tool runs entirely in your browser — no uploads, no limits on how many times you can experiment. For most use cases, one test run is all you need to decide.

What we're not claiming

A few honest caveats:

  • This is an internal benchmark, not peer-reviewed research. Our 200-file sample is balanced across categories but it's not randomly drawn from the global population of PDFs.
  • Results will shift with different source content, and there's real variance inside each category. We reported the middle of what we observed, not the extremes.
  • Different compression tools (iLovePDF, Smallpdf, Adobe) use different backends with different defaults. The ranges above describe our Compress PDF tool specifically.
We run these kinds of quick tests whenever we make a material change to our compression pipeline. If you'd like to see specific numbers for a specific file type, get in touch — we're always happy to look at real-world cases and will often publish what we learn.
Share this article:𝕏 TwitterLinkedIn
PDF Genie

Try it yourself — free

40+ PDF tools, no signup required. Works right in your browser.

Explore PDF Genie →