PDF Reader MCP is designed as a full-fidelity PDF intelligence layer for agents. The comparison below is category-based and focuses on the agent workflow: read a smart Agent Document Twin first, search cheaply when the task has a literal query, and request focused evidence only when the answer needs source-level proof.
| Capability | PDF Reader MCP | Text/CLI tools | Cloud PDF APIs | Generic filesystem MCP |
|---|---|---|---|---|
| MCP-native PDF tools | β V3 three-tool surface | β | β | |
| Preflight inspection and routing | β | β | β | |
| Literal search with evidence | β snippets, offsets, boxes, provenance | β | ||
| Text layer fidelity | β runs, lines, words, chars, metadata coverage | β | ||
| Agent Document Twin | β document map plus AST and evidence indexes | β | β | |
| Page rendering evidence | β bounded MCP image parts | β | β | |
| Region crop evidence | β PDF-coordinate crops | β | β | |
| Scanned-page OCR path | β configured local provider with provenance | β | β | |
| OCR-derived tables | β when OCR word boxes are available | β | β | |
| Table quality diagnostics | β cells, geometry, spans, warnings, continuation hints | β | β | |
| Formula/chart/figure/image enrichment | β configured visual-provider adapters | β | β | |
| Trust report | β hidden text, prompt-injection-like text, visual spoofing, unsafe links, redaction | β | β | |
| Accessibility report | β tagged-PDF, tag-visible coverage, forms, links, images, permissions, grades | β | β | |
| Citation chunks | β page, semantic, size, and table chunks | β | β | |
| Local-first default | β | β | β | β |
| No required API key | β | β | β | β |
| Reproducible release proof | β quality, corpus, provider, package-smoke, and release-gate artifacts | β | β | β |
Agents need more than extracted text. For high-value PDFs they need to know where content came from, which page or crop proves it, whether the reading order looks uncertain, whether a page needs OCR, whether a table has weak geometry, and whether hidden or unsafe content should be treated as untrusted data.
PDF Reader MCP exposes that as a compact V3 tool surface:
read_pdfis the default entrypoint. With onlysources, it profiles the PDF, chooses useful extraction options, and returns the linked Agent Document Twin.search_pdffinds source-backed text matches before spending more context on broad extraction or visual proof.pdf_evidencehandles focused follow-up operations:inspect,render_page,extract_regions,ocr_pages, andanalyze_regions.
- You want agents to start with one intelligent PDF read instead of learning a long list of extraction tools.
- You need stable page, element, chunk, crop, table, OCR, trust, and accessibility references for downstream citations.
- You need local-first execution and want OCR or visual models configured by the deployment, not selected by each request.
- You need source evidence for tables, charts, formulas, figures, and scanned pages.
- You want public benchmark artifacts and a release gate that prove the shipped capability surface.
PDF Reader MCP does not bundle heavy OCR, vision, formula, or layout model weights. The default package stays TypeScript-first and local-first; advanced OCR and visual understanding are enabled through explicit local providers and validated through provider benchmarks.