Docling

IBM's open-source document conversion library that parses PDF, DOCX, PPTX, and other formats into structured Markdown or JSON with table detection, figure extraction, and layout analysis.

Evaluated Mar 06, 2026 (0d ago) v2.x
Homepage ↗ Repo ↗ AI & Machine Learning python pdf docx pptx markdown json table-detection layout-analysis ibm rag document-ai
⚙ Agent Friendliness
64
/ 100
Can an agent use this?
🔒 Security
29
/ 100
Is it safe for agents?
⚡ Reliability
54
/ 100
Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality
--
Documentation
78
Error Messages
75
Auth Simplicity
100
Rate Limits
100

🔒 Security

TLS Enforcement
0
Auth Strength
0
Scope Granularity
0
Dep. Hygiene
78
Secret Handling
88

Processes documents locally after initial model download. New library with limited public security audit history — review dependencies carefully before use with sensitive enterprise documents.

⚡ Reliability

Uptime/SLA
0
Version Stability
72
Breaking Changes
68
Error Recovery
74
AF Security Reliability

Best When

You need high-quality structured output (tables, headings, figures, reading order) from complex multi-format documents for LLM or RAG pipelines.

Avoid When

You need fast lightweight extraction from simple text-only PDFs or require sub-second per-page processing throughput.

Use Cases

  • Convert PDF research papers or technical documents into clean Markdown for RAG chunking and vector ingestion
  • Extract structured tables from complex PDFs into JSON for downstream data analysis without manual bbox tuning
  • Parse DOCX and PPTX files alongside PDFs in a unified document ingestion pipeline
  • Identify and extract figures, captions, and section headings as structured JSON for document understanding workflows
  • Preprocess enterprise documents (contracts, reports, manuals) into structured format for LLM-based question answering

Not For

  • Simple PDF page manipulation like merging, splitting, or rotating — use pypdf for structural PDF operations
  • Real-time document processing at very low latency — layout analysis models add significant overhead per page
  • Scanned documents without a text layer — Docling's OCR pipeline is still maturing; dedicated OCR services may produce better results on low-quality scans

Interface

REST API
No
GraphQL
No
gRPC
No
MCP Server
No
SDK
Yes
Webhooks
No

Authentication

Methods: none
OAuth: No Scopes: No

Library — no authentication required. Model weights downloaded automatically on first use.

Pricing

Model: open_source
Free tier: Yes
Requires CC: No

MIT licensed. Released by IBM Research in 2024. Model weights are downloaded from HuggingFace Hub on first use.

Agent Metadata

Pagination
none
Idempotent
Full
Retry Guidance
Not documented

Known Gotchas

  • Model weights are downloaded from HuggingFace Hub on first use (~1GB); agents must ensure internet access and sufficient disk space are available before the first conversion call.
  • Docling is a young library (released 2024) with a rapidly evolving API — pin to a specific version in production to avoid breaking changes between minor releases.
  • Layout analysis adds significant per-page processing time (seconds per page on CPU); agents with time budgets must set appropriate timeouts and consider GPU acceleration for bulk workloads.
  • The DoclingDocument JSON output schema is Docling-specific — downstream consumers must be built against this schema or use the .export_to_markdown() convenience method for portable output.
  • DOCX and PPTX conversion quality depends on whether the source was created with standard office tools; files with complex embedded objects, unusual fonts, or macros may produce incomplete extraction.

Alternatives

Full Evaluation Report

Detailed scoring breakdown, competitive positioning, security analysis, and improvement recommendations for Docling.

$99

Scores are editorial opinions as of 2026-03-06.

5173
Packages Evaluated
26151
Need Evaluation
173
Need Re-evaluation
Community Powered