Docling

IBM's open-source document conversion library that parses PDF, DOCX, PPTX, and other formats into structured Markdown or JSON with table detection, figure extraction, and layout analysis.

Evaluated Mar 06, 2026 (0d ago) v2.x

Homepage ↗ Repo ↗ AI & Machine Learning python pdf docx pptx markdown json table-detection layout-analysis ibm rag document-ai

⚙ Agent Friendliness

/ 100

Can an agent use this?

🔒 Security

/ 100

Is it safe for agents?

⚡ Reliability

/ 100

Does it work consistently?

Score Breakdown

⚙ Agent Friendliness

MCP Quality

Documentation

Error Messages

Auth Simplicity

100

Rate Limits

100

🔒 Security

TLS Enforcement

Auth Strength

Scope Granularity

Dep. Hygiene

Secret Handling

Processes documents locally after initial model download. New library with limited public security audit history — review dependencies carefully before use with sensitive enterprise documents.

⚡ Reliability

Uptime/SLA

Version Stability

Breaking Changes

Error Recovery

Best When

You need high-quality structured output (tables, headings, figures, reading order) from complex multi-format documents for LLM or RAG pipelines.

Avoid When

You need fast lightweight extraction from simple text-only PDFs or require sub-second per-page processing throughput.

Use Cases

• Convert PDF research papers or technical documents into clean Markdown for RAG chunking and vector ingestion
• Extract structured tables from complex PDFs into JSON for downstream data analysis without manual bbox tuning
• Parse DOCX and PPTX files alongside PDFs in a unified document ingestion pipeline
• Identify and extract figures, captions, and section headings as structured JSON for document understanding workflows
• Preprocess enterprise documents (contracts, reports, manuals) into structured format for LLM-based question answering

Not For

• Simple PDF page manipulation like merging, splitting, or rotating — use pypdf for structural PDF operations
• Real-time document processing at very low latency — layout analysis models add significant overhead per page
• Scanned documents without a text layer — Docling's OCR pipeline is still maturing; dedicated OCR services may produce better results on low-quality scans

Interface

REST API

GraphQL

gRPC

MCP Server

SDK

Yes

Webhooks

Authentication

Methods: none

OAuth: No Scopes: No

Library — no authentication required. Model weights downloaded automatically on first use.

Pricing

Model: open_source

Free tier: Yes

Requires CC: No

MIT licensed. Released by IBM Research in 2024. Model weights are downloaded from HuggingFace Hub on first use.

Agent Metadata

Pagination

none

Idempotent

Full

Retry Guidance

Not documented

Known Gotchas

⚠ Model weights are downloaded from HuggingFace Hub on first use (~1GB); agents must ensure internet access and sufficient disk space are available before the first conversion call.
⚠ Docling is a young library (released 2024) with a rapidly evolving API — pin to a specific version in production to avoid breaking changes between minor releases.
⚠ Layout analysis adds significant per-page processing time (seconds per page on CPU); agents with time budgets must set appropriate timeouts and consider GPU acceleration for bulk workloads.
⚠ The DoclingDocument JSON output schema is Docling-specific — downstream consumers must be built against this schema or use the .export_to_markdown() convenience method for portable output.
⚠ DOCX and PPTX conversion quality depends on whether the source was created with standard office tools; files with complex embedded objects, unusual fonts, or macros may produce incomplete extraction.

Alternatives

pdfplumber-api pypdf-api camelot-api unstructured azure-document-intelligence

Full Evaluation Report

Detailed scoring breakdown, competitive positioning, security analysis, and improvement recommendations for Docling.

$99

API endpoint ↗ Agent guide ↗ Report inaccuracy

Scores are editorial opinions as of 2026-03-06.