SmolDocling

256M-param VLM that converts any document to structured text

Price: Free / Open Source (Apache 2.0)
Reviewed: 2026-04-27
Verdict — Ship
3 Ships · 1 Skip

The Panel's Take

SmolDocling is a 256-million-parameter vision-language model from IBM Granite that converts documents (PDFs, scanned papers, tables, charts, forms) into clean, structured text, with accuracy that is notable for a model this small. It introduces a markup format called DocTags that captures not just text but also document structure, reading order, and element types (headings, captions, tables, code blocks) in a form that downstream models and parsers can reliably consume. This is the key innovation: DocTags gives the model a precise vocabulary for describing document elements, rather than leaving it to reconstruct structure from freeform text output.

The "smol" in the name is intentional. At 256M parameters, SmolDocling runs fast enough for production pipelines where larger VLMs would be prohibitively slow or expensive. Despite its compact size, IBM reports state-of-the-art performance across benchmarks covering multiple document types, outperforming much larger models on structured document parsing tasks.

Built on top of the docling project (58.7k GitHub stars), SmolDocling is open source under Apache 2.0 and available on HuggingFace. The technical report is on arXiv (2503.11576). For teams building RAG pipelines, document intelligence tools, or any system that needs to ingest unstructured documents at scale, this is a practical, deployable solution.
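
To make the "precise vocabulary" point concrete, here is a minimal sketch of how a downstream parser might consume tagged output. The tag names (`heading`, `text`, `caption`, `code`) and the `parse_tagged` helper are illustrative assumptions, not the actual DocTags specification, which is defined by the SmolDocling and docling projects and is richer than this.

```python
import re
from dataclasses import dataclass

# Illustrative tag names only (an assumption for this sketch); the real
# DocTags vocabulary may differ.
ELEMENT_TAGS = ("heading", "text", "caption", "code")


@dataclass
class Element:
    kind: str  # element type, taken from the tag name
    text: str  # tag content with surrounding whitespace stripped


# Matches <kind>...</kind> for any of the known element tags.
_TAG_RE = re.compile(
    r"<(?P<kind>" + "|".join(ELEMENT_TAGS) + r")>(?P<body>.*?)</(?P=kind)>",
    re.DOTALL,
)


def parse_tagged(markup: str) -> list[Element]:
    """Return elements in the order they appear, i.e. reading order."""
    return [
        Element(m.group("kind"), m.group("body").strip())
        for m in _TAG_RE.finditer(markup)
    ]


# Hypothetical model output for a one-page document.
sample = (
    "<heading>Quarterly Report</heading>"
    "<text>Revenue grew in all regions.</text>"
    "<caption>Figure 1: Revenue by region</caption>"
    "<code>print('hello')</code>"
)

elements = parse_tagged(sample)
headings = [e.text for e in elements if e.kind == "heading"]
```

Because every element arrives with an explicit type, a RAG preprocessing step could, for example, chunk on headings or index captions separately without guessing structure from raw text.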

Ship · 7.5/10

The reviews

256M params that actually handle real-world PDFs including tables, charts, and mixed layouts — this goes straight into my RAG preprocessing pipeline. The DocTags format is smart: giving the model a precise document vocabulary instead of asking it to improvise structure from scratch.

IBM's benchmark numbers for SmolDocling were measured on datasets curated by the same team. Real-world document parsing — especially for scanned documents with skew, noise, or unusual layouts — is where small VLMs consistently fall apart. Test it on your actual documents before committing it to production.
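
One way to act on that advice is a small regression check over your own documents. The sketch below is an assumption-laden example, not part of SmolDocling or docling: `flag_failures`, the `convert` callable, the sample data, and the 0.9 threshold are all hypothetical, and the similarity score is a plain character-level ratio from Python's standard `difflib`.

```python
from difflib import SequenceMatcher


def _normalize(s: str) -> str:
    """Collapse runs of whitespace so layout differences are not penalized."""
    return " ".join(s.split())


def similarity(expected: str, actual: str) -> float:
    """Similarity in [0, 1]; 1.0 means identical after normalization."""
    return SequenceMatcher(None, _normalize(expected), _normalize(actual)).ratio()


def flag_failures(samples, convert, threshold=0.9):
    """Return (name, score) for every sample scoring below the threshold.

    samples: iterable of (name, source, expected_text) tuples.
    convert: callable that turns a source into text (hypothetical wrapper
             around whatever model or pipeline is being evaluated).
    """
    failures = []
    for name, source, expected in samples:
        score = similarity(expected, convert(source))
        if score < threshold:
            failures.append((name, round(score, 3)))
    return failures


# Stand-in converter for demonstration; replace with a call to the real model.
def fake_convert(source: str) -> str:
    return source.upper()


# Strings stand in for documents here; real samples would be file paths or images.
samples = [
    ("clean_invoice", "TOTAL: 42", "TOTAL: 42"),
    ("skewed_scan", "total: 42", "Total: 42 EUR"),
]
failures = flag_failures(samples, fake_convert)
```

A character-level ratio is a crude metric; for tables or reading order, a structure-aware comparison would be more telling, but even this catches gross failures on skewed or noisy scans.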

Efficient document parsing is critical infrastructure for the AI economy — most enterprise knowledge lives in PDFs and Word docs, not clean databases. A 256M model that can do this well enough to be deployed in high-throughput pipelines removes a major bottleneck from enterprise AI adoption.

Finally being able to reliably extract content from design-heavy PDFs — charts, callouts, multi-column layouts — without everything turning into garbage text is genuinely useful for content repurposing workflows. DocTags also makes it easier to preserve the editorial structure of source documents.
