AI tool comparison

Libretto vs SmolDocling

Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.

Developer Tools

Libretto

AI browser automation that doesn't break every other deploy

Verdict: Ship
Panel ship: 75%
Community: no votes yet
Entry: Paid

Libretto is an open-source TypeScript toolkit for building and maintaining reliable browser automations. Most AI-driven browser tools use probabilistic reasoning to select elements at runtime. Libretto instead has the AI generate deterministic selectors and action sequences upfront, then executes them with no LLM involvement at runtime. The AI is your authoring tool, not your runtime dependency.

The core insight: most AI browser automations fail in production because they call an LLM on every page interaction. Libretto flips this by using AI to write and update the automation scripts while running them as ordinary code. When a site changes and an automation breaks, Libretto detects the failure and prompts you to let the AI update the selector, after which the script is deterministic again.

Built by the team at Saffron Health, the library hit the Hacker News front page today and is generating discussion as a more pragmatic alternative to fully autonomous browser agents. For anyone who has tried Playwright with AI wrappers and found them unreliable in CI/CD, this is the architecture that has been missing.
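The authoring-time versus runtime split described above can be illustrated with a small, library-free sketch. This is not Libretto's API (Libretto is a TypeScript toolkit, and none of its real function names are used here). The `Step`, `run`, `run_with_repair`, and `propose_selector` names are hypothetical, the "page" is a plain dictionary standing in for a browser DOM, and the repair helper is a stand-in for whatever AI-assisted authoring step proposes a new selector.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# A fake "page": selector -> text content. Stands in for a real browser DOM.
Page = Dict[str, str]


class SelectorBroken(Exception):
    """Raised when a stored selector no longer matches anything on the page."""

    def __init__(self, step_index: int, selector: str):
        super().__init__(f"step {step_index}: selector {selector!r} not found")
        self.step_index = step_index
        self.selector = selector


@dataclass
class Step:
    selector: str  # fixed at authoring time, never chosen at runtime
    action: str    # "read" is the only action in this toy example


def run(steps: List[Step], page: Page) -> List[str]:
    """Execute stored steps with plain lookups; no model call happens here."""
    results = []
    for i, step in enumerate(steps):
        if step.selector not in page:
            raise SelectorBroken(i, step.selector)
        results.append(page[step.selector])
    return results


def run_with_repair(
    steps: List[Step],
    page: Page,
    propose_selector: Callable[[str, Page], str],
) -> List[str]:
    """On the first broken selector, store a proposed replacement and retry once."""
    try:
        return run(steps, page)
    except SelectorBroken as err:
        # Hypothetical authoring-time hook: in a real tool a human would
        # review the proposal before it is saved into the script.
        steps[err.step_index].selector = propose_selector(err.selector, page)
        return run(steps, page)


# Usage: the site is redesigned and the element gets a new selector.
steps = [Step("#price", "read")]
old_page = {"#price": "$10"}
new_page = {"[data-test=price]": "$12"}


def fake_ai(broken: str, page: Page) -> str:
    # Stand-in for the AI step: picks the only selector left on the page.
    return next(iter(page))


print(run(steps, old_page))                       # ['$10']
print(run_with_repair(steps, new_page, fake_ai))  # ['$12']
print(steps[0].selector)                          # [data-test=price]
```

The point of the design is that `run` contains no model call at all, so two runs against the same page give the same result. The repair hook only fires after a concrete failure, and its output is written back into the stored steps rather than being recomputed on every run.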

Developer Tools

SmolDocling

256M-param VLM that converts any document to structured text

Verdict: Ship
Panel ship: 75%
Community: no votes yet
Entry: Free

SmolDocling is a 256-million-parameter vision-language model from IBM Granite that converts documents (PDFs, scanned papers, tables, charts, forms) into clean, structured text. Its key innovation is a new markup format called DocTags, which captures not just text but document structure, reading order, and element types (headings, captions, tables, code blocks) in a way that downstream models and parsers can reliably consume. DocTags gives the model a precise vocabulary for describing document elements, so it does not have to reconstruct structure from freeform text output.

The "smol" in the name is intentional: at 256M parameters, SmolDocling runs fast enough to be deployed in production pipelines where larger VLMs would be prohibitively slow or expensive. Despite its compact size, IBM reports that it achieves state-of-the-art performance on benchmarks covering multiple document types, outperforming much larger models on structured document parsing tasks.

Built on top of the docling project (58.7k GitHub stars), SmolDocling is open source under Apache 2.0 and available on HuggingFace. The technical report is on arXiv (2503.11576). For teams building RAG pipelines, document intelligence tools, or any system that needs to ingest unstructured documents at scale, this is a practical, deployable option.
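To make the idea of a tagged document vocabulary concrete, here is a minimal sketch of how a downstream parser might consume such output. The tag names (`heading`, `text`, `caption`, `code`) are illustrative assumptions, not the real DocTags vocabulary, which is defined in the SmolDocling technical report. The sample string is hand-written, not model output, and `parse_elements` and `chunks_for_rag` are hypothetical helper names.

```python
import re
from typing import Dict, List, Tuple

# Illustrative tag names only; the real DocTags vocabulary may differ from
# this simplified subset. The sample is hand-written, not model output.
SAMPLE = (
    "<heading>Quarterly report</heading>"
    "<text>Revenue grew in Q3.</text>"
    "<caption>Figure 1: revenue by region</caption>"
    "<code>print('hello')</code>"
)

# Matches <tag>body</tag> pairs where the closing tag repeats the opening one.
ELEMENT = re.compile(r"<(?P<tag>\w+)>(?P<body>.*?)</(?P=tag)>", re.DOTALL)


def parse_elements(markup: str) -> List[Tuple[str, str]]:
    """Return (element_type, text) pairs in the order they appear."""
    return [
        (m.group("tag"), m.group("body").strip())
        for m in ELEMENT.finditer(markup)
    ]


def chunks_for_rag(elements: List[Tuple[str, str]]) -> List[Dict]:
    """Group non-heading elements under the most recent heading."""
    chunks: List[Dict] = []
    current: Dict = {"heading": None, "parts": []}
    for kind, text in elements:
        if kind == "heading":
            # Close the previous chunk if it holds anything.
            if current["parts"] or current["heading"]:
                chunks.append(current)
            current = {"heading": text, "parts": []}
        else:
            current["parts"].append((kind, text))
    if current["parts"] or current["heading"]:
        chunks.append(current)
    return chunks


elements = parse_elements(SAMPLE)
print([kind for kind, _ in elements])  # ['heading', 'text', 'caption', 'code']
print(len(chunks_for_rag(elements)))   # 1
```

Because each element arrives with an explicit type, the chunking step can treat captions, code, and body text differently without guessing structure from whitespace or font cues.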

Decision | Libretto | SmolDocling
Panel verdict | Ship · 3 ship / 1 skip | Ship · 3 ship / 1 skip
Community | No community votes yet | No community votes yet
Pricing | Open Source | Free / Open Source (Apache 2.0)
Best for | AI browser automation that doesn't break every other deploy | 256M-param VLM that converts any document to structured text
Category | Developer Tools | Developer Tools

Reviewer scorecard

Builder
Libretto: 80/100 · ship

This is the right mental model for production browser automation. Using AI for authoring but not runtime means you get consistency in CI without random failures at 2am. I've been waiting for someone to build this properly.

SmolDocling: 80/100 · ship

256M params that actually handle real-world PDFs, including tables, charts, and mixed layouts. This goes straight into my RAG preprocessing pipeline. The DocTags format is smart: it gives the model a precise document vocabulary instead of asking it to improvise structure from scratch.

Skeptic
Libretto: 45/100 · skip

The 'AI updates your selectors' workflow sounds great until you're reviewing 50 AI-generated selector changes after a site redesign. You've just moved the flakiness from runtime to the maintenance loop. Also, 37 stars is very early; I'd wait for production case studies.

SmolDocling: 45/100 · skip

IBM's benchmark numbers for SmolDocling were measured on datasets curated by the same team. Real-world document parsing, especially for scanned documents with skew, noise, or unusual layouts, is where small VLMs consistently fall apart. Test it on your actual documents before committing it to production.

Futurist
Libretto: 80/100 · ship

The deterministic-at-runtime pattern will become the standard architecture for AI-assisted automation. Libretto is arriving exactly as enterprises start demanding reliability SLAs from their AI tooling. Early movers will have a significant advantage.

SmolDocling: 80/100 · ship

Efficient document parsing is critical infrastructure for the AI economy, because most enterprise knowledge lives in PDFs and Word docs, not clean databases. A 256M model that does this well enough for high-throughput pipelines removes a major bottleneck from enterprise AI adoption.

Creator
Libretto: 80/100 · ship

As someone who automates repetitive web tasks constantly, this solves my biggest frustration: AI-written automations that fall apart the moment a site updates its CSS. The auto-repair loop is exactly what I need for long-running workflows.

SmolDocling: 80/100 · ship

Finally being able to reliably extract content from design-heavy PDFs (charts, callouts, multi-column layouts) without everything turning into garbage text is genuinely useful for content repurposing workflows. DocTags also makes it easier to preserve the editorial structure of source documents.
