Question 1

Which is better: Claude Files API & Token-Efficient Tool Use or qmd?

Accepted Answer

Based on our expert panel, Claude Files API & Token-Efficient Tool Use has a stronger verdict with a 75% Ship rate. Claude Files API & Token-Efficient Tool Use received a panel verdict of Ship and qmd received Mixed.

Question 2

Is Claude Files API & Token-Efficient Tool Use free?

Accepted Answer

Claude Files API & Token-Efficient Tool Use pricing: Pay-as-you-go via Anthropic API token pricing; no separate Files API surcharge announced

Question 3

Is qmd free?

Accepted Answer

qmd pricing: Free, open source (MIT)

Question 4

What do experts say about Claude Files API & Token-Efficient Tool Use vs qmd?

Accepted Answer

Claude Files API & Token-Efficient Tool Use: Anthropic's Files API lets developers upload documents once and reference them across multiple Claude API calls, slashing redundant token usage and reducing latency at scale. Paired with new token-efficient tool use patterns, the update targets agentic and multi-step workflows where repeated context injection was previously a costly bottleneck. Together, these additions make building production-grade Claude integrations meaningfully cheaper and faster. qmd: qmd is a lightweight local search engine built by Tobi Luetke, CEO of Shopify, for indexing and querying personal knowledge bases, documentation, and meeting notes — entirely offline. It combines three retrieval approaches in a single pipeline: BM25 full-text search for exact keyword matches, vector semantic search via ONNX-based embeddings, and LLM re-ranking using GGUF models through node-llama-cpp. All three stages run locally with no cloud dependency.

The tool ships in multiple deployment modes: a CLI for ad-hoc queries, a Node.js library for programmatic use, an HTTP service for local API access, and — most useful for AI workflows — a native MCP server that lets Claude Code, Cursor, and similar editors query your local knowledge base directly during coding sessions. The hybrid retrieval approach means it handles both "find the exact error message from last week's standup notes" and "what was our decision about the auth architecture" equally well.

What makes this notable beyond its technical approach is provenance: Luetke shipped it as a personal tool he actually uses, not a startup product. The GitHub history shows active iteration and he's been talking about it on X. It's a credible signal of where pragmatic AI-augmented knowledge management is heading for technical users who prefer local-first tools.

Claude Files API & Token-Efficient Tool Use vs qmd

Claude Files API & Token-Efficient Tool Use

qmd

Bookmarks