AI tool comparison
Recall vs tldr MCP Gateway
Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.
Developer Tools
Recall
Find any file on your machine with a sentence — no tags, no indexing
75%
Panel ship
—
Community
Free
Entry
Recall is a local-first multimodal semantic search tool that lets you find any file on your computer using natural language — images, PDFs, audio, video, and text — without any manual tagging, folder organization, or metadata. Ask "that invoice from the dentist last spring" or "photo of the whiteboard with the Q3 roadmap" and it surfaces the right file. Under the hood, Recall uses Google's Gemini Embedding 2 to generate semantic embeddings for all your files and stores them in ChromaDB, a local vector database that runs entirely on your machine. Nothing leaves your device. The Raycast extension adds a visual grid UI so you can search from anywhere on macOS without opening a terminal. First-run indexing can take 20-30 minutes for large libraries, but subsequent queries are near-instant. The project is MIT-licensed and built by a solo developer. It's a clear response to the frustration that Spotlight, Find, and Windows Search still rely heavily on filename and metadata matching even in 2026. As Gemini Embedding 2 is free within generous limits, the operating cost is essentially zero for personal use.
Developer Tools
tldr MCP Gateway
Shrink 41+ MCP tool schemas by 86% before they hit your model
75%
Panel ship
—
Community
Paid
Entry
tldr is a local proxy that sits between your AI coding harness and upstream MCP servers, solving one of the most underappreciated problems in agentic workflows: context bloat from tool schema proliferation. When you connect GitHub MCP, filesystem MCP, and a few others, you can easily be sending 24,000+ tokens of tool schemas to the model before any work begins. Instead of passing all those schemas directly, tldr exposes exactly five wrapper tools to the model: search_tools, execute_plan, call_raw, inspect_tool, and get_result. The model learns which underlying tools exist on-demand through search_tools, then calls them through the proxy. GitHub MCP's 24,473-token schema surface compresses to 3,482 tokens — an 86% reduction. Output responses are further compressed through field stripping, a 4,096-token cap, and a 64KB byte limit. This is a genuinely practical solution for power users running multi-MCP setups who've noticed degraded performance as their tool count grows. The tradeoff is one extra hop of indirection, but the token savings pay for themselves in improved model attention and lower API costs.
Reviewer scorecard
“ChromaDB + Gemini Embedding 2 on local files is a setup I'd have spent a week configuring from scratch. Recall packages this cleanly with a Raycast extension that makes it actually usable day-to-day. The MIT license and zero vendor lock-in seal the deal for me.”
“This solves a real problem I've hit personally — when you connect enough MCP servers, you're wasting a quarter of your context window on tool definitions before a single line of code is written. The five-wrapper-tool approach is elegant and the compression numbers are concrete and reproducible.”
“Re-indexing after file changes, cold-start latency on large libraries, and the dependency on Gemini Embedding 2 (which isn't truly offline) are real friction points. Apple Intelligence already does some of this natively on-device. Wait for broader platform support before switching your file workflow.”
“This is a workaround for a problem that MCP server authors and model providers should fix natively. Adding another proxy layer to your local development setup increases debugging complexity, and the 4,096-token output cap could silently truncate important data from tool responses.”
“Semantic search for personal files is the foundation for personal AI agents. If your agent can find any piece of information you've ever touched, you unlock genuine memory at human-years scale. Recall is primitive but points at something important.”
“Schema proliferation is becoming a real scalability ceiling for agentic systems. tldr's dynamic tool discovery approach — where the model learns which tools exist on-demand — hints at how future agent routing layers will work at scale across hundreds of specialized MCP endpoints.”
“I have 80,000 photos, hundreds of PDFs, and years of Figma exports I can never find. The idea of describing an image or document and having it surface immediately is worth every minute of setup time. This is the dream of local AI finally shipping.”
“For anyone using AI agents to manage creative workflows across multiple platforms, the context savings translate directly to more coherent, focused outputs. Less schema bloat means the model spends more attention on your actual task.”
Weekly AI Tool Verdicts
Get the next comparison in your inbox
New AI tools ship daily. We compare them before you waste an afternoon.