AI tool comparison
lmscan vs qmd
Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.
LLM Tools
lmscan
Offline AI text detector that fingerprints which LLM actually wrote it
50%
Panel ship
—
Community
Free
Entry
Most AI text detectors are cloud services with opaque models, significant false positive rates, and zero explanation for why they flagged content. lmscan is a zero-dependency Python package that runs entirely offline using 12 statistical linguistic features: perplexity scoring, burstiness analysis, vocabulary density, syntactic variety, and others. It's not just detection — it fingerprints the specific LLM family responsible, distinguishing between GPT-4, Claude, Gemini, Llama, and Mistral outputs based on their characteristic writing signatures. Every result is fully explainable, showing which features drove the classification. The design philosophy is explicitly anti-black-box: every classification comes with a feature-by-feature breakdown, making it suitable for applications where you need to explain the result to a human (academic integrity, content moderation, employment screening). The CLI interface drops into CI/CD pipelines for automated content checking, and the Python API integrates into document processing workflows. No API key, no network call, no vendor lock-in. Very early project — minimal stars and community traction as of this writing. The statistical approach trades accuracy for explainability, which means sufficiently paraphrased AI text will evade detection just as it does on competing services. But for a free, fully offline, explainable baseline for AI text analysis, it occupies a niche that no established tool does cleanly. Worth monitoring for teams that need local, auditable AI detection without vendor dependency.
Developer Tools
qmd
Local doc search engine with BM25 + vectors + LLM re-ranking — by Shopify's CEO
50%
Panel ship
—
Community
Free
Entry
qmd is a lightweight local search engine built by Tobi Luetke, CEO of Shopify, for indexing and querying personal knowledge bases, documentation, and meeting notes — entirely offline. It combines three retrieval approaches in a single pipeline: BM25 full-text search for exact keyword matches, vector semantic search via ONNX-based embeddings, and LLM re-ranking using GGUF models through node-llama-cpp. All three stages run locally with no cloud dependency. The tool ships in multiple deployment modes: a CLI for ad-hoc queries, a Node.js library for programmatic use, an HTTP service for local API access, and — most useful for AI workflows — a native MCP server that lets Claude Code, Cursor, and similar editors query your local knowledge base directly during coding sessions. The hybrid retrieval approach means it handles both "find the exact error message from last week's standup notes" and "what was our decision about the auth architecture" equally well. What makes this notable beyond its technical approach is provenance: Luetke shipped it as a personal tool he actually uses, not a startup product. The GitHub history shows active iteration and he's been talking about it on X. It's a credible signal of where pragmatic AI-augmented knowledge management is heading for technical users who prefer local-first tools.
Reviewer scorecard
“The zero-dependency, fully offline angle makes this immediately viable for enterprise environments where you can't send content to a third-party API for compliance reasons. The LLM fingerprinting feature is genuinely novel — I haven't seen another tool that tries to attribute text to specific model families. Early days, but the CI/CD integration and explainable output make it worth piloting for document pipelines where you need auditable AI detection.”
“Hybrid BM25 + vector + LLM re-rank is the right architecture for personal knowledge search — each layer catches what the others miss. The MCP server mode is genuinely useful: being able to ask Claude Code 'what did we decide about X last month' against my own notes changes the workflow. MIT licensed and from someone who ships real products.”
“Statistical AI text detection is a fundamentally broken approach — anyone who rewrites AI output a couple of times will evade it, and false positive rates on certain human writing styles (non-native English speakers, highly technical prose) can be significant. The LLM fingerprinting claim sounds exciting but needs rigorous benchmark testing before I'd trust it in a real content moderation or academic integrity context. Ship it when there's an accuracy paper.”
“This is a well-executed weekend project, not a production tool. It requires GGUF models and manual embedding setup — a meaningful friction barrier for non-technical users. The 'built by a CEO' narrative drives GitHub stars more than the technical differentiation. Obsidian with a local AI plugin gets you here with better UX.”
“As AI-generated content saturates every channel, the tools for detecting and attributing it become infrastructure, not just features. lmscan's offline, explainable approach points toward the right architecture: detection capability should be embeddable and auditable, not locked behind API calls. The specific LLM attribution angle — figuring out which model family produced text — will become increasingly important for provenance tracking and regulatory compliance.”
“The pattern here — local hybrid retrieval as an MCP server feeding into AI coding agents — will be ubiquitous in two years. Today it's a technical power-user tool; tomorrow it's how everyone's AI assistant knows the institutional context behind the code. qmd is an early, clean implementation of that pattern.”
“If you're a creator who worries about AI-generated content flooding your niche or competitors using AI to impersonate your style, this is theoretically relevant. But the accuracy question is real — statistical detection won't catch polished AI content, and false positives could flag your own work. Interesting concept that needs a lot more development before it's trustworthy for real editorial decisions.”
“I manage a lot of notes, references, and creative briefs, but the setup friction here — GGUF models, CLI configuration — makes this inaccessible for most creators. The concept is great; the UX needs a front-end before it reaches beyond developers.”
Weekly AI Tool Verdicts
Get the next comparison in your inbox
New AI tools ship daily. We compare them before you waste an afternoon.