AI tool comparison
Gemma 4 Multimodal Fine-Tuner vs qmd
Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.
Developer Tools
Gemma 4 Multimodal Fine-Tuner
Fine-tune Gemma 4 with text, images & audio on your Mac
75%
Panel ship
—
Community
Paid
Entry
Gemma 4 Multimodal Fine-Tuner is an open-source toolkit that lets developers fine-tune Google's Gemma 4 and 3n models across all three modalities — text, images, and audio — using only Apple Silicon hardware. It runs natively on PyTorch with Metal Performance Shaders (MPS), bypassing the NVIDIA requirement that has historically blocked Mac users from serious local fine-tuning work. The toolkit handles the full training pipeline including dataset prep, LoRA adapters, and multi-modal data collation. It ships with working example notebooks, a validation suite, and clean abstractions that don't require deep familiarity with the underlying MPS stack. Apple Silicon's unified memory architecture actually helps here — large multimodal batches fit in memory that would otherwise require GPU VRAM splitting on CUDA setups. Posted to Hacker News on April 7 as a Show HN, it pulled 109 upvotes and 165 GitHub stars within hours. The timing is sharp: Gemma 4 just dropped days ago with new multimodal capabilities, and the community immediately wanted local fine-tuning. This fills that gap faster than Google's own tooling.
Developer Tools
qmd
Local doc search engine with BM25 + vectors + LLM re-ranking — by Shopify's CEO
50%
Panel ship
—
Community
Free
Entry
qmd is a lightweight local search engine built by Tobi Luetke, CEO of Shopify, for indexing and querying personal knowledge bases, documentation, and meeting notes — entirely offline. It combines three retrieval approaches in a single pipeline: BM25 full-text search for exact keyword matches, vector semantic search via ONNX-based embeddings, and LLM re-ranking using GGUF models through node-llama-cpp. All three stages run locally with no cloud dependency. The tool ships in multiple deployment modes: a CLI for ad-hoc queries, a Node.js library for programmatic use, an HTTP service for local API access, and — most useful for AI workflows — a native MCP server that lets Claude Code, Cursor, and similar editors query your local knowledge base directly during coding sessions. The hybrid retrieval approach means it handles both "find the exact error message from last week's standup notes" and "what was our decision about the auth architecture" equally well. What makes this notable beyond its technical approach is provenance: Luetke shipped it as a personal tool he actually uses, not a startup product. The GitHub history shows active iteration and he's been talking about it on X. It's a credible signal of where pragmatic AI-augmented knowledge management is heading for technical users who prefer local-first tools.
Reviewer scorecard
“This is exactly what Apple Silicon owners have been waiting for. Running text + image + audio fine-tuning locally without needing a cloud GPU or NVIDIA hardware is genuinely useful — and the LoRA support keeps resource usage manageable. Ship immediately for anyone experimenting with Gemma 4 on a MacBook Pro M4.”
“Hybrid BM25 + vector + LLM re-rank is the right architecture for personal knowledge search — each layer catches what the others miss. The MCP server mode is genuinely useful: being able to ask Claude Code 'what did we decide about X last month' against my own notes changes the workflow. MIT licensed and from someone who ships real products.”
“MPS fine-tuning is still notably slower than CUDA and can be flaky with large batch sizes. The project is only days old with no production track record, and Gemma 4's licensing requires careful review for commercial use. Wait for community validation and more stable release before relying on this for anything serious.”
“This is a well-executed weekend project, not a production tool. It requires GGUF models and manual embedding setup — a meaningful friction barrier for non-technical users. The 'built by a CEO' narrative drives GitHub stars more than the technical differentiation. Obsidian with a local AI plugin gets you here with better UX.”
“Apple Silicon is quietly becoming the dominant edge compute platform for AI. Tooling that democratizes multimodal fine-tuning to every Mac owner — without cloud dependencies — is a meaningful step toward truly personal AI. The unified memory architecture is still underexploited; this project starts to change that.”
“The pattern here — local hybrid retrieval as an MCP server feeding into AI coding agents — will be ubiquitous in two years. Today it's a technical power-user tool; tomorrow it's how everyone's AI assistant knows the institutional context behind the code. qmd is an early, clean implementation of that pattern.”
“The idea of fine-tuning a vision+audio model on my own photos and recordings locally, without uploading anything to a server, is compelling. A custom Gemma 4 that knows my style and voice? That's actually useful for creative workflows. Once the docs improve, this has real potential for independent creators.”
“I manage a lot of notes, references, and creative briefs, but the setup friction here — GGUF models, CLI configuration — makes this inaccessible for most creators. The concept is great; the UX needs a front-end before it reaches beyond developers.”
Weekly AI Tool Verdicts
Get the next comparison in your inbox
New AI tools ship daily. We compare them before you waste an afternoon.