Question 1

Which is better: Craft Agents or Gemma 4 Multimodal Fine-Tuner?

Accepted Answer

Both tools received a panel verdict of Ship from our expert panel. Craft Agents has the stronger verdict, with a 75% Ship rate.

Question 2

Is Craft Agents free?

Accepted Answer

Yes. Craft Agents is open source and free to use under the Apache 2.0 license.

Question 3

Is Gemma 4 Multimodal Fine-Tuner free?

Accepted Answer

Gemma 4 Multimodal Fine-Tuner is listed as open source.

Question 4

What do experts say about Craft Agents vs Gemma 4 Multimodal Fine-Tuner?

Accepted Answer

Craft Agents: Craft Agents OSS is an open-source desktop application built on Anthropic's Claude Agent SDK, offering a polished GUI for managing multiple AI agent sessions simultaneously. Built by Luki Labs and released under Apache 2.0, it fills the gap between raw API access and the full Claude.ai web interface — giving developers and power users a native desktop experience with serious capability depth.

The app supports three permission modes that make it genuinely useful for real work: Explore (read-only, safe for exploring codebases), Ask to Edit (approval-based, for supervised automation), and Auto (unrestricted, for trusted workflows). It connects to MCP servers, REST APIs from Google, Slack, and Microsoft, and local filesystems, with real-time streaming responses and full tool call visualization. A multi-session workflow with Todo → In Progress → Needs Review → Done status tracking makes it feel more like a project management system than a chat interface.
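The three permission modes and the status track described above can be sketched as a small policy table. This is only an illustration of the idea, not Craft Agents' actual code: the names `Mode`, `decide`, `STATUSES`, and `next_status` are hypothetical, and the strictly linear status progression is an assumption.

```python
from enum import Enum


class Mode(Enum):
    EXPLORE = "Explore"          # read-only
    ASK_TO_EDIT = "Ask to Edit"  # writes need user approval
    AUTO = "Auto"                # unrestricted


def decide(mode: Mode, is_write: bool) -> str:
    """Return 'allow', 'ask', or 'deny' for a tool call (illustrative policy)."""
    if not is_write:
        return "allow"           # reads are permitted in every mode
    if mode is Mode.EXPLORE:
        return "deny"
    if mode is Mode.ASK_TO_EDIT:
        return "ask"
    return "allow"


# Assumed linear status track, taken from the order given in the text.
STATUSES = ["Todo", "In Progress", "Needs Review", "Done"]


def next_status(current: str) -> str:
    """Advance a session one step; 'Done' stays 'Done'."""
    i = STATUSES.index(current)
    return STATUSES[min(i + 1, len(STATUSES) - 1)]
```

Under this sketch, a write attempted in Explore mode is refused, the same write in Ask to Edit mode waits for approval, and Auto lets it through.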

Built on Electron + React with encrypted credential storage and a headless server mode, Craft Agents is architecturally serious. It's available as a one-line installer for macOS, Linux, and Windows. With the Claude Agent SDK gaining traction, this is the first polished desktop client that treats agents as long-running workflows rather than single-turn conversations.

Gemma 4 Multimodal Fine-Tuner: Gemma 4 Multimodal Fine-Tuner is an open-source toolkit that lets developers fine-tune Google's Gemma 4 and 3n models across all three modalities (text, images, and audio) using only Apple Silicon hardware. It runs natively on PyTorch with Metal Performance Shaders (MPS), bypassing the NVIDIA requirement that has historically blocked Mac users from serious local fine-tuning work.

The toolkit handles the full training pipeline including dataset prep, LoRA adapters, and multi-modal data collation. It ships with working example notebooks, a validation suite, and clean abstractions that don't require deep familiarity with the underlying MPS stack. Apple Silicon's unified memory architecture actually helps here — large multimodal batches fit in memory that would otherwise require GPU VRAM splitting on CUDA setups.
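To give a rough sense of why LoRA adapters matter for memory-constrained local training, here is a small worked calculation. It is a sketch of the general LoRA idea, not of this toolkit's code: the function names are hypothetical, and the layer size and rank are illustrative values, not Gemma's actual dimensions.

```python
def full_params(d_in: int, d_out: int) -> int:
    """Weights updated when fine-tuning one dense layer in full."""
    return d_in * d_out


def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Weights updated with LoRA: two low-rank factors,
    A (rank x d_in) and B (d_out x rank); the base weight stays frozen."""
    return rank * (d_in + d_out)


# Illustrative numbers only (assumed, not taken from Gemma or the toolkit).
d_in, d_out, rank = 4096, 4096, 8
ratio = full_params(d_in, d_out) / lora_params(d_in, d_out, rank)
print(f"full: {full_params(d_in, d_out):,}  "
      f"lora: {lora_params(d_in, d_out, rank):,}  ratio: {ratio:.0f}x")
```

With these assumed numbers, the adapter trains 65,536 weights for a layer that holds 16,777,216, a 256-fold reduction in trainable parameters for that layer.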

Posted to Hacker News on April 7 as a Show HN, it pulled 109 upvotes and 165 GitHub stars within hours. The timing is sharp: Gemma 4 just dropped days ago with new multimodal capabilities, and the community immediately wanted local fine-tuning. This fills that gap faster than Google's own tooling.
