Question 1

Which is better: LM Studio 0.4.0 or Stash?

Accepted Answer

Based on our expert panel, LM Studio 0.4.0 has a stronger verdict with a 100% Ship rate. LM Studio 0.4.0 received a panel verdict of Ship and Stash received Ship.

Question 2

Is LM Studio 0.4.0 free?

Accepted Answer

LM Studio 0.4.0 pricing: Free

Question 3

Is Stash free?

Accepted Answer

Stash pricing: Open Source

Question 4

What do experts say about LM Studio 0.4.0 vs Stash?

Accepted Answer

LM Studio 0.4.0: LM Studio 0.4.0 is the biggest update to the popular local LLM runner since its launch, introducing a proper headless CLI that separates the model inference engine from the GUI entirely. The new `lms` / `llmster` command starts LM Studio as a daemon — no display required — making local models viable in CI pipelines, remote servers, Docker containers, and scheduled tasks for the first time.

The update ships three major features alongside the CLI: continuous batching for parallel requests (multiple simultaneous users against one running model), a stateful `/v1/chat` REST API that preserves conversation state across calls without the client managing message history, and an interactive terminal chat via `lms chat` with streaming and system prompt support. The headless mode pairs naturally with Claude Code via a `claude-lm` alias that routes Claude's tool calls to the local model.

LM Studio 0.4.0 landed on Hacker News with 216 points, driven heavily by the "Running Gemma 4 locally" angle — Gemma 4's efficiency makes it one of the best models to run under 0.4.0's new architecture. The stateful API is particularly notable: it means the inference server maintains context between API calls, which dramatically simplifies agent loop implementations that don't want to re-send full conversation history on every turn. Stash: Stash is an open-source persistent memory infrastructure for AI agents built on PostgreSQL and pgvector. Unlike retrieval-augmented generation, which searches static documents, Stash actively learns from agent experience — consolidating raw observations into facts, relationships, causal links, and higher-order patterns over time.

The system exposes 28 MCP tools covering the full cognitive stack: episode storage, fact synthesis, entity graph management, goal tracking, failure pattern recognition, and self-correction when contradictions emerge. It deploys via Docker Compose in three steps and works with any OpenAI-compatible API — Claude, GPT, local models via Ollama. Hierarchical namespaces let agents keep user facts separate from project facts separate from self-knowledge.

This fills a real gap in the agent ecosystem. Most agent frameworks treat each session as stateless, which means agents repeat the same mistakes and lose hard-won context. Stash gives agents a persistent cognitive layer that compounds. It surfaced on Hacker News this week to notable developer interest and is worth watching as MCP adoption accelerates.

LM Studio 0.4.0 vs Stash

LM Studio 0.4.0

Stash

Bookmarks