Question 1

Which is better: LM Studio 0.4.0 or MemPalace?

Accepted Answer

Based on our expert panel, LM Studio 0.4.0 has a stronger verdict with a 100% Ship rate. LM Studio 0.4.0 received a panel verdict of Ship and MemPalace received Ship.

Question 2

Is LM Studio 0.4.0 free?

Accepted Answer

LM Studio 0.4.0 pricing: Free

Question 3

Is MemPalace free?

Accepted Answer

MemPalace pricing: Free / Open Source (MIT). Self-hosted.

Question 4

What do experts say about LM Studio 0.4.0 vs MemPalace?

Accepted Answer

LM Studio 0.4.0: LM Studio 0.4.0 is the biggest update to the popular local LLM runner since its launch, introducing a proper headless CLI that separates the model inference engine from the GUI entirely. The new `lms` / `llmster` command starts LM Studio as a daemon — no display required — making local models viable in CI pipelines, remote servers, Docker containers, and scheduled tasks for the first time.

The update ships three major features alongside the CLI: continuous batching for parallel requests (multiple simultaneous users against one running model), a stateful `/v1/chat` REST API that preserves conversation state across calls without the client managing message history, and an interactive terminal chat via `lms chat` with streaming and system prompt support. The headless mode pairs naturally with Claude Code via a `claude-lm` alias that routes Claude's tool calls to the local model.

LM Studio 0.4.0 landed on Hacker News with 216 points, driven heavily by the "Running Gemma 4 locally" angle — Gemma 4's efficiency makes it one of the best models to run under 0.4.0's new architecture. The stateful API is particularly notable: it means the inference server maintains context between API calls, which dramatically simplifies agent loop implementations that don't want to re-send full conversation history on every turn. MemPalace: MemPalace is an open-source persistent memory system for LLMs that takes a philosophically different approach from every summarization-based alternative: it stores conversations verbatim, forever, and retrieves them with semantic precision. Where systems like MemGPT or standard RAG pipelines compress memories into lossy summaries, MemPalace treats exact wording as sacred — because often the specific phrasing of something a user said six months ago is the thing that matters.

The storage architecture uses a hierarchical "memory palace" metaphor: people and projects are wings, topics are rooms, individual memories are drawers. Semantic retrieval is scoped to sub-trees rather than doing a flat vector search across everything, which dramatically reduces false positives and improves precision at depth. The system claims a 96.6% score on LongMemEval — the highest publicly reported score among free tools — and integrates with any OpenAI-compatible API endpoint.

Verbatim storage does mean storage costs grow linearly with usage, and there's no built-in forgetting mechanism yet (which some see as a bug and others as a feature). But for personal assistants, coding agents, and any application where "you told me X last Tuesday" accuracy matters, MemPalace's approach to memory is architecturally more honest than the alternatives.

LM Studio 0.4.0 vs MemPalace

LM Studio 0.4.0

MemPalace

Bookmarks