Question 1

Which is better: LM Studio 0.4.0 or MemPalace?

Accepted Answer

Based on our expert panel, LM Studio 0.4.0 has a stronger verdict with a 100% Ship rate. LM Studio 0.4.0 received a panel verdict of Ship and MemPalace received Skip.

Question 2

Is LM Studio 0.4.0 free?

Accepted Answer

LM Studio 0.4.0 pricing: Free

Question 3

Is MemPalace free?

Accepted Answer

MemPalace pricing: Free / open source (MIT)

Question 4

What do experts say about LM Studio 0.4.0 vs MemPalace?

Accepted Answer

LM Studio 0.4.0: LM Studio 0.4.0 is the biggest update to the popular local LLM runner since its launch, introducing a proper headless CLI that separates the model inference engine from the GUI entirely. The new `lms` / `llmster` command starts LM Studio as a daemon — no display required — making local models viable in CI pipelines, remote servers, Docker containers, and scheduled tasks for the first time.

The update ships three major features alongside the CLI: continuous batching for parallel requests (multiple simultaneous users against one running model), a stateful `/v1/chat` REST API that preserves conversation state across calls without the client managing message history, and an interactive terminal chat via `lms chat` with streaming and system prompt support. The headless mode pairs naturally with Claude Code via a `claude-lm` alias that routes Claude's tool calls to the local model.

LM Studio 0.4.0 landed on Hacker News with 216 points, driven heavily by the "Running Gemma 4 locally" angle — Gemma 4's efficiency makes it one of the best models to run under 0.4.0's new architecture. The stateful API is particularly notable: it means the inference server maintains context between API calls, which dramatically simplifies agent loop implementations that don't want to re-send full conversation history on every turn. MemPalace: MemPalace is an open-source persistent memory system for AI agents that organizes memories hierarchically — people and projects become "wings", topics become "rooms" — enabling scoped semantic retrieval rather than flat vector search. It claims 96.6% on LongMemEval and a 170-token overhead per session. MIT licensed, self-hosted.

The project went viral almost instantly after actress and director Milla Jovovich pushed it to GitHub, claiming she built it with Claude Code alongside engineer Ben Sigman. The "palace" metaphor maps well to how humans naturally organize associative memory, and the architectural idea of scoped context windows (retrieve only the relevant "room") is legitimately interesting for long-running agent sessions.

The controversy: GitHub issue #214 exposed that the headline benchmark measures ChromaDB's default embeddings, not the palace structure itself. The README was updated to walk back the "100% accuracy" claim. A pump-and-dump crypto token ($PALACE) also appeared within 24 hours of the GitHub push. The underlying memory architecture has real merit — the noise-to-signal ratio is just high right now.

LM Studio 0.4.0 vs MemPalace

LM Studio 0.4.0

MemPalace

Bookmarks