Question 1

Which is better: Bonsai-8B or MemPalace?

Accepted Answer

Based on our expert panel, Bonsai-8B has a stronger verdict with a 75% Ship rate. Bonsai-8B received a panel verdict of Ship and MemPalace received Skip.

Question 2

Is Bonsai-8B free?

Accepted Answer

Bonsai-8B pricing: Free / Apache 2.0

Question 3

Is MemPalace free?

Accepted Answer

MemPalace pricing: Free / open source (MIT)

Question 4

What do experts say about Bonsai-8B vs MemPalace?

Accepted Answer

Bonsai-8B: Bonsai-8B is PrismML's latest model in their BitNet-inspired lineage — an 8.2B parameter language model that has been quantized end-to-end to true 1-bit precision (weights stored as -1 or +1), compressing the entire model to just 1.15 GB. That's roughly 12-14x smaller than a standard FP16 equivalent. Unlike post-training quantization hacks that lose substantial quality, PrismML trained Bonsai-8B with 1-bit arithmetic baked into the forward pass from the start.

Benchmark results are competitive for the size class: 63.8 on MMLU, 72.1 on HellaSwag, and 54.2 on GSM8K — while running at 131 tokens/sec on an M4 Pro MacBook and 44 tokens/sec on an iPhone 17 Pro Max. That makes it the fastest locally-runnable 8B model in its weight class on Apple Silicon. The MLX-optimized weights are available on Hugging Face today under Apache 2.0.

The significance goes beyond benchmarks. Getting a capable open-weight model to run at interactive speeds on consumer hardware — with no API key, no GPU, no cloud dependency — is a meaningful step toward truly private, offline AI. This follows PrismML's earlier "Ternary Bonsai" (1.58-bit) but represents a cleaner binary architecture that's easier to accelerate on custom silicon. MemPalace: MemPalace is an open-source persistent memory system for AI agents that organizes memories hierarchically — people and projects become "wings", topics become "rooms" — enabling scoped semantic retrieval rather than flat vector search. It claims 96.6% on LongMemEval and a 170-token overhead per session. MIT licensed, self-hosted.

The project went viral almost instantly after actress and director Milla Jovovich pushed it to GitHub, claiming she built it with Claude Code alongside engineer Ben Sigman. The "palace" metaphor maps well to how humans naturally organize associative memory, and the architectural idea of scoped context windows (retrieve only the relevant "room") is legitimately interesting for long-running agent sessions.

The controversy: GitHub issue #214 exposed that the headline benchmark measures ChromaDB's default embeddings, not the palace structure itself. The README was updated to walk back the "100% accuracy" claim. A pump-and-dump crypto token ($PALACE) also appeared within 24 hours of the GitHub push. The underlying memory architecture has real merit — the noise-to-signal ratio is just high right now.

Bonsai-8B vs MemPalace

Bonsai-8B

MemPalace

Bookmarks