Ternary Bonsai

1.58-bit LLMs that run at 82 tok/s on M4 Pro and on your iPhone

Price — Open Source / Apache 2.0 / FreeReviewed — 2026-04-20

Expert verdict

Ship

3-1

▲ 3 Ships— 1 Skips

Visit prismml.com

The Panel's Take

PrismML's Ternary Bonsai is a family of aggressively quantized language models that take the BitNet concept to its logical extreme. Each weight is constrained to one of three values — {-1, 0, +1} — with a shared FP16 scale factor per 128-weight group. No higher-precision escape hatches, no hybrid layers. The result is a 9x reduction in memory footprint versus standard 16-bit models. The numbers are striking: the 8B model fits in 1.75 GB and hits 82 tokens per second on an M4 Pro. More impressively, it runs at 27 tokens per second on an iPhone 17 Pro Max — fast enough for real-time conversation on-device. The 8B variant scores 75.5 average across standard benchmarks, outperforming many models that are 9-10x larger. The 4B and 1.7B variants push further into mobile-optimized territory. All three models are released under the Apache 2.0 license, available on Hugging Face and GitHub, and integrated into the Locally AI iOS app for immediate on-device deployment. For developers building privacy-sensitive applications or anyone tired of paying cloud inference costs, Ternary Bonsai offers a compelling on-device alternative that doesn't require a beefy GPU.

The reviews

Builder

Ship

“82 tokens per second on M4 Pro in 1.75 GB is a genuinely impressive engineering achievement. For local tooling, code assistants, or any latency-sensitive workload where I don't want cloud round-trips, this hits a sweet spot that larger quantized models miss. Apache 2.0 means I can embed it in commercial apps without legal headaches.”

Helpful?

Skeptic

Skip

“A 75.5 benchmark average sounds good until you compare it against 8B models quantized with GGUF Q8 — which score similarly and have years of tooling, community support, and production deployments behind them. The 9x memory savings matter on constrained devices but less so on any machine with 16GB+ RAM. Niche but real use case.”

Helpful?

Futurist

Ship

“On-device AI at 27 tokens per second on a phone is the inflection point that makes LLMs a platform primitive rather than a cloud service. Once inference is this cheap and fast on commodity hardware, the entire economic model of AI-as-API-call collapses. Ternary quantization is an early signal of where efficiency research is heading.”

Helpful?

Creator

Ship

“The prospect of running a capable LLM entirely on my iPhone without sending any data to a server is genuinely exciting for creative work with sensitive material. Drafting, editing, and ideation without a cloud subscription or privacy concerns — I'd pay for that, and here it's free.”

Helpful?

Share this verdict

Ternary Bonsai verdict: SHIP 🚀

3 ships · 1 skip from the expert panel

Full review: https://shiporskip.io/tool/ternary-bonsai-prismml-158-bit-llm-iphone-apple-silicon-2026?utm_source=share_card&utm_medium=social&utm_campaign=verdict_share&utm_content=x_share

Weekly AI Tool Verdicts

Get the next verdict in your inbox

7 critics review a new AI tool every day. Weekly digest — free.

Compare Ternary Bonsai with Others

Ternary Bonsai vs Heretic 1.3 Ternary Bonsai vs DeepSeek V4 Ternary Bonsai vs Google Gemma 4 Ternary Bonsai vs Qwen3.6-27B Ternary Bonsai vs Ling-2.6-Flash

Looking for Ternary Bonsai alternatives?

Compare Ternary Bonsai with every other AI Models tool reviewed by our panel.

See all AI Models alternatives

Embed this verdict

Tool makers can add a live ShipOrSkip badge to their site. Badge loads track impressions; clicks route back to this review.

Ship · 7.5/10

HTML badge

<a href="https://shiporskip.io/api/badge-click/ternary-bonsai-prismml-158-bit-llm-iphone-apple-silicon-2026" target="_blank" rel="noopener"><img src="https://shiporskip.io/api/badge/ternary-bonsai-prismml-158-bit-llm-iphone-apple-silicon-2026" alt="Ternary Bonsai Ship verdict on ShipOrSkip" width="360" height="90" /></a>

Markdown badge

[![Ternary Bonsai Ship verdict on ShipOrSkip](https://shiporskip.io/api/badge/ternary-bonsai-prismml-158-bit-llm-iphone-apple-silicon-2026)](https://shiporskip.io/api/badge-click/ternary-bonsai-prismml-158-bit-llm-iphone-apple-silicon-2026)

Iframe widget

<iframe src="https://shiporskip.io/embed/ternary-bonsai-prismml-158-bit-llm-iphone-apple-silicon-2026" title="Ternary Bonsai ShipOrSkip verdict" width="360" height="260" style="border:0;border-radius:16px;max-width:100%;" loading="lazy"></iframe>

Ternary Bonsai

Bookmarks