Compare/Dive into LLMs vs How LLMs Work

AI tool comparison

Dive into LLMs vs How LLMs Work

Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.

D

Education & Research

Dive into LLMs

University-grade open curriculum for understanding (not just using) LLMs

Mixed

50%

Panel ship

Community

Free

Entry

Dive into LLMs is a structured LLM programming tutorial series from Shanghai Jiao Tong University covering fine-tuning, RLHF alignment, RAG pipelines, jailbreak attacks and defenses, watermarking techniques, GUI agents, and multimodal models. Each module includes slides, Jupyter notebooks with runnable code, and accompanying video lectures. The curriculum is designed for developers and researchers who want to go beyond prompt engineering into actually understanding how large language models work, how they're trained, and how to modify and deploy them. Topics span from transformer fundamentals through modern alignment techniques like DPO and GRPO. Recent additions cover GUI agents and multimodal architectures. The course has partnered with Huawei's Ascend community for localized deployment content. With 29k+ GitHub stars and trending hard today, this is one of the most-starred educational resources in the LLM ecosystem. Unlike blog posts and YouTube tutorials, the Jupyter notebooks mean you can run and modify every example yourself — making abstract concepts like RLHF tangible in a way that passive reading can't match.

H

Education

How LLMs Work

Andrej Karpathy's LLM lecture, rebuilt as an interactive visual experience

Ship

75%

Panel ship

Community

Free

Entry

"How LLMs Work" is a free, browser-based interactive guide that walks through the complete pipeline for building large language models — from raw web scraping to RLHF-trained conversational assistant. Created by Yash Narwal and based on Andrej Karpathy's technical deep-dive lecture, it's been getting significant traction on Hacker News (214+ points) for turning dense ML theory into something genuinely accessible. The site covers data collection and deduplication, Byte Pair Encoding tokenization with a live demo, pre-training and next-token prediction, inference with a probability sampling simulator, post-training with RLHF, and RAG. Each section uses animated visualizations, clickable pipeline diagrams, and canvas-based graphics — not static explainer images. The progressive narrative structure follows a single piece of text through every stage of the pipeline, making abstract concepts concrete. In an era where everyone uses LLMs but few understand how they work, this kind of high-quality educational resource matters for a different reason than tools: it raises the baseline competency of the entire developer ecosystem. Better-informed builders ask better questions, make better design decisions, and push vendors toward more transparency. This is the kind of project the HN community rewards — and deserves the signal boost.

Decision
Dive into LLMs
How LLMs Work
Panel verdict
Mixed · 2 ship / 2 skip
Ship · 3 ship / 1 skip
Community
No community votes yet
No community votes yet
Pricing
Free
Free
Best for
University-grade open curriculum for understanding (not just using) LLMs
Andrej Karpathy's LLM lecture, rebuilt as an interactive visual experience
Category
Education & Research
Education

Reviewer scorecard

Builder
80/100 · ship

Every dev who uses LLMs in production should understand fine-tuning and alignment at the level this curriculum teaches. The Jupyter notebooks are the key — being able to run RLHF examples on a small model changes your mental model for how alignment actually works.

80/100 · ship

Best visual explanation of tokenization I've seen — the live BPE demo finally made it click for me after years of reading static diagrams. Bookmarked for onboarding new engineers and explaining RAG to non-technical stakeholders.

Skeptic
45/100 · skip

There are dozens of LLM curricula on GitHub — fast.ai, Andrej Karpathy's videos, the Stanford CS224N lectures. Unless you specifically need SJTU's framing or the Huawei Ascend content, it's hard to argue this is uniquely worth your time over the better-known alternatives.

45/100 · skip

It's a beautiful explainer, but Karpathy's own YouTube lectures already do this and go deeper. Building on someone else's lecture without significant original contribution is fine, but 'Ship or Skip' implies you'd use it now — this is more bookmark-and-forget.

Futurist
80/100 · ship

The world needs millions more people who understand LLMs at the fine-tuning and alignment level — not just the API level. Open curricula like this are how that happens. The jailbreak and watermarking modules are especially forward-looking for an increasingly adversarial AI landscape.

80/100 · ship

The gap between AI capability and public understanding is the single biggest risk factor for good AI policy. Tools like this that translate technical reality into accessible visuals are infrastructure for an informed society — more important than most 'real' tools.

Creator
45/100 · skip

This is squarely for researchers and ML engineers, not creative practitioners. I appreciate the effort but nothing here helps me do my work better today — it's a long-form learning investment that most creators won't need to make.

80/100 · ship

The scroll-based animation and progressive reveals are exactly how technical content should be designed. Whoever built this UX understands both pedagogy and web craft — it's a masterclass in making complex systems legible through thoughtful visual design.

Weekly AI Tool Verdicts

Get the next comparison in your inbox

New AI tools ship daily. We compare them before you waste an afternoon.

Bookmarks

Loading bookmarks...

No bookmarks yet

Bookmark tools to save them for later