AI tool comparison
Dive into LLMs vs How LLMs Work
Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.
Education & Research
Dive into LLMs
University-grade open curriculum for understanding (not just using) LLMs
50%
Panel ship
—
Community
Free
Entry
Dive into LLMs is a structured LLM programming tutorial series from Shanghai Jiao Tong University covering fine-tuning, RLHF alignment, RAG pipelines, jailbreak attacks and defenses, watermarking techniques, GUI agents, and multimodal models. Each module includes slides, Jupyter notebooks with runnable code, and accompanying video lectures. The curriculum is designed for developers and researchers who want to go beyond prompt engineering into actually understanding how large language models work, how they're trained, and how to modify and deploy them. Topics span from transformer fundamentals through modern alignment techniques like DPO and GRPO. Recent additions cover GUI agents and multimodal architectures. The course has partnered with Huawei's Ascend community for localized deployment content. With 29k+ GitHub stars and trending hard today, this is one of the most-starred educational resources in the LLM ecosystem. Unlike blog posts and YouTube tutorials, the Jupyter notebooks mean you can run and modify every example yourself — making abstract concepts like RLHF tangible in a way that passive reading can't match.
Education
How LLMs Work
Andrej Karpathy's LLM lecture, rebuilt as an interactive visual experience
75%
Panel ship
—
Community
Free
Entry
"How LLMs Work" is a free, browser-based interactive guide that walks through the complete pipeline for building large language models — from raw web scraping to RLHF-trained conversational assistant. Created by Yash Narwal and based on Andrej Karpathy's technical deep-dive lecture, it's been getting significant traction on Hacker News (214+ points) for turning dense ML theory into something genuinely accessible. The site covers data collection and deduplication, Byte Pair Encoding tokenization with a live demo, pre-training and next-token prediction, inference with a probability sampling simulator, post-training with RLHF, and RAG. Each section uses animated visualizations, clickable pipeline diagrams, and canvas-based graphics — not static explainer images. The progressive narrative structure follows a single piece of text through every stage of the pipeline, making abstract concepts concrete. In an era where everyone uses LLMs but few understand how they work, this kind of high-quality educational resource matters for a different reason than tools: it raises the baseline competency of the entire developer ecosystem. Better-informed builders ask better questions, make better design decisions, and push vendors toward more transparency. This is the kind of project the HN community rewards — and deserves the signal boost.
Reviewer scorecard
“Every dev who uses LLMs in production should understand fine-tuning and alignment at the level this curriculum teaches. The Jupyter notebooks are the key — being able to run RLHF examples on a small model changes your mental model for how alignment actually works.”
“Best visual explanation of tokenization I've seen — the live BPE demo finally made it click for me after years of reading static diagrams. Bookmarked for onboarding new engineers and explaining RAG to non-technical stakeholders.”
“There are dozens of LLM curricula on GitHub — fast.ai, Andrej Karpathy's videos, the Stanford CS224N lectures. Unless you specifically need SJTU's framing or the Huawei Ascend content, it's hard to argue this is uniquely worth your time over the better-known alternatives.”
“It's a beautiful explainer, but Karpathy's own YouTube lectures already do this and go deeper. Building on someone else's lecture without significant original contribution is fine, but 'Ship or Skip' implies you'd use it now — this is more bookmark-and-forget.”
“The world needs millions more people who understand LLMs at the fine-tuning and alignment level — not just the API level. Open curricula like this are how that happens. The jailbreak and watermarking modules are especially forward-looking for an increasingly adversarial AI landscape.”
“The gap between AI capability and public understanding is the single biggest risk factor for good AI policy. Tools like this that translate technical reality into accessible visuals are infrastructure for an informed society — more important than most 'real' tools.”
“This is squarely for researchers and ML engineers, not creative practitioners. I appreciate the effort but nothing here helps me do my work better today — it's a long-form learning investment that most creators won't need to make.”
“The scroll-based animation and progressive reveals are exactly how technical content should be designed. Whoever built this UX understands both pedagogy and web craft — it's a masterclass in making complex systems legible through thoughtful visual design.”
Weekly AI Tool Verdicts
Get the next comparison in your inbox
New AI tools ship daily. We compare them before you waste an afternoon.