Building an LLM
Advanced · 2 hr
Build intuition for what training actually does by watching a small GPT come together from scratch, then keep a ground-up reference on hand for going deeper.
Training loop · Model weights · GPT architecture · From scratch
Learn from these
Let's build GPT: from scratch, in code, spelled out
Andrej Karpathy · 1.8 hr
The Illustrated GPT-2
Article · Jay Alammar
Build a Large Language Model (From Scratch)
Course · Sebastian Raschka

