One of Andrej Karpathy's small, pedagogical GPT implementations: a minimal codebase intended to make transformer internals legible to anyone willing to read a few hundred lines of PyTorch. It belongs to the same teaching lineage as nanoGPT and is a common starting point for engineers learning how modern language models work.