23.5. Building a Transformer from Scratch Workshop#
23.5.1. Workshop Summary#
This workshop provides a practical, interactive way to learn about transformers by building a simple language model. Participants dive into model components, training pipelines, and the ingredients of a typical end-to-end language model set-up.
23.5.1.1. Prerequisites#
Familiarity with PyTorch framework
Familiarity with foundational machine learning topics, such as feedforward networks and gradient descent
23.5.2. Workshop Slides#
To download the “Building a Transformer from Scratch” workshop slides, click the link below.
KempnerLLM Distributed Training Workshop