2026-03-23 | Transformer From Scratch
Goal: Transformer from scratch: Implement Toy IOI Model
Summary: Progress: Finished Transformer Architecture
Work sessions
| In | Out |
|---|---|
| 06:30 | 07:30 |
Goals
AutoInterp
Copying these over from yesterday:
- Implement Toy IOI Transformer from scratch
- Use Claude to verify that my Pytorch Transformer implementation is correct
- Create Dataset of IOI Sentences
- Train Model and verify model generalization
- Understand how layer norms can be folded into the surrounding parts of a model
Linguistics
-
Hodge Laplacian, understand how to derive/hand calculate
-
CCG for Dyck Language