Skip to content

2026-03-24 | AutoInterp Hackathon

Goal: SAEs, CLTs, Visualization, and MOLTs

Summary: Trained Symbolic IOI Transformer and TopK SAE

Work sessions

In Out
17:50 07:30

AutoInterp Goals

Copying these over from yesterday:

  1. Implement Toy IOI Transformer from scratch
  2. Use Claude to verify that my Pytorch Transformer implementation is correct
  3. Create Dataset of IOI Sentences
  4. Train Model and verify model generalization

  5. Train SAEs and understand SAE math

  6. Train PLT and CLTs and understand math

  7. Understand Attention Interactions

  8. Train MOLTs

  9. Visualizations should be used throughout