Doing The Thing: From Mechanical Turk to Mech Interp
Dyck Transformer Probe
Work-in-progress notes on probing models for syntactic dependency.