2025-11-13 | Neo et al. 2024 cont.

Goal: Reproducing Neo et al. 2024 + Documenting + Apps

Summary: Applying to Research; implementing Section 4.1 Identifying Neurons (github reproduction)

Work sessions

Neo et al. 2024

Continued updating Interpreting Context Look Ups Notes
Implemented Section 4.1 Identifying Neurons, but found that the distribution of next-token neurons is highly skewed toward the final layer in both GPT2-Small and GPT2-Large. Additionally, applying RMS norm to \(W_{down}[:,i]\) did not reproduce Figure 3 of the paper.

Without Layer Norm: Identifying Neurons Without Layer Norm

With Layer Norm (distribution matches Figure 3 better, but values differ by orders of magnitude): Identifying Neurons With Layer Norm
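Sketch of the comparison above, for reference. This is not the paper's code or the github reproduction: it assumes the score for neuron i is the largest logit obtained by projecting \(W_{down}[:,i]\) through the unembedding, and it uses small random matrices as stand-ins for the GPT2 weights. The names `rms_normalize_columns`, `next_token_scores`, and `W_U` are made up for this note.

```python
import numpy as np

def rms_normalize_columns(W, eps=1e-6):
    """Scale each column of W to unit root-mean-square (no learned gain)."""
    rms = np.sqrt(np.mean(W ** 2, axis=0, keepdims=True) + eps)
    return W / rms

def next_token_scores(W_down, W_U, normalize=False):
    """Project every neuron's output direction W_down[:, i] through W_U.

    Returns (top_token, top_score), each of length d_mlp.
    Assumed scoring rule: max over the vocabulary of W_down[:, i] @ W_U.
    """
    cols = rms_normalize_columns(W_down) if normalize else W_down
    logits = cols.T @ W_U  # shape (d_mlp, vocab)
    return logits.argmax(axis=1), logits.max(axis=1)

# Toy stand-ins for one layer's weights (assumed shapes, not real GPT2 sizes).
rng = np.random.default_rng(0)
d_model, d_mlp, vocab = 16, 64, 100
W_down = rng.normal(size=(d_model, d_mlp))
W_U = rng.normal(size=(d_model, vocab))

tok_raw, score_raw = next_token_scores(W_down, W_U)
tok_norm, score_norm = next_token_scores(W_down, W_U, normalize=True)
```

A gain-free RMS norm only rescales each column by a positive constant, so it changes the magnitude of each neuron's score but not which token is on top. If the normalization used in the paper also applies the learned layer norm gain or bias, the top tokens and the relative ordering of neurons could change as well, which may be one place to look for the mismatch with Figure 3.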