2025-11-13 | Neo et al. 2024 cont.

Goal: Reproducing Neo et al. 2024 + Documenting + Apps

Summary: Applying to Research; implementing Section 4.1 Identifying Neurons (github reproduction)

Work sessions

In Out
01:00 01:35
08:00 09:00
14:00 14:55
15:15 15:45
18:20 19:50
22:30 23:59

Neo et al. 2024

  1. Continued updating Interpreting Context Look Ups Notes
  2. Implemented Section 4.1 Identifying Neurons, but found that the distribution of next-token neurons is highly skewed toward the final layer in GPT2-Small and GPT2-Large. Additionally, applying RMS norm to \(W_{down}[:,i]\) did not reproduce Figure 3 of the paper.
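
For reference, a minimal sketch of the scoring step in item 2, run on random toy weights rather than GPT-2. It assumes the score of neuron \(i\) is the largest dot product between \(W_{down}[:,i]\) and any token's unembedding vector; that scoring rule, the function names, and the `(d_model, d_mlp)` layout are assumptions of this sketch and not necessarily the paper's exact procedure (transpose if the checkpoint stores the matrix the other way).

```python
import numpy as np


def rms_normalize(w, eps=1e-8):
    """Rescale every column of w to (approximately) unit root-mean-square."""
    rms = np.sqrt(np.mean(w ** 2, axis=0, keepdims=True))
    return w / (rms + eps)


def congruence_scores(w_down, w_unembed, normalize=False):
    """Best-matching token and score for each neuron.

    w_down:    (d_model, d_mlp), column i is the output direction of neuron i
    w_unembed: (d_model, vocab), column t is the unembedding vector of token t
    Returns (best_token, best_score), each of shape (d_mlp,).
    """
    if normalize:
        w_down = rms_normalize(w_down)
    scores = w_down.T @ w_unembed  # (d_mlp, vocab)
    return scores.argmax(axis=1), scores.max(axis=1)


def top_neurons_per_layer(scores_by_layer, k):
    """Count how many of the k highest-scoring neurons fall in each layer."""
    stacked = np.stack(scores_by_layer)  # (n_layers, d_mlp)
    flat_top = np.argsort(stacked, axis=None)[::-1][:k]
    layers, _ = np.unravel_index(flat_top, stacked.shape)
    return np.bincount(layers, minlength=stacked.shape[0])


# Toy stand-in for real weights: 64-dim residual stream, 8 neurons, 32 tokens.
rng = np.random.default_rng(0)
w_unembed = rng.normal(size=(64, 32))
w_down = rng.normal(size=(64, 8))
w_down[:, 3] = 4.0 * w_unembed[:, 5]  # plant a "next-token" neuron for token 5

tok_raw, score_raw = congruence_scores(w_down, w_unembed)
tok_norm, score_norm = congruence_scores(w_down, w_unembed, normalize=True)
```

Because RMS normalization multiplies each column by a positive constant, the best token per neuron is the same with and without it; only the score magnitudes, and therefore the ranking of neurons against each other, can change. That may be relevant when comparing score scales against Figure 3.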

Without Layer Norm: [Figure: Identifying Neurons Without Layer Norm]

With Layer Norm (matches the distribution in Figure 3 better, but the values differ by orders of magnitude): [Figure: Identifying Neurons With Layer Norm]