October 14, 2025 | Lee et al.
Current training data attribution (TDA) methods treat the influence one sample has on another as sta...
October 14, 2025 | Urdshals et al.
We study neural network compressibility by using singular learning theory to extend the minimum desc...
August 1, 2025 | Wang et al.
Understanding how language models develop their internal computational structure is a central proble...
April 25, 2025 | Baker et al.
We develop a linear response framework for interpretability that treats a neural network as a Bayesi...
April 10, 2025 | Murfet and Troiani
We develop a correspondence between the structure of Turing machines and the structure of singularit...
February 8, 2025 | Lehalleur et al.
In this position paper, we argue that understanding the relation between structure in the data distr...
January 29, 2025 | Carroll et al.
Modern deep neural networks display striking examples of rich internal computational structure.
October 4, 2024 | Wang et al. | ICLR | Spotlight
We introduce refined variants of the Local Learning Coefficient (LLC), a measure of model complexity...
June 16, 2024 | Wang et al. | ICML HiLD Workshop | Best Paper
The development of the internal structure of neural networks throughout training occurs in tandem wi...
February 4, 2024 | Hoogland et al. | TMLR | Best Paper at 2024 ICML HiLD Workshop
We show that in-context learning emerges in transformers in discrete developmental stages, when they...
October 10, 2023 | Chen et al.
We investigate phase transitions in a Toy Model of Superposition (TMS) using Singular Learning Theor...
August 23, 2023 | Lau et al. | AISTATS 2025
Deep neural networks (DNN) are singular statistical models which exhibit complex degeneracies.
October 14, 2025 | Lee et al.
Current training data attribution (TDA) methods treat the influence one sample has on another as sta...
October 14, 2025 | Urdshals et al.
We study neural network compressibility by using singular learning theory to extend the minimum desc...
August 1, 2025 | Wang et al.
Understanding how language models develop their internal computational structure is a central proble...
April 25, 2025 | Baker et al.
We develop a linear response framework for interpretability that treats a neural network as a Bayesi...
April 10, 2025 | Murfet and Troiani
We develop a correspondence between the structure of Turing machines and the structure of singularit...
February 8, 2025 | Lehalleur et al.
In this position paper, we argue that understanding the relation between structure in the data distr...
January 29, 2025 | Carroll et al.
Modern deep neural networks display striking examples of rich internal computational structure.
October 4, 2024 | Wang et al. | ICLR | Spotlight
We introduce refined variants of the Local Learning Coefficient (LLC), a measure of model complexity...
June 16, 2024 | Wang et al. | ICML HiLD Workshop | Best Paper
The development of the internal structure of neural networks throughout training occurs in tandem wi...
February 4, 2024 | Hoogland et al. | TMLR | Best Paper at 2024 ICML HiLD Workshop
We show that in-context learning emerges in transformers in discrete developmental stages, when they...
October 10, 2023 | Chen et al.
We investigate phase transitions in a Toy Model of Superposition (TMS) using Singular Learning Theor...
August 23, 2023 | Lau et al. | AISTATS 2025
Deep neural networks (DNN) are singular statistical models which exhibit complex degeneracies.