Scaling Neural Networks: Laws and Limits
Scaling up neural network models has enabled unprecedented learned capabilities, but we still lack the first-principles understanding needed to ensure their safety, reliability, and efficiency. I will show how tools from statistical mechanics and random matrix theory allow us to analyze neural networks in appropriate infinite-size scaling limits, thereby mapping the learning regimes that govern observed scaling behavior. These results account for the main features of empirical neural scaling laws; enable the transfer of near-optimal hyperparameters across model sizes, yielding significant computational savings; and provide a framework for understanding emergent behaviors such as in-context learning.
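
To make the notion of a neural scaling law concrete, the sketch below fits the saturating power-law form commonly used to summarize loss-versus-model-size curves, L(N) = L_inf + a * N^(-alpha), to synthetic data. This is an illustrative assumption for exposition, not the analysis presented in the talk, and all numbers (model sizes, losses, starting values) are made up.

# Minimal, illustrative sketch: fitting a saturating power law
#   L(N) = L_inf + a * N**(-alpha)
# where N is model size, alpha is the scaling exponent, and L_inf is the
# irreducible loss. All data below are synthetic, not empirical results.
import numpy as np
from scipy.optimize import curve_fit

def scaling_law(N, L_inf, a, alpha):
    """Loss as a function of model size N: irreducible loss plus a decaying power law."""
    return L_inf + a * N ** (-alpha)

# Synthetic loss measurements at geometrically spaced model sizes (assumed values).
rng = np.random.default_rng(0)
N = np.logspace(6, 9, num=8)                      # 1e6 to 1e9 parameters
loss = scaling_law(N, 1.7, 40.0, 0.3) + rng.normal(scale=0.01, size=N.size)

# Recover the exponent and irreducible loss from the noisy measurements.
params, _ = curve_fit(scaling_law, N, loss, p0=[1.0, 10.0, 0.5], maxfev=10000)
L_inf_hat, a_hat, alpha_hat = params
print(f"fitted exponent alpha = {alpha_hat:.3f}, irreducible loss = {L_inf_hat:.3f}")

In practice such fits are performed jointly over model size and dataset size; the single-variable form above is the simplest case and serves only to fix notation.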

