The Foundations of AI Seminar Series is dedicated to topics of interest in artificial intelligence, machine learning, both empirically and theoretically, as well as related areas. Our goal is for these meetings to serve as a forum for discussions and quick dissemination of results. We invite anyone interested in the latest advancements in AI/ML to join us!

Next Seminar


Learning Dynamics of Overparametrized Neural Networks

René Vidal

Speaker: René Vidal (UPenn) Date: 19-06-2024, 3pm-4pm (BST) Location: Computer Science Building, CS1.04, University of Warwick, Coventry, UK

Abstract

This talk will provide a detailed analysis of the dynamics of gradient based methods in overparameterized models. For linear networks, we show that the weights converge to an equilibrium at a linear rate that depends on the imbalance between input and output weights (which is fixed at initialization) and the margin of the initial solution. For ReLU networks, we show that the dynamics has a feature learning phase, where neurons collapse to one of the class centers, and a classifier learning phase, where the loss converges to zero at a rate 1/t.


About René Vidal

René Vidal, a global pioneer of data science, is the Rachleff University Professor, with joint appointments in the Department of Radiology in the Perelman School of Medicine and the Department of Electrical and Systems Engineering in the School of Engineering and Applied Science. Dr. Vidal has been named a Penn Integrates Knowledge University Professor at the University of Pennsylvania.

René Vidal received his B.S. degree in Electrical Engineering (highest honors) from the Pontificia Universidad Catolica de Chile in 1997 and his M.S. and Ph.D. degrees in Electrical Engineering and Computer Sciences from the University of California at Berkeley in 2000 and 2003, respectively. He was a research fellow at the National ICT Australia in 2003 and joined The Johns Hopkins University in 2004 as a faculty member in the Department of Biomedical Engineering and the Center for Imaging Science.

Upcoming Events


Speaker Image

Learning Dynamics of Overparametrized Neural Networks

René Vidal - Rachleff University Professor, University of Pennsylvania, USA
Calendar Icon Jun 19, 2024 at 3:00PM
Location Icon Compute Science Bulding,CS1.04

More Info
Speaker Image

TBD

Ilija Bogunovic - Assistant Professor, UCL, UK
Calendar Icon Oct 11, 2024 at 2:00PM
Location Icon TBD

More Info
Speaker Image

Causal Effect Estimation with Context and Confounders

Arthur Gretton - Professor, UCL, UK
Calendar Icon TBD
Location Icon TBD
*** This talk will be rescheduled. ***

More Info

Organising Team

Fanghui Liu

Fanghui Liu

Assistant Professor, CS Department, University of Warwick

Paris Giampouras

Paris Giampouras

Assistant Professor, CS Department, University of Warwick

Long Tran-Thanh

Long Tran-Thanh

Associate Professor, CS Department, University of Warwick