back to list

When?

June 10, 2026, 17:15-19:00

Where?

Hörsaal der Kernphysik
S2|14 24
Schlossgartenstr. 9
64289 Darmstadt

Hörsaal der Kernphysik , S2|14 24 , Schlossgartenstr. 9 , 64289 Darmstadt

Organiser

FB Mathematik

giesselmann@mathematik.tu-darmstadt.de

Prof. Dr. Felix Krahmer, TU Darmstadt, ETIT

Various recent works have shown that for wide, overparameterized neural networks, training with Stochastic Gradient Descent (SGD) often leads to interpolation of the training data without sacrificing generalization performance. A key parameter that is not only closely connected to generalization properties, but is also closely tied to other desiderata such as robustness and resistance to adversarial perturbations is the Lipschitz constant of the neural network. While empirically, the Lipschitz constant has been shown not to increase with network width, theoretical findings only provide bounds with logarithmic growth in the width and only for the random initialization of neural networks with the rectified linear unit (ReLU) as an activation function. In this talk, we present results that close this gap for neural networks with smooth activations by showing that, both at random initialization and throughout lazy training, the local Lipschitz constant of deep neural networks does not increase with network width. More precisely, we present novel non-asymptotic (finite width) upper bounds and corroborate them by numerical experiments.
This is joint work with Apostolos Evangelidis (Technical University of Munich). 

/globalcontent/veranstaltungskalender/Optimierung_1774010839294_255.jpeg
 

Tags

Mathematisches Kolloquium, Mathematik, Numerik, Optimierung