Width Independent Bounds for the Local Lipschitz Constant of Deep Neural Networks
Mathematisches Kolloquium im Sommersemester 2026
When?
June 10, 2026, 17:15-19:00
Where?
Hörsaal der Kernphysik
S2|14 24
Schlossgartenstr. 9
64289 Darmstadt
Organiser
FB Mathematik
Contact
Prof. Dr. Felix Krahmer, TU Darmstadt, ETIT
Various recent works have shown that for wide, overparameterized neural networks, training with Stochastic Gradient Descent (SGD) often leads to interpolation of the training data without sacrificing generalization performance. A key parameter that is not only closely connected to generalization properties, but is also closely tied to other desiderata such as robustness and resistance to adversarial perturbations is the Lipschitz constant of the neural network. While empirically, the Lipschitz constant has been shown not to increase with network width, theoretical findings only provide bounds with logarithmic growth in the width and only for the random initialization of neural networks with the rectified linear unit (ReLU) as an activation function. In this talk, we present results that close this gap for neural networks with smooth activations by showing that, both at random initialization and throughout lazy training, the local Lipschitz constant of deep neural networks does not increase with network width. More precisely, we present novel non-asymptotic (finite width) upper bounds and corroborate them by numerical experiments.
This is joint work with Apostolos Evangelidis (Technical University of Munich).
Tags
Mathematisches Kolloquium, Mathematik, Numerik, Optimierung