Book contents
- Frontmatter
- Contents
- Contributors
- Preface
- 1 The Modern Mathematics of Deep Learning
- 2 Generalization in Deep Learning
- 3 Expressivity of Deep Neural Networks
- 4 Optimization Landscape of Neural Networks
- 5 Explaining the Decisions of Convolutional and Recurrent Neural Networks
- 6 Stochastic Feedforward Neural Networks: Universal Approximation
- 7 Deep Learning as Sparsity-Enforcing Algorithms
- 8 The Scattering Transform
- 9 Deep Generative Models and Inverse Problems
- 10 Dynamical Systems and Optimal Control Approach to Deep Learning
- 11 Bridging Many-Body Quantum Physics and Deep Learning via Tensor Networks
4 - Optimization Landscape of Neural Networks
Published online by Cambridge University Press: 29 November 2022
Summary
This chapter summarizes recent advances in the analysis of the optimization landscape of neural network training. We first review classical results for linear networks trained with a squared loss and without regularization. Such results show that, under certain conditions on the input-output data, spurious local minima are guaranteed not to exist, i.e., critical points are either saddle points or global minima. Moreover, the globally optimal weights can be found by factorizing certain matrices obtained from the input-output covariance matrices. We then review recent results for deep networks with a parallel structure, positively homogeneous network mapping and regularization, trained with a convex loss. Such results show that the non-convex objective on the weights can be lower-bounded by a convex objective on the network mapping. Moreover, when the network is sufficiently wide, local minima of the non-convex objective that satisfy a certain condition yield global minima of both the non-convex and convex objectives, and there is always a non-increasing path to a global minimizer from any initialization.
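To make the classical linear-network statement concrete, the sketch below writes out the one-hidden-layer case in illustrative notation (the symbols X, Y, W_1, W_2, Sigma_XX, Sigma_YX and the width r are chosen here and need not match the chapter's); it records the covariance matrices referred to in the summary and the closed form of the globally optimal end-to-end map.

```latex
% Minimal sketch of the one-hidden-layer linear-network setting
% (notation chosen here for illustration; the chapter's may differ).
% Squared loss over data X \in \mathbb{R}^{d\times n}, Y \in \mathbb{R}^{m\times n},
% hidden width r:
\min_{W_1 \in \mathbb{R}^{r\times d},\; W_2 \in \mathbb{R}^{m\times r}}
\;\ell(W_1,W_2) \;=\; \bigl\| Y - W_2 W_1 X \bigr\|_F^2 .

% Matrices built from the input-output covariances:
\Sigma_{XX} = XX^{\top}, \qquad
\Sigma_{YX} = YX^{\top}, \qquad
\Sigma \;=\; \Sigma_{YX}\,\Sigma_{XX}^{-1}\,\Sigma_{YX}^{\top}.

% If \Sigma_{XX} is invertible and \Sigma has distinct eigenvalues, then every
% local minimum of \ell is global, all other critical points are saddle points,
% and, with U_r the matrix of top-r eigenvectors of \Sigma, the optimal
% end-to-end map is
W_2 W_1 \;=\; U_r U_r^{\top}\,\Sigma_{YX}\,\Sigma_{XX}^{-1}.
```

In this setting the factorization mentioned in the summary amounts to an eigendecomposition of Sigma, from which the optimal product W_2 W_1 follows in closed form.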
Type: Chapter
Information: Mathematical Aspects of Deep Learning, pp. 200-228
Publisher: Cambridge University Press
Print publication year: 2022