8

[2105.04026] The Modern Mathematics of Deep Learning

 3 years ago
source link: https://arxiv.org/abs/2105.04026
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

[Submitted on 9 May 2021]

The Modern Mathematics of Deep Learning

Download PDF

We describe the new field of mathematical analysis of deep learning. This field emerged around a list of research questions that were not answered within the classical framework of learning theory. These questions concern: the outstanding generalization power of overparametrized neural networks, the role of depth in deep architectures, the apparent absence of the curse of dimensionality, the surprisingly successful optimization performance despite the non-convexity of the problem, understanding what features are learned, why deep architectures perform exceptionally well in physical problems, and which fine aspects of an architecture affect the behavior of a learning task in which way. We present an overview of modern approaches that yield partial answers to these questions. For selected approaches, we describe the main ideas in more detail.

Comments: This review paper will appear as a book chapter in the book "Theory of Deep Learning" by Cambridge University Press Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML) Cite as: arXiv:2105.04026 [cs.LG]   (or arXiv:2105.04026v1 [cs.LG] for this version)

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK