Logo del repository
  1. Home
 
Opzioni

Modeling the Influence of Data Structure on Learning in Neural Networks: The Hidden Manifold Model

Goldt, S.
•
Mezard, M.
•
Krzakala, F.
•
Zdeborova, L.
2020
  • journal article

Periodico
PHYSICAL REVIEW. X
Abstract
Understanding the reasons for the success of deep neural networks trained using stochastic gradient-based methods is a key open problem for the nascent theory of deep learning. The types of data where these networks are most successful, such as images or sequences of speech, are characterized by intricate correlations. Yet, most theoretical work on neural networks does not explicitly model training data or assumes that elements of each data sample are drawn independently from some factorized probability distribution. These approaches are, thus, by construction blind to the correlation structure of real-world datasets and their impact on learning in neural networks. Here, we introduce a generative model for structured datasets that we call the hidden manifold model. The idea is to construct high-dimensional inputs that lie on a lower-dimensional manifold, with labels that depend only on their position within this manifold, akin to a single-layer decoder or generator in a generative adversarial network. We demonstrate that learning of the hidden manifold model is amenable to an analytical treatment by proving a "Gaussian equivalence property"(GEP), and we use the GEP to show how the dynamics of two-layer neural networks trained using one-pass stochastic gradient descent is captured by a set of integro-differential equations that track the performance of the network at all times. This approach permits us to analyze in detail how a neural network learns functions of increasing complexity during training, how its performance depends on its size, and how it is impacted by parameters such as the learning rate or the dimension of the hidden manifold.
DOI
10.1103/PhysRevX.10.041044
WOS
WOS:000595237900001
Archivio
http://hdl.handle.net/20.500.11767/117821
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85097579260
https://arxiv.org/abs/1909.11500
Diritti
open access
Soggetti
  • Settore FIS/07 - Fisi...

Scopus© citazioni
13
Data di acquisizione
Jun 14, 2022
Vedi dettagli
Web of Science© citazioni
53
Data di acquisizione
Mar 28, 2024
Visualizzazioni
1
Data di acquisizione
Apr 19, 2024
Vedi dettagli
google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback