Logo del repository
  1. Home
 
Opzioni

An analytic theory of shallow networks dynamics for hinge loss classification*

Pellegrini, F
•
Biroli, G
2021
  • journal article

Periodico
JOURNAL OF STATISTICAL MECHANICS: THEORY AND EXPERIMENT
Abstract
Neural networks have been shown to perform incredibly well in classification tasks over structured high-dimensional datasets. However, the learning dynamics of such networks is still poorly understood. In this paper we study in detail the training dynamics of a simple type of neural network: a single hidden layer trained to perform a classification task. We show that in a suitable mean-field limit this case maps to a single-node learning problem with a time-dependent dataset determined self-consistently from the average nodes population. We specialize our theory to the prototypical case of a linearly separable data and a linear hinge loss, for which the dynamics can be explicitly solved in the infinite dataset limit. This allows us to address in a simple setting several phenomena appearing in modern networks such as slowing down of training dynamics, crossover between rich and lazy learning, and overfitting. Finally, we assess the limitations of mean-field theory by studying the case of large but finite number of nodes and of training samples.
DOI
10.1088/1742-5468/ac3a76
WOS
WOS:000735617700001
Archivio
https://hdl.handle.net/20.500.11767/135290
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85122544051
https://arxiv.org/abs/2006.11209
https://ricerca.unityfvg.it/handle/20.500.11767/135290
Diritti
metadata only access
Soggetti
  • machine learning

  • Settore FIS/03 - Fisi...

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback