Logo del repository
  1. Home
 
Opzioni

A topological description of loss surfaces based on Betti Numbers

Bucarelli Maria Sofia
•
D'Inverno Giuseppe Alessio
•
Bianchini Monica
altro
Silvestri Fabrizio
2024
  • journal article

Periodico
NEURAL NETWORKS
Abstract
In the context of deep learning models, attention has recently been paid to studying the surface of the loss function in order to better understand training with methods based on gradient descent. This search for an appropriate description, both analytical and topological, has led to numerous efforts in identifying spurious minima and characterize gradient dynamics. Our work aims to contribute to this field by providing a topological measure for evaluating loss complexity in the case of multilayer neural networks. We compare deep and shallow architectures with common sigmoidal activation functions by deriving upper and lower bounds for the complexity of their respective loss functions and revealing how that complexity is influenced by the number of hidden units, training models, and the activation function used. Additionally, we found that certain variations in the loss function or model architecture, such as adding an l2 regularization term or implementing skip connections in a feedforward network, do not affect loss topology in specific cases.
DOI
10.1016/j.neunet.2024.106465
WOS
WOS:001262458000001
Archivio
https://hdl.handle.net/20.500.11767/143310
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85196948612
https://www.sciencedirect.com/science/article/pii/S0893608024003897
https://arxiv.org/abs/2401.03824
https://ricerca.unityfvg.it/handle/20.500.11767/143310
Diritti
open access
Soggetti
  • Betti number

  • Loss surface

  • ResNet

  • l2 regularization

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback