Logo del repository
  1. Home
 
Opzioni

A Stochastic Modified Limited Memory BFGS for Training Deep Neural Networks

Yousefi M.
•
Martinez Calomardo A.
2022
  • book part

Abstract
In this work, we study stochastic quasi-Newton methods for solving the non-linear and non-convex optimization problems arising in the training of deep neural networks. We consider the limited memory Broyden-Fletcher-Goldfarb-Shanno (BFGS) update in the framework of a trust-region approach. We provide an almost comprehensive overview of recent improvements in quasi-Newton based training algorithms, such as accurate selection of the initial Hessian approximation, efficient solution of the trust-region subproblem with a direct method in high accuracy and an overlap sampling strategy to assure stable quasi-Newton updating by computing gradient differences based on this overlap. We provide a comparison of the standard L-BFGS method with a variant of this algorithm based on a modified secant condition which is theoretically shown to provide an increased order of accuracy in the approximation of the curvature of the Hessian. In our experiments, both quasi-Newton updates exhibit comparable performances. Our results show that with a fixed computational time budget the proposed quasi-Newton methods provide comparable or better testing accuracy than the state-of-the-art first-order Adam optimizer.
DOI
10.1007/978-3-031-10464-0_2
WOS
WOS:000889454000002
Archivio
https://hdl.handle.net/11368/3032998
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85135049869
https://link.springer.com/chapter/10.1007/978-3-031-10464-0_2
Diritti
open access
license:copyright editore
license:copyright editore
license uri:iris.pri02
license uri:iris.pri02
FVG url
https://arts.units.it/request-item?handle=11368/3032998
Soggetti
  • Quasi-Newton method

  • Limited memory BFGS

  • Trust region

  • Stochastic optimizati...

  • Deep neural network

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback