Logo del repository
  1. Home
 
Opzioni

A non-monotone trust-region method with noisy oracles and additional sampling

NataÅ¡a KrejiÄ
•
NataÅ¡a Krklec JerinkiÄ
•
ANGELES MARTINEZ CALOMARDO
•
Mahsa Yousefi
2024
  • journal article

Periodico
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS
Abstract
In this work, we introduce a novel stochastic second-order method, within the framework of a non-monotone trust-region approach, for solving the unconstrained, nonlinear, and non-convex optimization problems arising in the training of deep neural networks. The proposed algorithm makes use of subsampling strategies that yield noisy approximations of the finite sum objective function and its gradient. We introduce an adaptive sample size strategy based on inexpensive additional sampling to control the resulting approximation error. Depending on the estimated progress of the algorithm, this can yield sample size scenarios ranging from mini-batch to full sample functions. We provide convergence analysis for all possible scenarios and show that the proposed method achieves almost sure convergence under standard assumptions for the trust-region framework. We report numerical experiments showing that the proposed algorithm outperforms its state-of-the-art counterpart in deep neural network training for image classification and regression tasks while requiring a significantly smaller number of gradient evaluations.
DOI
10.1007/s10589-024-00580-w
WOS
WOS:001236119000001
Archivio
https://hdl.handle.net/11368/3077138
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85194707127
https://link.springer.com/article/10.1007/s10589-024-00580-w
Diritti
open access
license:creative commons
license uri:http://creativecommons.org/licenses/by/4.0/
FVG url
https://arts.units.it/bitstream/11368/3077138/3/s10589-024-00580-w.pdf
Soggetti
  • Stochastic optimizati...

  • Second-order method

  • Non-monotone trust-re...

  • Quasi-Newton

  • Deep neural networks ...

  • Adaptive sampling

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback