Logo del repository
  1. Home
 
Opzioni

Resilience of Bayesian Layer-Wise Explanations under Adversarial Attacks

Carbone G.
•
Bortolussi L.
•
Sanguinetti G.
2022
  • conference object

Abstract
We consider the problem of the stability of saliency-based explanations of Neural Network predictions under adversarial attacks in a classification task. Saliency interpretations of deterministic Neural Networks are remarkably brittle even when the attacks fail, i.e. for attacks that do not change the classification label. We empirically show that interpretations provided by Bayesian Neural Networks are considerably more stable under adversarial perturbations of the inputs and even under direct attacks to the explanations. By leveraging recent results, we also provide a theoretical explanation of this result in terms of the geometry of the data manifold. Additionally, we discuss the stability of the interpretations of high level representations of the inputs in the internal layers of a Network. Our results demonstrate that Bayesian methods, in addition to being more robust to adversarial attacks, have the potential to provide more stable and interpretable assessments of Neural Network predictions.
DOI
10.1109/IJCNN55064.2022.9892788
WOS
WOS:000867070907025
Archivio
https://hdl.handle.net/20.500.11767/132270
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85140725807
https://arxiv.org/abs/2102.11010
Diritti
open access
license:non specificato
license uri:iris.pri00
Soggetti
  • Adversarial attacks

  • Bayesian Neural Netwo...

  • Saliency explanations...

  • Settore FIS/07 - Fisi...

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback