Logo del repository
  1. Home
 
Opzioni

VT-ADL: A Vision Transformer Network for Image Anomaly Detection and Localization

Mishra P.
•
Verk R.
•
Fornasier D.
altro
Foresti G. L.
2021
  • conference object

Abstract
We present a transformer-based image anomaly detection and localization network. Our proposed model is a combination of a reconstruction-based approach and patch embedding. The use of transformer networks helps preserving the spatial information of the embedded patches, which is later processed by a Gaussian mixture density network to localize the anomalous areas. In addition, we also publish BTAD, a real-world industrial anomaly dataset. Our results are compared with other state-of-the-art algorithms using publicly available datasets like MNIST and MVTec.
DOI
10.1109/ISIE45552.2021.9576231
Archivio
http://hdl.handle.net/11390/1215631
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85118783619
https://ricerca.unityfvg.it/handle/11390/1215631
Diritti
metadata only access
Soggetti
  • Anomaly dataset

  • Anomaly Detection

  • Anomaly segmentation

  • Gaussian density appr...

  • Vision transformer

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback