Logo del repository
  1. Home
 
Opzioni

Is ImageNet Always the Best Option? An Overview on Transfer Learning Strategies for Document Layout Analysis

De Nardin A.
•
Zottin S.
•
Colombi E.
altro
Foresti G. L.
2024
  • conference object

Abstract
Semantic segmentation models have shown impressive performance in the context of historical document layout analysis, but their effectiveness is reliant on having access to a large number of high-quality annotated images for training. A popular approach to address the lack of training data in other domains is to rely on transfer learning to transfer the knowledge learned from a large-scale, general-purpose dataset (e.g. ImageNet) to a domain-specific task. However, this approach has been shown to lead to unsatisfactory results when the target task is completely unrelated to the data employed for the pre-training process, which is the case when working on document layout analysis. For this reason, in the present paper, we provide an overview of domain-specific transfer learning for document layout segmentation. In particular, we show how relying on document-related images for the pre-training process leads to consistently improved performance and faster convergence compared to training from scratch or even relying on a large, general purpose, dataset such as ImageNet.
DOI
10.1007/978-3-031-51026-7_41
Archivio
https://hdl.handle.net/11390/1272686
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85184122509
https://ricerca.unityfvg.it/handle/11390/1272686
Diritti
metadata only access
Soggetti
  • Document Layout Analy...

  • Fine Tuning Approach

  • Page Segmentation

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback