Logo del repository
  1. Home
 
Opzioni

A One-Shot Learning Approach to Document Layout Segmentation of Ancient Arabic Manuscripts

De Nardin A.
•
Zottin S.
•
Piciarelli C.
altro
Foresti G. L.
2024
  • conference object

Abstract
Document layout segmentation is a challenging task due to the variability and complexity of document layouts. Ancient manuscripts in particular are often damaged by age, have very irregular layouts, and are characterized by progressive editing from different authors over a large time window. All these factors make the semantic segmentation process of specific areas, such as main text and side text, very difficult. However, the study of these manuscripts turns out to be fundamental for historians and humanists, so much so that in recent years the demand for machine learning approaches aimed at simplifying the extraction of information from these documents has consistently increased, leading document layout analysis to become an increasingly important research area. In order for machine learning techniques to be applied effectively to this task, however, a large amount of correctly and precisely labeled images is required for their training. This is obviously a limitation for this field of research as ground truth must be precisely and manually crafted by expert humanists, making it a very time-consuming process. In this paper, with the aim of overcoming this limitation, we present an efficient document layout segmentation framework, which while being trained on only one labeled page per manuscript still achieves state-of-the-art performance compared to other popular approaches trained on all the available data when tested on a challenging dataset of ancient Arabic manuscripts.
DOI
10.1109/WACV57701.2024.00794
WOS
WOS:001222964608027
Archivio
https://hdl.handle.net/11390/1280224
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85191960125
https://ricerca.unityfvg.it/handle/11390/1280224
Diritti
closed access
license:non pubblico
license uri:iris.2.pri01
Soggetti
  • Algorithm

  • Algorithm

  • and algorithm

  • Application

  • Education

  • formulation

  • Image recognition and...

  • Machine learning arch...

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback