Logo del repository
  1. Home
 
Opzioni

Improving Features Extraction for Supervised Invoice Classification

BARTOLI, Alberto
•
DAVANZO, GIORGIO
•
MEDVET, Eric
•
SORIO, ENRICO
2010
  • conference object

Abstract
An essential step in the understanding of printed documents is the classification of such documents based on their class, i.e., on the nature of information they contain and their lay out. In this work we are concerned with automatic classi fication of such documents. This task is usually accom plished by extracting a suitable set of low-level features from each document which are then fed to a classifier. The quality of the results depends primarily on the clas sifier, but they are also heavily influenced by the specific features used. In this work we focus on the feature ex traction part and propose a method that characterizes each document based on the spatial density of black pixels and of image edges. We assess our proposal on a real-world dataset composed of 560 invoices belonging to 68 differ ent classes. These documents have been digitalized after their printed counterparts have been handled by a corpo rate environment, thus they contain a substantial amount of noise—big stamps and handwritten signatures at unfor tunate positions and so on. We show that our proposal is accurate, even a with very small learning set.
Archivio
http://hdl.handle.net/11368/2294274
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-77954607455
http://www.actapress.com/Abstract.aspx?paperId=37733
Diritti
metadata only access
google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback