X-Class: Associative classification of XML documents by structure

Costa G. • Ortale R. • Ritacco E.
2013
  • journal article

Journal
ACM TRANSACTIONS ON INFORMATION SYSTEMS
Abstract
The supervised classification of XML documents by structure involves learning predictive models in which certain structural regularities discriminate the individual document classes. Hitherto, research has focused on the adoption of prespecified substructures. This is detrimental for classification effectiveness, since the a priori chosen substructures may not accord with the structural properties of the XML documents. Therein, an unexplored question is how to choose the type of structural regularity that best adapts to the structures of the available XML documents. We tackle this problem through X-Class, an approach that handles all types of tree-like substructures and allows for choosing the most discriminatory one. Algorithms are designed to learn compact rule-based classifiers in which the chosen substructures discriminate the classes of XML documents. X-Class is studied across various domains and types of substructures. Its classification performance is compared against several rule-based and SVM-based competitors. Empirical evidence reveals that the classifiers induced by X-Class are compact, scalable, and at least as effective as the established competitors. In particular, certain substructures allow the induction of very compact classifiers that generally outperform the rule-based competitors in terms of effectiveness over all chosen corpora of XML data. Furthermore, such classifiers are substantially as effective as the SVM-based competitor, with the additional advantage of a high degree of interpretability. © 2013 ACM 1046-8188/2013/01-ART2 $15.00.
DOI
10.1145/2414782.2414785
WOS
WOS:000315057000003
Archive
https://hdl.handle.net/11390/1248956
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-84873410735
https://ricerca.unityfvg.it/handle/11390/1248956
Rights
closed access
Subjects
  • Structural XML classi...

  • XML mining

  • XML transactional mod...
