Logo del repository
  1. Home
 
Opzioni

Short Text Categorization Exploiting Contextual Enrichment and External Knowledge

MIZZARO, Stefano
•
PAVAN, Marco
•
SCAGNETTO, Ivan
•
M. Valenti
2014
  • conference object

Abstract
We address the problem of the categorization of short texts, like those posted by users on social networks and microblogging platforms. We specifically focus on Twitter. Since short texts do not provide sufficient word occurrences, and they often contain abbreviations and acronyms, traditional classification methods such as "Bag-of-Words" have limitations. Our proposed method enriches the original text with a new set of words, to add more semantic value by using information extracted from webpages of the same temporal context. Then we use those words to query Wikipedia, as an external knowledge base, with the final goal to categorize the original text using a predefined set of Wikipedia categories. We also present a first experimental evaluation that confirms the effectiveness of the algorithm design and implementation choices, highlighting some critical issues with short texts.
DOI
10.1145/2632188.2632205
Archivio
http://hdl.handle.net/11390/1036349
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-84908635503
http://dl.acm.org/citation.cfm?doid=2632188.2632205
Diritti
closed access
Soggetti
  • context-aware retriev...

  • enrichment

  • wikipedia

  • evaluation

Scopus© citazioni
14
Data di acquisizione
Jun 2, 2022
Vedi dettagli
Visualizzazioni
1
Data di acquisizione
Apr 19, 2024
Vedi dettagli
google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback