Logo del repository
  1. Home
 
Opzioni

What to put in the bag? Comparing and contrasting procedures for text clustering

ONDELLI, STEFANO
2012
  • journal article

Periodico
STATISTICA APPLICATA
Abstract
This study takes into account the issue of text clustering against the specific background of bag-of-words approaches and from different viewpoints. The most common algorithms for text clustering include instructions to summarise textual features in simple quantitative measures and use them to recognise the degree of similarity (or dissimilarity) between texts. These procedures involve several choices concerning the vocabularies of texts and measures of similarity. By comparing and contrasting the results obtained through eleven different procedures aimed at clustering the texts of three different corpora, this study discusses the importance of those choices and is focused on understanding for which environments they may be suitable
Archivio
http://hdl.handle.net/11368/2640659
Diritti
metadata only access
Soggetti
  • statistics

  • linguistics

  • text analysis

  • word clusters

Visualizzazioni
1
Data di acquisizione
Apr 19, 2024
Vedi dettagli
google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback