Logo del repository
  1. Home
 
Opzioni

Effects on curve clustering of different transformations of chronological textual data

TREVISANI, MATILDE
•
Tuzzi, Arjuna
2016
  • conference object

Abstract
Chronological corpora are collections of texts ordered in time. In bag-of-words approaches, data are typically the frequencies of individual words in the set of texts being grouped into equal-distant time points. In our work the temporal course of a word occurrence is viewed as a proxy of a word life-cycle: recognition of temporal shapes and clustering of words having similar life-cycles are the basic objective. However, the strong asymmetry of frequency spectrum typical of textual data has to be taken into account when defining the specific purpose of clustering and, hence, any type of further processing of data. By adopting a functional data approach and a distance-based curve clustering, the effect of selected data transformations on the generation of word groups is examined.
Archivio
http://hdl.handle.net/11368/2846552
http://convegni.unica.it/cladag2015/cladag-book-of-abstracts-epub-format/
Diritti
closed access
FVG url
https://arts.units.it/request-item?handle=11368/2846552
Soggetti
  • chronological corpora...

  • data transformation

  • curve clustering

  • spline smoothing

  • textual data

Visualizzazioni
3
Data di acquisizione
Apr 19, 2024
Vedi dettagli
google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback