Logo del repository
  1. Home
 
Opzioni

Automatic topography of high-dimensional data sets by non-parametric density peak clustering

d'Errico M.
•
Facco E.
•
Laio A.
•
Rodriguez Garcia A.
2021
  • journal article

Periodico
INFORMATION SCIENCES
Abstract
Data analysis in high-dimensional spaces aims at obtaining a synthetic description of a data set, revealing its main structure and its salient features. We here introduce an approach providing this description in the form of a topography of the data, namely a human-readable chart of the probability density from which the data are harvested. The approach is based on an unsupervised extension of Density Peak clustering and on a non-parametric density estimator that measures the probability density in the manifold containing the data. This allows finding automatically the number and the height of the peaks of the probability density, and the depth of the “valleys” separating them. Importantly, the density estimator provides a measure of the error, which allows distinguishing genuine density peaks from density fluctuations due to finite sampling. The approach thus provides robust and visual information about the density peaks height, their statistical reliability and their hierarchical organization, offering a conceptually powerful extension of the standard clustering partitions. We show that this framework is particularly useful in the analysis of complex data sets.
DOI
10.1016/j.ins.2021.01.010
WOS
WOS:000641000800009
Archivio
https://hdl.handle.net/11368/3034860
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85101528334
https://www.sciencedirect.com/science/article/pii/S0020025521000116
Diritti
open access
license:copyright editore
license:creative commons
license uri:iris.pri02
license uri:http://creativecommons.org/licenses/by-nc-nd/4.0/
FVG url
https://arts.units.it/request-item?handle=11368/3034860
Soggetti
  • Clustering-algorithm

  • Density-peak-clusteri...

  • Hierarchy-visualizati...

  • High-dimensional-data...

  • Non-parametric-densit...

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback