Logo del repository
  1. Home
 
Opzioni

Intrinsic Dimension Estimation for Discrete Metrics

Macocco, Iuri
•
Glielmo, Aldo
•
Grilli, Jacopo
•
Laio, Alessandro
2023
  • journal article

Periodico
PHYSICAL REVIEW LETTERS
Abstract
Real-world datasets characterized by discrete features are ubiquitous: from categorical surveys to clinical questionnaires, from unweighted networks to DNA sequences. Nevertheless, the most common unsupervised dimensional reduction methods are designed for continuous spaces, and their use for discrete spaces can lead to errors and biases. In this Letter we introduce an algorithm to infer the intrinsic dimension (ID) of datasets embedded in discrete spaces. We demonstrate its accuracy on benchmark datasets, and we apply it to analyze a metagenomic dataset for species fingerprinting, finding a surprisingly small ID, of order 2. This suggests that evolutive pressure acts on a low-dimensional manifold despite the high dimensionality of sequences' space.
DOI
10.1103/PhysRevLett.130.067401
WOS
WOS:000929789700002
Archivio
https://hdl.handle.net/20.500.11767/131772
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85148430650
https://arxiv.org/abs/2207.09688
Diritti
closed access
Soggetti
  • Settore FIS/03 - Fisi...

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback