Logo del repository
  1. Home
 
Opzioni

Unsupervised detection of semantic correlations in big data

Acevedo S.
•
Alex Rodriguez
•
Laio A.
2025
  • journal article

Periodico
COMMUNICATIONS PHYSICS
Abstract
In real-world data, information is stored in extremely large feature vectors. These variables are typically correlated due to complex interactions involving many features simultaneously. Such correlations qualitatively correspond to semantic roles and are naturally recognized by both the human brain and artificial neural networks. This recognition enables, for instance, the prediction of missing parts of an image or text based on their context. We present a method to detect these correlations in high-dimensional data represented as binary numbers. We estimate the binary intrinsic dimension of a dataset, which quantifies the minimum number of independent coordinates needed to describe the data, and is therefore a proxy of semantic complexity. The proposed algorithm is largely insensitive to the so-called curse of dimensionality, and can therefore be used in big data analysis. We test this approach identifying phase transitions in model magnetic systems and we then apply it to the detection of semantic correlations of images and text inside deep neural networks.
DOI
10.1038/s42005-025-02115-z
WOS
WOS:001489367500002
Archivio
https://hdl.handle.net/11368/3110600
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-105005402827
https://www.nature.com/articles/s42005-025-02115-z
Diritti
open access
license:creative commons
license:creative commons
license uri:http://creativecommons.org/licenses/by-nc-nd/4.0/
license uri:http://creativecommons.org/licenses/by-nc-nd/4.0/
FVG url
https://arts.units.it/bitstream/11368/3110600/1/Acevedo_s42005-025-02115-z.pdf
Soggetti
  • Intrinsic Dimension

  • Semantical correlatio...

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback