
Web-Based Data Collection and Quality Issues in Co-Authorship Network Analysis

Domenico De Stefano
•
Vittorio Fuccella
•
Susanna Zaccarin
2019
  • conference object

Abstract
In this contribution we discuss data quality issues that arise when web scraping techniques are applied to the Cineca IRIS platform to derive co-authorship data among Italian university scholars. First, a semi-automatic tool is adopted to retrieve metadata from the platform; then a network-based approach is considered to deal with author name disambiguation. This combined procedure is used to derive the co-authorship relations among Italian academic statisticians on the basis of the publications they entered in the IRIS system up to 2017.
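As a rough illustration of the last step, deriving co-authorship relations from retrieved publication metadata, the sketch below counts how often each pair of authors appears on the same publication. It is not the authors' tool or their network-based disambiguation method: the `normalize` rule (surname plus first initial) is a deliberately naive stand-in, and the sample records and all names are invented for the example.

```python
from collections import Counter
from itertools import combinations


def normalize(name: str) -> str:
    """Naive name key (hypothetical rule): lowercase surname plus first initial.

    Expects "Surname, Given" strings. This is NOT the network-based
    disambiguation described in the abstract, only a placeholder for it.
    """
    surname, _, given = name.partition(",")
    given = given.strip()
    initial = given[0].lower() if given else ""
    return f"{surname.strip().lower()} {initial}".strip()


def coauthorship_edges(publications):
    """Return a Counter mapping (author_a, author_b) to joint publication count."""
    edges = Counter()
    for authors in publications:
        # Deduplicate and sort so each undirected pair has a single key.
        keys = sorted({normalize(a) for a in authors})
        for a, b in combinations(keys, 2):
            edges[(a, b)] += 1
    return edges


# Invented sample records: each inner list is one publication's author list.
records = [
    ["Rossi, Mario", "Bianchi, Anna"],
    ["Rossi, M.", "Bianchi, Anna", "Verdi, Luca"],
    ["Verdi, Luca"],
]

edges = coauthorship_edges(records)
for (a, b), weight in sorted(edges.items()):
    print(f"{a} -- {b}: {weight}")
```

A key this coarse merges "Rossi, Mario" with "Rossi, M." but would also merge distinct scholars who share a surname and initial, which is one of the data quality problems that name disambiguation has to address.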
Archive
http://hdl.handle.net/11368/2946992
https://it.pearson.com/content/dam/region-core/italy/pearson-italy/pdf/Dirigenti e istituzioni/ISTITUZIONI-HE-PDF-sis2019_V4.pdf
Rights
closed access
license: publisher copyright
FVG url
https://arts.units.it/request-item?handle=11368/2946992
Subjects
  • co-authorship network...

  • IRIS platform

  • web scraping

  • disambiguation algori...

Views
2
Acquisition date
Apr 19, 2024