Logo del repository
  1. Home
 
Opzioni

Cluster based oversampling for imbalanced learning

Gioia Di Credico
•
Nicola Torelli
2022
  • conference object

Abstract
Oversampling is a widespread remedy used when there is data imbalance in classification problems. Some oversampling techniques amount to generating new cases in the minority class which are similar to the observed ones. ROSE (Random OverSampling Examples) is an algorithm for generating new data, both in minority and majority classes, by using ideas from kernel density estimation and bootstrap resampling. In this paper, we show that a new strategy which couples density-based clustering methods with ROSE can improve the performance of supervised classification methods with data imbalance. Evidence from some simulation experiments shows that the new procedure is promising and solves some issues related to the use of ROSE.
Archivio
https://hdl.handle.net/11368/3030898
https://it.pearson.com/content/dam/region-core/italy/pearson-italy/pdf/Docenti/Università /Sis-2022-4c-low.pdf
Diritti
closed access
license:digital rights management non definito
license uri:iris.pri00
FVG url
https://arts.units.it/request-item?handle=11368/3030898
Soggetti
  • Density-based cluster...

  • tuning parameter

  • resampling

  • ROSE

  • SMOTE

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback