Logo del repository
  1. Home
 
Opzioni

Training and assessing classification rules with imbalanced data

Menardi G.
•
TORELLI, Nicola
2014
  • journal article

Periodico
DATA MINING AND KNOWLEDGE DISCOVERY
Abstract
he problem of modeling binary responses by using cross-sectional data has been addressed with a number of satisfying solutions that draw on both parametric and nonparametric methods. However, there exist many real situations where one of the two responses (usually the most interesting for the analysis) is rare. It has been largely reported that this class imbalance heavily compromises the process of learning, because the model tends to focus on the prevalent class and to ignore the rare events. However, not only the estimation of the classification model is affected by a skewed distribution of the classes, but also the evaluation of its accuracy is jeopardized, because the scarcity of data leads to poor estimates of the model’s accuracy. In this work, the effects of class imbalance on model training and model assessing are discussed. Moreover, a unified and systematic framework for dealing with the problem of imbalanced classification is proposed, based on a smoothed bootstrap re-sampling technique. The proposed technique is founded on a sound theoretical basis and an extensive empirical study shows that it outperforms the main other remedies to face imbalanced learning problems.
DOI
10.1007/s10618-012-0295-5
WOS
WOS:000329229100004
Archivio
http://hdl.handle.net/11368/2625451
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-84891860723
http://link.springer.com/article/10.1007%2Fs10618-012-0295-5#page-1
Diritti
metadata only access
Soggetti
  • Accuracy

  • Binary classification...

  • Bootstrap

  • Kernel density estima...

  • Imbalanced learning

Scopus© citazioni
245
Data di acquisizione
Jun 7, 2022
Vedi dettagli
Web of Science© citazioni
359
Data di acquisizione
Mar 7, 2024
Visualizzazioni
6
Data di acquisizione
Apr 19, 2024
Vedi dettagli
google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback