Logo del repository
  1. Home
 
Opzioni

Stratified learning: A general-purpose statistical method for improved learning under covariate shift

Maximilian Autenrieth
•
David A. van Dyk
•
Roberto Trotta
•
David C. Stenning
2023
  • journal article

Periodico
STATISTICAL ANALYSIS AND DATA MINING
Abstract
We propose a simple, statistically principled, and theoretically justified method to improve supervised learning when the training set is not representative, a situation known as covariate shift. We build upon a well-established methodology in causal inference, and show that the effects of covariate shift can be reduced or eliminated by conditioning on propensity scores. In practice, this is achieved by fitting learners within strata constructed by partitioning the data based on the estimated propensity scores, leading to approximately balanced covariates and much-improved target prediction. We demonstrate the effectiveness of our general-purpose method on two contemporary research questions in cosmology, outperforming state-of-the-art importance weighting methods. We obtain the best reported AUC (0.958) on the updated "Supernovae photometric classification challenge", and we improve upon existing conditional density estimation of galaxy redshift from Sloan Data Sky Survey (SDSS) data.
DOI
10.1002/sam.11643
WOS
WOS:001073021800001
Archivio
https://hdl.handle.net/20.500.11767/134310
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85173913727
http://arxiv.org/abs/2106.11211v2
https://ricerca.unityfvg.it/handle/20.500.11767/134310
Diritti
open access
Soggetti
  • Statistics - Machine ...

  • Statistics - Machine ...

  • astro-ph.CO

  • Computer Science - Le...

  • Settore FIS/02 - Fisi...

google-scholar
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your nstitution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Realizzato con Software DSpace-CRIS - Estensione mantenuta e ottimizzata da 4Science

  • Impostazioni dei cookie
  • Informativa sulla privacy
  • Accordo con l'utente finale
  • Invia il tuo Feedback