< Terug naar vorige pagina

Publicatie

High-dimensional prediction of binary outcomes in the presence of between-study heterogeneity

Tijdschriftbijdrage - Tijdschriftartikel

Many prediction methods have been proposed in the literature, but most of them ignore heterogeneity between populations. Either only data from a single study or population is available for model building and evaluation, or when data from multiple studies make up the training dataset, studies are pooled before model building. As a result, prediction models might perform less than expected when applied to new subjects from new study populations. We propose a linear method for building prediction models with high-dimensional data from multiple studies. Our method explicitly addresses between-population variability and tends to select predictors that are predictive in most of the study populations. We employ empirical Bayes estimators and hence avoid selection bias during the variable selection process. Simulation results demonstrate that the new method works better than other linear prediction methods that ignore the between-study variability. Our method is developed for classification into two groups.
Tijdschrift: Statistical methods in medical research
ISSN: 0962-2802
Issue: 9
Volume: 28
Pagina's: 2848 - 2867
Jaar van publicatie:2019
Trefwoorden:Health Care Sciences & Services, Mathematical & Computational Biology, Medical Informatics, Statistics & Probability, Empirical Bayes, high-dimensional data, multiple studies, heterogeneity, naive Bayes
BOF-keylabel:ja
IOF-keylabel:ja
BOF-publication weight:6
CSS-citation score:1
Auteurs:International
Authors from:Higher Education
Toegankelijkheid:Closed