Publication

A Robust Ensemble Approach to Learn From Positive and Unlabeled Data Using SVM Base Models

Journal Contribution - Journal Article

© 2015 Elsevier B.V.

We present a novel approach to learn binary classifiers when only positive and unlabeled instances are available (PU learning). This problem is routinely cast as a supervised task with label noise in the negative set. We use an ensemble of SVM models trained on bootstrap resamples of the training data for increased robustness against label noise. The approach can be considered in a bagging framework which provides an intuitive explanation for its mechanics in a semi-supervised setting. We compared our method to state-of-the-art approaches in simulations using multiple public benchmark data sets. The included benchmark comprises three settings with increasing label noise: (i) fully supervised, (ii) PU learning and (iii) PU learning with false positives. Our approach shows a marginal improvement over existing methods in the second setting and a significant improvement in the third.
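The general idea in the abstract, an ensemble of SVM base models trained on bootstrap resamples with the unlabeled set treated as a noisy negative class, can be illustrated with a minimal sketch. This is not the authors' implementation. The following choices are assumptions made only for illustration: a Pegasos-style linear SVM written in pure Python stands in for the SVM base model, each base model draws equally sized resamples of positives and unlabeled instances, the ensemble aggregates by averaging decision values, and all names (`train_linear_svm`, `fit_pu_bagging`, `ensemble_score`) and the toy data are hypothetical.

```python
import random


def train_linear_svm(X, y, lam=0.01, epochs=20, rng=None):
    """Pegasos-style stochastic subgradient descent on the hinge loss.

    Assumption: a linear SVM stands in for the base model; the bias is
    handled by appending a constant feature to every instance.
    """
    rng = rng or random.Random(0)
    data = [(list(x) + [1.0], label) for x, label in zip(X, y)]
    w = [0.0] * len(data[0][0])
    t = 0
    for _ in range(epochs):
        rng.shuffle(data)
        for x, label in data:
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            margin = label * sum(wj * xj for wj, xj in zip(w, x))
            w = [(1.0 - eta * lam) * wj for wj in w]  # regularization shrink
            if margin < 1.0:  # hinge loss is active for this instance
                w = [wj + eta * label * xj for wj, xj in zip(w, x)]
    return w


def decision_value(w, x):
    """Signed score of one linear model for instance x."""
    return sum(wj * xj for wj, xj in zip(w, list(x) + [1.0]))


def fit_pu_bagging(positives, unlabeled, n_models=10, seed=0):
    """Train one base model per pair of bootstrap resamples.

    Assumption: unlabeled instances are labeled -1, so hidden positives
    among them act as label noise in the negative set.
    """
    rng = random.Random(seed)
    n = len(positives)
    models = []
    for _ in range(n_models):
        pos = rng.choices(positives, k=n)  # sample with replacement
        unl = rng.choices(unlabeled, k=n)  # sample with replacement
        models.append(train_linear_svm(pos + unl, [1] * n + [-1] * n, rng=rng))
    return models


def ensemble_score(models, x):
    """Average the base models' decision values (assumed aggregation)."""
    return sum(decision_value(w, x) for w in models) / len(models)


# Hypothetical toy data: positives near (2, 2), negatives near (-2, -2).
data_rng = random.Random(42)


def blob(cx, cy, n):
    return [[data_rng.gauss(cx, 1.0), data_rng.gauss(cy, 1.0)] for _ in range(n)]


positives = blob(2.0, 2.0, 40)
unlabeled = blob(2.0, 2.0, 10) + blob(-2.0, -2.0, 90)  # 10 hidden positives

models = fit_pu_bagging(positives, unlabeled)
print(ensemble_score(models, [2.0, 2.0]), ensemble_score(models, [-2.0, -2.0]))
```

Because each resample contains a different subset of the mislabeled instances, any single hidden positive influences only some of the base models, which is one intuitive reading of why averaging over resamples may dampen the effect of label noise.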

Journal: Neurocomputing

ISSN: 0925-2312

Volume: 160

Pages: 73 - 84

Publication year: 2015

Institutional Repository URL: https://lirias.kuleuven.be/92920
VABB Id: c:vabb:407874
DOI: https://doi.org/10.1016/j.neucom.2014.10.081
WoS Id: 000354139100007

BOF-keylabel: yes

IOF-keylabel: yes

BOF-publication weight: 2

CSS-citation score: 2

Authors from: Higher Education

Accessibility: Open
