Publication

Detecting adversarial examples with inductive Venn-ABERS predictors

Book Contribution - Book Chapter Conference Contribution

Inductive Venn-ABERS predictors (IVAPs) are a type of probabilistic predictors with the theoretical guarantee that their predictions are perfectly calibrated. We propose to exploit this calibration property for the detection of adversarial examples in binary classification tasks. By rejecting predictions if the uncertainty of the IVAP is too high, we obtain an algorithm that is both accurate on the original test set and significantly more robust to adversarial examples. The method appears to be competitive to the state of the art in adversarial defense, both in terms of robustness as well as scalability

Book: Proceedings of the 27th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN 2019)

Pages: 143 - 148

ISBN:9782875870650

Publication year:2019

Handle: http://hdl.handle.net/1854/LU-8622378

Accessibility:Open

Publication

Detecting adversarial examples with inductive Venn-ABERS predictors

Book Contribution - Book Chapter Conference Contribution

Authors/publisher

Research units