< Terug naar vorige pagina

Publicatie

AVATAR — Automated feature wrangling for machine learning

Boekbijdrage - Boekhoofdstuk Conferentiebijdrage

A large part of the time invested in data science is spent on manual preparation of data. Transforming wrongly formatted columns into useful features takes up a significant part of this time. We present the avatar algorithm for automatically learning programs that perform this type of feature wrangling. Instead of relying on users to guide the wrangling process, avatar directly uses the predictive performance of machine learning models to measure its progress during wrangling. We use datasets from Kaggle to show that avatar improves raw data for prediction, and square it off against human data scientists.
Boek: Lecture Notes in Computer Science
Pagina's: 235 - 247
ISBN:978-3-030-74251-5
Jaar van publicatie:2021
BOF-keylabel:ja
IOF-keylabel:ja
Authors from:Higher Education
Toegankelijkheid:Open