< Back to previous page

Publication

AVATAR — Automated feature wrangling for machine learning

Book Contribution - Book Chapter Conference Contribution

A large part of the time invested in data science is spent on manual preparation of data. Transforming wrongly formatted columns into useful features takes up a significant part of this time. We present the avatar algorithm for automatically learning programs that perform this type of feature wrangling. Instead of relying on users to guide the wrangling process, avatar directly uses the predictive performance of machine learning models to measure its progress during wrangling. We use datasets from Kaggle to show that avatar improves raw data for prediction, and square it off against human data scientists.
Book: Lecture Notes in Computer Science
Pages: 235 - 247
ISBN:978-3-030-74251-5
Publication year:2021
BOF-keylabel:yes
IOF-keylabel:yes
Authors from:Higher Education
Accessibility:Open