< Terug naar vorige pagina
Drivetrain system identification in a multi-task learning strategy using partial asynchronous elastic averaging stochastic gradient descent
Boekbijdrage - Boekhoofdstuk Conferentiebijdrage
The limited ability of deep learning models to generalize to regions outside of the training data distribution impedes their use for mechatronic applications, with high requirements of safe and robust operation in multiple operating conditions. We draw inspiration from the fields of Multi-Task Learning and distributed computing and propose an adaptation to Elastic Averaging Stochastic Gradient Descent that makes it possible to leverage upon the information of a fleet of systems to extend the generalization capabilities of the individual models, without having access to the full dataset. We demonstrate in simulation that our method enables models to generalize even outside of the joint training data distribution of the fleet. We compare our method to vanilla Elastic Averaging Stochastic Gradient Descent and demonstrate the importance of our adaptation for convergence in the Multi-Task Learning setting. Finally we investigate the interplay between the elastic force and the individual gradients in the update rules as a determining force for its performance.
Boek: 2020 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM)
Pagina's: 1549 - 1554
Jaar van publicatie:2020