< Terug naar vorige pagina

Publicatie

Neural networks using full-band and subband spatial features for mask based source separation

Boekbijdrage - Boekhoofdstuk Conferentiebijdrage

With a microphone array, spatial diversity can be exploited to estimate time-frequency masks that effectively suppress interfering speakers as well as noise. Here, we propose a deep learning approach where the signal components are distinguished based on the associated directions of arrival. To capture the target signal spectrogram more accurately, the estimation can be performed for each subband separately. In order to also take advantage of cross-band dependencies, we additionally consider a combined subband and full-band architecture. Our evaluation indicates that this combination consistently improves the performance in terms of instrumental quality metrics as compared to a pure subband or full-band method. Further, the comparison with two baseline approaches demonstrates the effectiveness of the location based deep learning approach.
Boek: 2021 29th European Signal Processing Conference (EUSIPCO)
Pagina's: 346 - 350
ISBN:9789082797060
Jaar van publicatie:2021
Toegankelijkheid:Open