Publication

Hierarchical sparse coding framework for speech emotion recognition

Journal contribution - Journal article

Finding an appropriate feature representation for audio data is central to speech emotion recognition. Most existing audio features rely on hand-crafted feature encoding techniques, such as the AVEC challenge feature set. An alternative approach is to use features that are learned automatically. Learned features have the advantage of generalizing well to new data, particularly if they are learned in an unsupervised manner with fewer restrictions on the data itself. In this work, we adopt the sparse coding framework as a means to automatically represent features from audio and propose a hierarchical sparse coding (HSC) scheme. Experimental results indicate that the features, obtained in an unsupervised fashion, are able to capture useful properties of the speech that distinguish between emotions.

Journal: Speech Commun

ISSN: 0167-6393

Volume: 99

Pages: 80-89

Year of publication: 2018

Keywords: Affective computing, Sparse coding, Speech emotion recognition, Support vector regression

ORCID: /0000-0002-1774-2970/work/83442855
WoS Id: 000440877900009
Scopus Id: 85043770992
DOI: https://doi.org/10.1016/j.specom.2018.01.006

BOF-keylabel: yes

BOF-publication weight: 1

CSS-citation score: 1

Authors: International

Authors from: Government, Higher Education

Accessibility: Open
