Publicatie

Training a Speech-to-Text Model for Dutch on the Corpus Gesproken Nederlands

Boekbijdrage - Boekhoofdstuk Conferentiebijdrage

Speech-to-text, also known as Speech Recognition, is a technology that is able to recognize and transcribe spoken language into text. In subsequent steps, this transcription can be used to complete a multitude of tasks, such as providing automatic subtitles or parsing voice commands. In recent years, Speech-to-Text models have dramatically improved thanks partially to advances in Deep Learning methods. Starting from the open-source project DeepSpeech, we train speech-to-text models for Dutch, using the Corpus Gesproken Nederlands (CGN). First, we contribute a pre-processing pipeline for this dataset, to make it suitable for the task at hand, obtaining a ready-to-use speech-to-text dataset for Dutch. Second, we investigate the performance of Dutch and Flemish models trained from scratch, establishing a baseline for the CGN dataset for this task. Finally, we investigate the issue of transferring speech-to-text models between related languages. In this case, we analyse how a pre-trained English model can be transferred and fine-tuned for Dutch.

Boek: Proceedings of the 31st Benelux Conference on Artificial Intelligence (BNAIC 2019)

Series: CEUR Workshop Proceedings

Volume: 2491

Aantal pagina's: 14

Jaar van publicatie:2019

ORCID: /0000-0001-5045-6127/work/99866428
ORCID: /0000-0002-2235-5115/work/90436498
ORCID: /0000-0001-6346-4564/work/65577290
Scopus Id: 85075057076
ORCID: /0000-0003-1446-5514/work/64620504
Institutional Repository URL: https://cris.vub.be/ws/files/75754370/Training_a_Speech_to_Text_Model_for_Dutch_on_the_Corpus_Gesproken_Nederlands.pdf
Institutional Repository URL: http://ceur-ws.org/Vol-2491/paper60.pdf

Toegankelijkheid:Open

Publicatie

Training a Speech-to-Text Model for Dutch on the Corpus Gesproken Nederlands

Boekbijdrage - Boekhoofdstuk Conferentiebijdrage

Auteurs/uitgever

Onderzoekseenheden