Reducing Exposure Bias in Training Recurrent Neural Network Transducers

Cui, Xiaodong; Kingsbury, Brian; Saon, George; Haws, David; Tuske, Zoltan

Computer Science > Computation and Language

arXiv:2108.10803 (cs)

[Submitted on 24 Aug 2021]

Title:Reducing Exposure Bias in Training Recurrent Neural Network Transducers

Authors:Xiaodong Cui, Brian Kingsbury, George Saon, David Haws, Zoltan Tuske

View PDF

Abstract:When recurrent neural network transducers (RNNTs) are trained using the typical maximum likelihood criterion, the prediction network is trained only on ground truth label sequences. This leads to a mismatch during inference, known as exposure bias, when the model must deal with label sequences containing errors. In this paper we investigate approaches to reducing exposure bias in training to improve the generalization of RNNT models for automatic speech recognition (ASR). A label-preserving input perturbation to the prediction network is introduced. The input token sequences are perturbed using SwitchOut and scheduled sampling based on an additional token language model. Experiments conducted on the 300-hour Switchboard dataset demonstrate their effectiveness. By reducing the exposure bias, we show that we can further improve the accuracy of a high-performance RNNT ASR model and obtain state-of-the-art results on the 300-hour Switchboard dataset.

Comments:	accepted to Interspeech 2021
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2108.10803 [cs.CL]
	(or arXiv:2108.10803v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2108.10803

Submission history

From: Xiaodong Cui [view email]
[v1] Tue, 24 Aug 2021 15:43:42 UTC (30 KB)

Computer Science > Computation and Language

Title:Reducing Exposure Bias in Training Recurrent Neural Network Transducers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Reducing Exposure Bias in Training Recurrent Neural Network Transducers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators