Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement

Yan, Yuzi; Zhang, Wei-Qiang; Johnson, Michael T.

Computer Science > Sound

arXiv:2108.12105 (cs)

[Submitted on 27 Aug 2021]

Title:Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement

Authors:Yuzi Yan, Wei-Qiang Zhang, Michael T. Johnson

View PDF

Abstract:As the cornerstone of other important technologies, such as speech recognition and speech synthesis, speech enhancement is a critical area in audio signal processing. In this paper, a new deep learning structure for speech enhancement is demonstrated. The model introduces a "full" attention mechanism to a bidirectional sequence-to-sequence method to make use of latent information after each focal frame. This is an extension of the previous attention-based RNN method. The proposed bidirectional attention-based architecture achieves better performance in terms of speech quality (PESQ), compared with OM-LSA, CNN-LSTM, T-GSA and the unidirectional attention-based LSTM baseline.

Comments:	4 pages
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2108.12105 [cs.SD]
	(or arXiv:2108.12105v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2108.12105

Submission history

From: Yuzi Yan [view email]
[v1] Fri, 27 Aug 2021 03:19:07 UTC (5,943 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.SD

< prev | next >

new | recent | 2021-08

Change to browse by:

cs
cs.LG
eess
eess.AS

References & Citations

DBLP - CS Bibliography

listing | bibtex

Wei-Qiang Zhang
Michael T. Johnson

export BibTeX citation

Computer Science > Sound

Title:Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators