Stochastic Transformer Networks with Linear Competing Units: Application to end-to-end SL Translation

Voskou, Andreas; Panousis, Konstantinos P.; Kosmopoulos, Dimitrios; Metaxas, Dimitris N.; Chatzis, Sotirios

Computer Science > Computation and Language

arXiv:2109.13318 (cs)

[Submitted on 1 Sep 2021 (v1), last revised 1 Oct 2021 (this version, v2)]

Title:Stochastic Transformer Networks with Linear Competing Units: Application to end-to-end SL Translation

Authors:Andreas Voskou, Konstantinos P. Panousis, Dimitrios Kosmopoulos, Dimitris N. Metaxas, Sotirios Chatzis

View PDF

Abstract:Automating sign language translation (SLT) is a challenging real world application. Despite its societal importance, though, research progress in the field remains rather poor. Crucially, existing methods that yield viable performance necessitate the availability of laborious to obtain gloss sequence groundtruth. In this paper, we attenuate this need, by introducing an end-to-end SLT model that does not entail explicit use of glosses; the model only needs text groundtruth. This is in stark contrast to existing end-to-end models that use gloss sequence groundtruth, either in the form of a modality that is recognized at an intermediate model stage, or in the form of a parallel output process, jointly trained with the SLT model. Our approach constitutes a Transformer network with a novel type of layers that combines: (i) local winner-takes-all (LWTA) layers with stochastic winner sampling, instead of conventional ReLU layers, (ii) stochastic weights with posterior distributions estimated via variational inference, and (iii) a weight compression technique at inference time that exploits estimated posterior variance to perform massive, almost lossless compression. We demonstrate that our approach can reach the currently best reported BLEU-4 score on the PHOENIX 2014T benchmark, but without making use of glosses for model training, and with a memory footprint reduced by more than 70%.

Comments:	In Proceedings of ICCV 2021
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2109.13318 [cs.CL]
	(or arXiv:2109.13318v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.13318

Submission history

From: Sotirios Chatzis [view email]
[v1] Wed, 1 Sep 2021 15:00:52 UTC (1,283 KB)
[v2] Fri, 1 Oct 2021 08:57:30 UTC (1,283 KB)

Computer Science > Computation and Language

Title:Stochastic Transformer Networks with Linear Competing Units: Application to end-to-end SL Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Stochastic Transformer Networks with Linear Competing Units: Application to end-to-end SL Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators