Challenging the Semi-Supervised VAE Framework for Text Classification

Felhi, Ghazi; Roux, Joseph Le; Seddah, Djamé

Computer Science > Computation and Language

arXiv:2109.12969 (cs)

[Submitted on 27 Sep 2021]

Title:Challenging the Semi-Supervised VAE Framework for Text Classification

Authors:Ghazi Felhi, Joseph Le Roux, Djamé Seddah

View PDF

Abstract:Semi-Supervised Variational Autoencoders (SSVAEs) are widely used models for data efficient learning. In this paper, we question the adequacy of the standard design of sequence SSVAEs for the task of text classification as we exhibit two sources of overcomplexity for which we provide simplifications. These simplifications to SSVAEs preserve their theoretical soundness while providing a number of practical advantages in the semi-supervised setup where the result of training is a text classifier. These simplifications are the removal of (i) the Kullback-Liebler divergence from its objective and (ii) the fully unobserved latent variable from its probabilistic model. These changes relieve users from choosing a prior for their latent variables, make the model smaller and faster, and allow for a better flow of information into the latent variables. We compare the simplified versions to standard SSVAEs on 4 text classification tasks. On top of the above-mentioned simplification, experiments show a speed-up of 26%, while keeping equivalent classification scores. The code to reproduce our experiments is public.

Comments:	Accepted at the EMNLP 2021 Workshop on Insights from Negative Results
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2109.12969 [cs.CL]
	(or arXiv:2109.12969v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.12969

Submission history

From: Ghazi Felhi [view email]
[v1] Mon, 27 Sep 2021 11:46:32 UTC (92 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-09

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Ghazi Felhi
Joseph Le Roux

export BibTeX citation

Computer Science > Computation and Language

Title:Challenging the Semi-Supervised VAE Framework for Text Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Challenging the Semi-Supervised VAE Framework for Text Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators