Self-Training with Differentiable Teacher

Zuo, Simiao; Yu, Yue; Liang, Chen; Jiang, Haoming; Er, Siawpeng; Zhang, Chao; Zhao, Tuo; Zha, Hongyuan

Computer Science > Computation and Language

arXiv:2109.07049 (cs)

[Submitted on 15 Sep 2021 (v1), last revised 3 May 2022 (this version, v2)]

Title:Self-Training with Differentiable Teacher

Authors:Simiao Zuo, Yue Yu, Chen Liang, Haoming Jiang, Siawpeng Er, Chao Zhang, Tuo Zhao, Hongyuan Zha

View PDF

Abstract:Self-training achieves enormous success in various semi-supervised and weakly-supervised learning tasks. The method can be interpreted as a teacher-student framework, where the teacher generates pseudo-labels, and the student makes predictions. The two models are updated alternatingly. However, such a straightforward alternating update rule leads to training instability. This is because a small change in the teacher may result in a significant change in the student. To address this issue, we propose DRIFT, short for differentiable self-training, that treats teacher-student as a Stackelberg game. In this game, a leader is always in a more advantageous position than a follower. In self-training, the student contributes to the prediction performance, and the teacher controls the training process by generating pseudo-labels. Therefore, we treat the student as the leader and the teacher as the follower. The leader procures its advantage by acknowledging the follower's strategy, which involves differentiable pseudo-labels and differentiable sample weights. Consequently, the leader-follower interaction can be effectively captured via Stackelberg gradient, obtained by differentiating the follower's strategy. Experimental results on semi- and weakly-supervised classification and named entity recognition tasks show that our model outperforms existing approaches by large margins.

Comments:	NAACL 2022 (Findings)
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2109.07049 [cs.CL]
	(or arXiv:2109.07049v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.07049

Submission history

From: Simiao Zuo [view email]
[v1] Wed, 15 Sep 2021 02:06:13 UTC (3,731 KB)
[v2] Tue, 3 May 2022 12:52:13 UTC (2,462 KB)

Computer Science > Computation and Language

Title:Self-Training with Differentiable Teacher

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Self-Training with Differentiable Teacher

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators