Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training

Hardalov, Momchil; Arora, Arnav; Nakov, Preslav; Augenstein, Isabelle

arXiv:2109.06050 (cs)

[Submitted on 13 Sep 2021 (v1), last revised 21 Dec 2021 (this version, v2)]

Abstract: The goal of stance detection is to determine the viewpoint expressed in a piece of text towards a target. These viewpoints or contexts are often expressed in many different languages depending on the user and the platform, which can be a local news outlet, a social media platform, a news forum, etc. Most research in stance detection, however, has been limited to a single language and a small set of targets, with little work on cross-lingual stance detection. Moreover, non-English sources of labelled data are often scarce and present additional challenges. Recently, large multilingual language models have substantially improved performance on many non-English tasks, especially those with limited numbers of examples. This highlights the importance of model pre-training and its ability to learn from few examples. In this paper, we present the most comprehensive study of cross-lingual stance detection to date: we experiment with 15 diverse datasets in 12 languages from 6 language families, and with 6 low-resource evaluation settings each. For our experiments, we build on pattern-exploiting training, proposing the addition of a novel label encoder to simplify the verbalisation procedure. We further propose sentiment-based generation of stance data for pre-training, which yields a sizeable improvement of more than 6% absolute F1 in low-shot settings compared to several strong baselines.
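
As a rough illustration of the pattern-exploiting training setup that the abstract builds on, the sketch below wraps a (target, context) pair in a cloze pattern and maps stance labels to single words through a verbalizer. This is the standard verbalisation step that the proposed label encoder is said to simplify; it is not the authors' implementation. The pattern wording, the label set, the `[MASK]` placeholder, and the names `build_cloze`, `VERBALIZER`, and `predict_label` are all assumptions made for illustration, and the word scores stand in for a masked language model's output.

```python
from typing import Dict

MASK = "[MASK]"  # assumed placeholder; the real mask token depends on the model

# Hypothetical verbalizer: each stance label maps to one word for the masked slot.
VERBALIZER: Dict[str, str] = {
    "favor": "positive",
    "against": "negative",
    "neutral": "neutral",
}


def build_cloze(target: str, context: str) -> str:
    """Wrap a (target, context) pair in a hypothetical cloze pattern."""
    return f'The stance of the following "{context}" is {MASK} towards "{target}".'


def predict_label(word_scores: Dict[str, float]) -> str:
    """Return the label whose verbalizer word scores highest at the masked slot.

    `word_scores` stands in for the scores a masked language model would
    assign to candidate words; words that are missing count as -infinity.
    """
    return max(
        VERBALIZER,
        key=lambda label: word_scores.get(VERBALIZER[label], float("-inf")),
    )


if __name__ == "__main__":
    print(build_cloze("vaccination", "Vaccines save lives."))
    # Made-up scores for illustration only.
    print(predict_label({"positive": 0.1, "negative": 0.7, "neutral": 0.2}))
```

In a setup like this, every dataset with a different label inventory needs its own hand-written verbalizer, which is the kind of manual effort a learned label encoder would be expected to reduce.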

Comments:	Accepted to AAAI 2022 (Preprint version)
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2109.06050 [cs.CL]
	(or arXiv:2109.06050v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.06050

Submission history

From: Momchil Hardalov
[v1] Mon, 13 Sep 2021 15:20:06 UTC (1,223 KB)
[v2] Tue, 21 Dec 2021 09:03:27 UTC (1,220 KB)
