Learning Opinion Summarizers by Selecting Informative Reviews

Bražinskas, Arthur; Lapata, Mirella; Titov, Ivan

Computer Science > Computation and Language

arXiv:2109.04325 (cs)

[Submitted on 9 Sep 2021]

Title:Learning Opinion Summarizers by Selecting Informative Reviews

Authors:Arthur Bražinskas, Mirella Lapata, Ivan Titov

View PDF

Abstract:Opinion summarization has been traditionally approached with unsupervised, weakly-supervised and few-shot learning techniques. In this work, we collect a large dataset of summaries paired with user reviews for over 31,000 products, enabling supervised training. However, the number of reviews per product is large (320 on average), making summarization - and especially training a summarizer - impractical. Moreover, the content of many reviews is not reflected in the human-written summaries, and, thus, the summarizer trained on random review subsets hallucinates. In order to deal with both of these challenges, we formulate the task as jointly learning to select informative subsets of reviews and summarizing the opinions expressed in these subsets. The choice of the review subset is treated as a latent variable, predicted by a small and simple selector. The subset is then fed into a more powerful summarizer. For joint training, we use amortized variational inference and policy gradient methods. Our experiments demonstrate the importance of selecting informative reviews resulting in improved quality of summaries and reduced hallucinations.

Comments:	EMNLP 2021
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2109.04325 [cs.CL]
	(or arXiv:2109.04325v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.04325

Submission history

From: Arthur Bražinskas [view email]
[v1] Thu, 9 Sep 2021 15:01:43 UTC (1,736 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-09

Change to browse by:

cs
cs.AI
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Arthur Brazinskas
Mirella Lapata
Ivan Titov

export BibTeX citation

Computer Science > Computation and Language

Title:Learning Opinion Summarizers by Selecting Informative Reviews

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Learning Opinion Summarizers by Selecting Informative Reviews

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators