Does Pretraining for Summarization Require Knowledge Transfer?

Krishna, Kundan; Bigham, Jeffrey; Lipton, Zachary C.

Computer Science > Computation and Language

arXiv:2109.04953 (cs)

[Submitted on 10 Sep 2021]

Title:Does Pretraining for Summarization Require Knowledge Transfer?

Authors:Kundan Krishna, Jeffrey Bigham, Zachary C. Lipton

View PDF

Abstract:Pretraining techniques leveraging enormous datasets have driven recent advances in text summarization. While folk explanations suggest that knowledge transfer accounts for pretraining's benefits, little is known about why it works or what makes a pretraining task or dataset suitable. In this paper, we challenge the knowledge transfer story, showing that pretraining on documents consisting of character n-grams selected at random, we can nearly match the performance of models pretrained on real corpora. This work holds the promise of eliminating upstream corpora, which may alleviate some concerns over offensive language, bias, and copyright issues. To see whether the small residual benefit of using real data could be accounted for by the structure of the pretraining task, we design several tasks motivated by a qualitative study of summarization corpora. However, these tasks confer no appreciable benefit, leaving open the possibility of a small role for knowledge transfer.

Comments:	Camera-ready for Findings of EMNLP 2021
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2109.04953 [cs.CL]
	(or arXiv:2109.04953v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.04953

Submission history

From: Kundan Krishna [view email]
[v1] Fri, 10 Sep 2021 15:54:15 UTC (107 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-09

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Kundan Krishna
Jeffrey P. Bigham
Zachary C. Lipton

export BibTeX citation

Computer Science > Computation and Language

Title:Does Pretraining for Summarization Require Knowledge Transfer?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Does Pretraining for Summarization Require Knowledge Transfer?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators