Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing

Kanakarajan, Kamal Raj; Kundumani, Bhuvana; Sankarasubbu, Malaikannan

arXiv:2109.10847 (cs)

[Submitted on 22 Sep 2021 (v1), last revised 23 Sep 2021 (this version, v2)]

Abstract: Recent progress in Natural Language Processing has produced several State-of-the-Art (SOTA) pretrained models that can be finetuned for specific tasks. These large models, with billions of parameters and trained on numerous GPUs/TPUs over weeks, lead the benchmark leaderboards. In this paper, we discuss the need for a benchmark for cost- and time-effective smaller models trained on a single GPU. Such a benchmark will enable researchers with resource constraints to experiment with novel and innovative ideas on tokenization, pretraining tasks, architecture, fine-tuning methods, etc. We set up Small-Bench NLP, a benchmark for small, efficient neural language models trained on a single GPU. The Small-Bench NLP benchmark comprises eight NLP tasks on the publicly available GLUE datasets and a leaderboard to track the progress of the community. Our ELECTRA-DeBERTa small model architecture (15M parameters) achieves an average score of 81.53, which is comparable to BERT-Base's 82.20 (110M parameters). Our models, code and leaderboard are available at this https URL

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2109.10847 [cs.LG]
	(or arXiv:2109.10847v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.10847

Submission history

From: Kamal Raj Kanakarajan
[v1] Wed, 22 Sep 2021 17:18:55 UTC (52 KB)
[v2] Thu, 23 Sep 2021 06:19:05 UTC (52 KB)
