Context-NER : Contextual Phrase Generation at Scale

Gupta, Himanshu; Verma, Shreyas; Mashetty, Santosh; Mishra, Swaroop

Computer Science > Information Retrieval

arXiv:2109.08079 (cs)

[Submitted on 16 Sep 2021 (v1), last revised 8 Jun 2023 (this version, v4)]

Title:Context-NER : Contextual Phrase Generation at Scale

Authors:Himanshu Gupta, Shreyas Verma, Santosh Mashetty, Swaroop Mishra

View PDF

Abstract:Named Entity Recognition (NER) has seen significant progress in recent years, with numerous state-of-the-art (SOTA) models achieving high performance. However, very few studies have focused on the generation of entities' context. In this paper, we introduce CONTEXT-NER, a task that aims to generate the relevant context for entities in a sentence, where the context is a phrase describing the entity but not necessarily present in the sentence. To facilitate research in this task, we also present the EDGAR10-Q dataset, which consists of annual and quarterly reports from the top 1500 publicly traded companies. The dataset is the largest of its kind, containing 1M sentences, 2.8M entities, and an average of 35 tokens per sentence, making it a challenging dataset. We propose a baseline approach that combines a phrase generation algorithm with inferencing using a 220M language model, achieving a ROUGE-L score of 27% on the test split. Additionally, we perform a one-shot inference with ChatGPT, which obtains a 30% ROUGE-L, highlighting the difficulty of the dataset. We also evaluate models such as T5 and BART, which achieve a maximum ROUGE-L of 49% after supervised finetuning on EDGAR10-Q. We also find that T5-large, when pre-finetuned on EDGAR10-Q, achieve SOTA results on downstream finance tasks such as Headline, FPB, and FiQA SA, outperforming vanilla version by 10.81 points. To our surprise, this 66x smaller pre-finetuned model also surpasses the finance-specific LLM BloombergGPT-50B by 15 points. We hope that our dataset and generated artifacts will encourage further research in this direction, leading to the development of more sophisticated language models for financial text analysis

Comments:	29 pages, 5 Figures, 2 AlgorithmS, 17 Tables. Accepted in NeurIPS 2022 - Efficient Natural Language and Speech Processing (ENLSP) Workshop
Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2109.08079 [cs.IR]
	(or arXiv:2109.08079v4 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2109.08079

Submission history

From: Himanshu Gupta [view email]
[v1] Thu, 16 Sep 2021 16:10:05 UTC (1,168 KB)
[v2] Thu, 27 Oct 2022 05:33:28 UTC (350 KB)
[v3] Fri, 28 Oct 2022 04:49:28 UTC (350 KB)
[v4] Thu, 8 Jun 2023 18:33:01 UTC (659 KB)

Computer Science > Information Retrieval

Title:Context-NER : Contextual Phrase Generation at Scale

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Context-NER : Contextual Phrase Generation at Scale

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators