Synerise at RecSys 2021: Twitter user engagement prediction with a fast neural model

Daniluk, Michał; Dąbrowski, Jacek; Rychalska, Barbara; Gołuchowski, Konrad

Computer Science > Information Retrieval

arXiv:2109.12985 (cs)

[Submitted on 23 Sep 2021 (v1), last revised 28 Sep 2021 (this version, v2)]

Title:Synerise at RecSys 2021: Twitter user engagement prediction with a fast neural model

Authors:Michał Daniluk, Jacek Dąbrowski, Barbara Rychalska, Konrad Gołuchowski

View PDF

Abstract:In this paper we present our 2nd place solution to ACM RecSys 2021 Challenge organized by Twitter. The challenge aims to predict user engagement for a set of tweets, offering an exceptionally large data set of 1 billion data points sampled from over four weeks of real Twitter interactions. Each data point contains multiple sources of information, such as tweet text along with engagement features, user features, and tweet features. The challenge brings the problem close to a real production environment by introducing strict latency constraints in the model evaluation phase: the average inference time for single tweet engagement prediction is limited to 6ms on a single CPU core with 64GB memory. Our proposed model relies on extensive feature engineering performed with methods such as the Efficient Manifold Density Estimator (EMDE) - our previously introduced algorithm based on Locality Sensitive Hashing method, and novel Fourier Feature Encoding, among others. In total, we create numerous features describing a user's Twitter account status and the content of a tweet. In order to adhere to the strict latency constraints, the underlying model is a simple residual feed-forward neural network. The system is a variation of our previous methods which proved successful in KDD Cup 2021, WSDM Challenge 2021, and SIGIR eCom Challenge 2020. We release the source code at: this https URL

Subjects:	Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2109.12985 [cs.IR]
	(or arXiv:2109.12985v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2109.12985

Submission history

From: Michał Daniluk [view email]
[v1] Thu, 23 Sep 2021 13:51:09 UTC (470 KB)
[v2] Tue, 28 Sep 2021 14:43:12 UTC (472 KB)

Computer Science > Information Retrieval

Title:Synerise at RecSys 2021: Twitter user engagement prediction with a fast neural model

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Synerise at RecSys 2021: Twitter user engagement prediction with a fast neural model

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators