Learning to Superoptimize Real-world Programs

Shypula, Alex; Yin, Pengcheng; Lacomis, Jeremy; Goues, Claire Le; Schwartz, Edward; Neubig, Graham

Computer Science > Machine Learning

arXiv:2109.13498 (cs)

[Submitted on 28 Sep 2021 (v1), last revised 4 Apr 2022 (this version, v2)]

Title:Learning to Superoptimize Real-world Programs

Authors:Alex Shypula, Pengcheng Yin, Jeremy Lacomis, Claire Le Goues, Edward Schwartz, Graham Neubig

View PDF

Abstract:Program optimization is the process of modifying software to execute more efficiently. Superoptimizers attempt to find the optimal program by employing significantly more expensive search and constraint solving techniques. Generally, these methods do not scale well to programs in real development scenarios, and as a result, superoptimization has largely been confined to small-scale, domain-specific, and/or synthetic program benchmarks. In this paper, we propose a framework to learn to superoptimize real-world programs by using neural sequence-to-sequence models. We created a dataset consisting of over 25K real-world x86-64 assembly functions mined from open-source projects and propose an approach, Self Imitation Learning for Optimization (SILO) that is easy to implement and outperforms a standard policy gradient learning approach on our dataset. Our method, SILO, superoptimizes 5.9% of our test set when compared with the gcc version 10.3 compiler's aggressive optimization level -O3. We also report that SILO's rate of superoptimization on our test set is over five times that of a standard policy gradient approach and a model pre-trained on compiler optimization demonstration.

Comments:	Best Paper, ICLR 2022 Deep Learning for Code workshop
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Programming Languages (cs.PL); Software Engineering (cs.SE)
Cite as:	arXiv:2109.13498 [cs.LG]
	(or arXiv:2109.13498v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.13498

Submission history

From: Alexander Shypula [view email]
[v1] Tue, 28 Sep 2021 05:33:21 UTC (225 KB)
[v2] Mon, 4 Apr 2022 21:09:35 UTC (240 KB)

Computer Science > Machine Learning

Title:Learning to Superoptimize Real-world Programs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning to Superoptimize Real-world Programs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators