Protein Folding Neural Networks Are Not Robust

Jha, Sumit Kumar; Ramanathan, Arvind; Ewetz, Rickard; Velasquez, Alvaro; Jha, Susmit

Quantitative Biology > Biomolecules

arXiv:2109.04460 (q-bio)

[Submitted on 9 Sep 2021 (v1), last revised 19 Sep 2021 (this version, v2)]

Title:Protein Folding Neural Networks Are Not Robust

Authors:Sumit Kumar Jha, Arvind Ramanathan, Rickard Ewetz, Alvaro Velasquez, Susmit Jha

View PDF

Abstract:Deep neural networks such as AlphaFold and RoseTTAFold predict remarkably accurate structures of proteins compared to other algorithmic approaches. It is known that biologically small perturbations in the protein sequence do not lead to drastic changes in the protein structure. In this paper, we demonstrate that RoseTTAFold does not exhibit such a robustness despite its high accuracy, and biologically small perturbations for some input sequences result in radically different predicted protein structures. This raises the challenge of detecting when these predicted protein structures cannot be trusted. We define the robustness measure for the predicted structure of a protein sequence to be the inverse of the root-mean-square distance (RMSD) in the predicted structure and the structure of its adversarially perturbed sequence. We use adversarial attack methods to create adversarial protein sequences, and show that the RMSD in the predicted protein structure ranges from 0.119Å to 34.162Å when the adversarial perturbations are bounded by 20 units in the BLOSUM62 distance. This demonstrates very high variance in the robustness measure of the predicted structures. We show that the magnitude of the correlation (0.917) between our robustness measure and the RMSD between the predicted structure and the ground truth is high, that is, the predictions with low robustness measure cannot be trusted. This is the first paper demonstrating the susceptibility of RoseTTAFold to adversarial attacks.

Comments:	8 pages, 5 figures
Subjects:	Biomolecules (q-bio.BM); Machine Learning (cs.LG)
Cite as:	arXiv:2109.04460 [q-bio.BM]
	(or arXiv:2109.04460v2 [q-bio.BM] for this version)
	https://doi.org/10.48550/arXiv.2109.04460

Submission history

From: Sumit Kumar Jha [view email]
[v1] Thu, 9 Sep 2021 17:57:19 UTC (19,315 KB)
[v2] Sun, 19 Sep 2021 20:23:19 UTC (19,315 KB)

Quantitative Biology > Biomolecules

Title:Protein Folding Neural Networks Are Not Robust

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Quantitative Biology > Biomolecules

Title:Protein Folding Neural Networks Are Not Robust

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators