PatchCleanser: Certifiably Robust Defense against Adversarial Patches for Any Image Classifier

Xiang, Chong; Mahloujifar, Saeed; Mittal, Prateek

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.09135 (cs)

[Submitted on 20 Aug 2021 (v1), last revised 8 Apr 2022 (this version, v2)]

Title:PatchCleanser: Certifiably Robust Defense against Adversarial Patches for Any Image Classifier

Authors:Chong Xiang, Saeed Mahloujifar, Prateek Mittal

View PDF

Abstract:The adversarial patch attack against image classification models aims to inject adversarially crafted pixels within a restricted image region (i.e., a patch) for inducing model misclassification. This attack can be realized in the physical world by printing and attaching the patch to the victim object; thus, it imposes a real-world threat to computer vision systems. To counter this threat, we design PatchCleanser as a certifiably robust defense against adversarial patches. In PatchCleanser, we perform two rounds of pixel masking on the input image to neutralize the effect of the adversarial patch. This image-space operation makes PatchCleanser compatible with any state-of-the-art image classifier for achieving high accuracy. Furthermore, we can prove that PatchCleanser will always predict the correct class labels on certain images against any adaptive white-box attacker within our threat model, achieving certified robustness. We extensively evaluate PatchCleanser on the ImageNet, ImageNette, CIFAR-10, CIFAR-100, SVHN, and Flowers-102 datasets and demonstrate that our defense achieves similar clean accuracy as state-of-the-art classification models and also significantly improves certified robustness from prior works. Remarkably, PatchCleanser achieves 83.9% top-1 clean accuracy and 62.1% top-1 certified robust accuracy against a 2%-pixel square patch anywhere on the image for the 1000-class ImageNet dataset.

Comments:	USENIX Security Symposium 2022; extended technical report
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
Cite as:	arXiv:2108.09135 [cs.CV]
	(or arXiv:2108.09135v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.09135

Submission history

From: Chong Xiang [view email]
[v1] Fri, 20 Aug 2021 12:09:33 UTC (1,875 KB)
[v2] Fri, 8 Apr 2022 18:52:45 UTC (4,889 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:PatchCleanser: Certifiably Robust Defense against Adversarial Patches for Any Image Classifier

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:PatchCleanser: Certifiably Robust Defense against Adversarial Patches for Any Image Classifier

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators