Auditing AI models for Verified Deployment under Semantic Specifications

Bharadhwaj, Homanga; Huang, De-An; Xiao, Chaowei; Anandkumar, Anima; Garg, Animesh

arXiv:2109.12456 (cs)

[Submitted on 25 Sep 2021 (v1), last revised 1 Nov 2021 (this version, v2)]

Abstract: Auditing trained deep learning (DL) models prior to deployment is vital for preventing unintended consequences. One of the biggest challenges in auditing is the lack of human-interpretable specifications for the DL models that are directly useful to the auditor. We address this challenge through a sequence of semantically-aligned unit tests, where each unit test verifies whether a predefined specification (e.g., accuracy over 95%) is satisfied with respect to controlled and semantically aligned variations in the input space (e.g., in face recognition, the angle relative to the camera). We enable such unit tests through variations in a semantically-interpretable latent space of a generative model. Further, we conduct certified training for the DL model through a shared latent space representation with the generative model. With evaluations on four different datasets, covering images of chest X-rays, human faces, ImageNet classes, and towers, we show how AuditAI allows us to obtain controlled variations for certified training. Thus, our framework, AuditAI, bridges the gap between semantically-aligned formal verification and scalability. A blog post accompanying the paper is available at this https URL.
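
To make the idea of a semantically-aligned unit test concrete, the following is a minimal sketch, not the authors' implementation. It assumes a toy setting: `decode` is a hypothetical stand-in for a generative model's decoder, `classify` is a hypothetical stand-in for the model under audit, and `angle` is an assumed semantic latent dimension (loosely analogous to the face-angle example in the abstract). The test sweeps that one latent dimension over a grid and checks the accuracy specification empirically; it does not reproduce the certified training or formal verification mentioned in the abstract.

```python
import math


def decode(label, angle):
    """Hypothetical decoder: maps a latent code (label, angle) to a 2-D input.

    A real generative model would produce an image; here the class signal
    simply weakens as the angle moves away from frontal (angle = 0).
    """
    sign = 1.0 if label == 1 else -1.0
    return (sign * math.cos(angle), math.sin(angle))


def classify(x):
    """Hypothetical model under audit: thresholds the first input feature."""
    return 1 if x[0] > 0.3 else 0


def unit_test(max_angle, spec_accuracy=0.95, steps=101):
    """Check an accuracy specification over a sweep of one latent dimension.

    Sweeps angle over an evenly spaced grid in [-max_angle, max_angle] and
    returns (passed, accuracy). This is an empirical check, not a certificate.
    """
    correct = 0
    total = 0
    for i in range(steps):
        angle = -max_angle + 2.0 * max_angle * i / (steps - 1)
        for label in (0, 1):
            correct += int(classify(decode(label, angle)) == label)
            total += 1
    accuracy = correct / total
    return accuracy >= spec_accuracy, accuracy


def largest_passing_range(candidates, spec_accuracy=0.95):
    """Return the largest candidate range whose unit test passes, or None."""
    passing = [a for a in candidates if unit_test(a, spec_accuracy)[0]]
    return max(passing) if passing else None


if __name__ == "__main__":
    for max_angle in (0.5, 1.0, 1.5):
        passed, accuracy = unit_test(max_angle)
        print(f"angle range ±{max_angle}: accuracy={accuracy:.3f} passed={passed}")
```

In this toy setting, an auditor could run the test over a sequence of increasing ranges and report the largest range for which the specification still holds, which is one plausible reading of "a sequence of semantically-aligned unit tests".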

Comments:	Preprint; Under review
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2109.12456 [cs.LG]
	(or arXiv:2109.12456v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.12456

Submission history

From: Homanga Bharadhwaj
[v1] Sat, 25 Sep 2021 22:53:24 UTC (4,236 KB)
[v2] Mon, 1 Nov 2021 15:33:09 UTC (5,575 KB)
