Towards Interpretable Deep Metric Learning with Structural Matching

Zhao, Wenliang; Rao, Yongming; Wang, Ziyi; Lu, Jiwen; Zhou, Jie

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.05889 (cs)

[Submitted on 12 Aug 2021]

Title:Towards Interpretable Deep Metric Learning with Structural Matching

Authors:Wenliang Zhao, Yongming Rao, Ziyi Wang, Jiwen Lu, Jie Zhou

View PDF

Abstract:How do the neural networks distinguish two images? It is of critical importance to understand the matching mechanism of deep models for developing reliable intelligent systems for many risky visual applications such as surveillance and access control. However, most existing deep metric learning methods match the images by comparing feature vectors, which ignores the spatial structure of images and thus lacks interpretability. In this paper, we present a deep interpretable metric learning (DIML) method for more transparent embedding learning. Unlike conventional metric learning methods based on feature vector comparison, we propose a structural matching strategy that explicitly aligns the spatial embeddings by computing an optimal matching flow between feature maps of the two images. Our method enables deep models to learn metrics in a more human-friendly way, where the similarity of two images can be decomposed to several part-wise similarities and their contributions to the overall similarity. Our method is model-agnostic, which can be applied to off-the-shelf backbone networks and metric learning methods. We evaluate our method on three major benchmarks of deep metric learning including CUB200-2011, Cars196, and Stanford Online Products, and achieve substantial improvements over popular metric learning methods with better interpretability. Code is available at this https URL

Comments:	Accepted to ICCV 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2108.05889 [cs.CV]
	(or arXiv:2108.05889v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.05889

Submission history

From: Wenliang Zhao [view email]
[v1] Thu, 12 Aug 2021 17:59:09 UTC (5,279 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Interpretable Deep Metric Learning with Structural Matching

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Interpretable Deep Metric Learning with Structural Matching

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators