Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition

Xu, Ziwei; Wang, Guangzhi; Wong, Yongkang; Kankanhalli, Mohan

doi:10.1109/TMM.2021.3104411

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.04603 (cs)

[Submitted on 10 Aug 2021]

Title:Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition

Authors:Ziwei Xu, Guangzhi Wang, Yongkang Wong, Mohan Kankanhalli

View PDF

Abstract:This paper proposes a novel model for recognizing images with composite attribute-object concepts, notably for composite concepts that are unseen during model training. We aim to explore the three key properties required by the task --- relation-aware, consistent, and decoupled --- to learn rich and robust features for primitive concepts that compose attribute-object pairs. To this end, we propose the Blocked Message Passing Network (BMP-Net). The model consists of two modules. The concept module generates semantically meaningful features for primitive concepts, whereas the visual module extracts visual features for attributes and objects from input images. A message passing mechanism is used in the concept module to capture the relations between primitive concepts. Furthermore, to prevent the model from being biased towards seen composite concepts and reduce the entanglement between attributes and objects, we propose a blocking mechanism that equalizes the information available to the model for both seen and unseen concepts. Extensive experiments and ablation studies on two benchmarks show the efficacy of the proposed model.

Comments:	Accepted by IEEE Transactions on Multimedia
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
Cite as:	arXiv:2108.04603 [cs.CV]
	(or arXiv:2108.04603v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.04603
Related DOI:	https://doi.org/10.1109/TMM.2021.3104411

Submission history

From: Ziwei Xu [view email]
[v1] Tue, 10 Aug 2021 11:23:03 UTC (11,986 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators