G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation

Yao, Lewei; Pi, Renjie; Xu, Hang; Zhang, Wei; Li, Zhenguo; Zhang, Tong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.07482 (cs)

[Submitted on 17 Aug 2021 (v1), last revised 12 Oct 2021 (this version, v3)]

Title:G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation

Authors:Lewei Yao, Renjie Pi, Hang Xu, Wei Zhang, Zhenguo Li, Tong Zhang

View PDF

Abstract:In this paper, we investigate the knowledge distillation (KD) strategy for object detection and propose an effective framework applicable to both homogeneous and heterogeneous student-teacher pairs. The conventional feature imitation paradigm introduces imitation masks to focus on informative foreground areas while excluding the background noises. However, we find that those methods fail to fully utilize the semantic information in all feature pyramid levels, which leads to inefficiency for knowledge distillation between FPN-based detectors. To this end, we propose a novel semantic-guided feature imitation technique, which automatically performs soft matching between feature pairs across all pyramid levels to provide the optimal guidance to the student. To push the envelop even further, we introduce contrastive distillation to effectively capture the information encoded in the relationship between different feature regions. Finally, we propose a generalized detection KD pipeline, which is capable of distilling both homogeneous and heterogeneous detector pairs. Our method consistently outperforms the existing detection KD techniques, and works when (1) components in the framework are used separately and in conjunction; (2) for both homogeneous and heterogenous student-teacher pairs and (3) on multiple detection benchmarks. With a powerful X101-FasterRCNN-Instaboost detector as the teacher, R50-FasterRCNN reaches 44.0% AP, R50-RetinaNet reaches 43.3% AP and R50-FCOS reaches 43.1% AP on COCO dataset.

Comments:	Accepted by ICCV2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.07482 [cs.CV]
	(or arXiv:2108.07482v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.07482

Submission history

From: Renjie Pi [view email]
[v1] Tue, 17 Aug 2021 07:44:27 UTC (1,940 KB)
[v2] Fri, 20 Aug 2021 16:50:54 UTC (1,941 KB)
[v3] Tue, 12 Oct 2021 13:23:48 UTC (3,709 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators