PGTRNet: Two-phase Weakly Supervised Object Detection with Pseudo Ground Truth Refinement

Wang, Jun; Zhou, Hefeng; Yu, Xiaohan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.11439 (cs)

[Submitted on 25 Aug 2021 (v1), last revised 17 Mar 2022 (this version, v2)]

Title:PGTRNet: Two-phase Weakly Supervised Object Detection with Pseudo Ground Truth Refinement

Authors:Jun Wang, Hefeng Zhou, Xiaohan Yu

View PDF

Abstract:Current state-of-the-art weakly supervised object detection (WSOD) studies mainly follow a two-stage training strategy which integrates a fully supervised detector (FSD) with a pure WSOD model. There are two main problems hindering the performance of the two-phase WSOD approaches, i.e., insufficient learning problem and strict reliance between the FSD and the pseudo ground truth (PGT) generated by the WSOD model. This paper proposes pseudo ground truth refinement network (PGTRNet), a simple yet effective method without introducing any extra learnable parameters, to cope with these problems. PGTRNet utilizes multiple bounding boxes to establish the PGT, mitigating the insufficient learning problem. Besides, we propose a novel online PGT refinement approach to steadily improve the quality of PGT by fully taking advantage of the power of FSD during the second-phase training, decoupling the first and second-phase models. Elaborate experiments are conducted on the PASCAL VOC 2007 benchmark to verify the effectiveness of our methods. Experimental results demonstrate that PGTRNet boosts the backbone model by 2.1% mAP and achieves the state-of-the-art performance.

Comments:	This paper was accepted by ICASSP2022. arXiv admin note: substantial text overlap with arXiv:2104.00231
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.11439 [cs.CV]
	(or arXiv:2108.11439v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.11439

Submission history

From: Jun Wang [view email]
[v1] Wed, 25 Aug 2021 19:20:49 UTC (1,849 KB)
[v2] Thu, 17 Mar 2022 19:00:16 UTC (1,847 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:PGTRNet: Two-phase Weakly Supervised Object Detection with Pseudo Ground Truth Refinement

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:PGTRNet: Two-phase Weakly Supervised Object Detection with Pseudo Ground Truth Refinement

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators