Learning Canonical 3D Object Representation for Fine-Grained Recognition

Joung, Sunghun; Kim, Seungryong; Kim, Minsu; Kim, Ig-Jae; Sohn, Kwanghoon

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.04628 (cs)

[Submitted on 10 Aug 2021]

Title:Learning Canonical 3D Object Representation for Fine-Grained Recognition

Authors:Sunghun Joung, Seungryong Kim, Minsu Kim, Ig-Jae Kim, Kwanghoon Sohn

View PDF

Abstract:We propose a novel framework for fine-grained object recognition that learns to recover object variation in 3D space from a single image, trained on an image collection without using any ground-truth 3D annotation. We accomplish this by representing an object as a composition of 3D shape and its appearance, while eliminating the effect of camera viewpoint, in a canonical configuration. Unlike conventional methods modeling spatial variation in 2D images only, our method is capable of reconfiguring the appearance feature in a canonical 3D space, thus enabling the subsequent object classifier to be invariant under 3D geometric variation. Our representation also allows us to go beyond existing methods, by incorporating 3D shape variation as an additional cue for object recognition. To learn the model without ground-truth 3D annotation, we deploy a differentiable renderer in an analysis-by-synthesis framework. By incorporating 3D shape and appearance jointly in a deep representation, our method learns the discriminative representation of the object and achieves competitive performance on fine-grained image recognition and vehicle re-identification. We also demonstrate that the performance of 3D shape reconstruction is improved by learning fine-grained shape deformation in a boosting manner.

Comments:	ICCV 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.04628 [cs.CV]
	(or arXiv:2108.04628v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.04628

Submission history

From: Sunghun Joung [view email]
[v1] Tue, 10 Aug 2021 12:19:34 UTC (3,721 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Canonical 3D Object Representation for Fine-Grained Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Canonical 3D Object Representation for Fine-Grained Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators