InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images

Cherian, Anoop; Pais, Goncalo Dias; Jain, Siddarth; Marks, Tim K.; Sullivan, Alan

arXiv:2108.13865 (cs)

[Submitted on 31 Aug 2021 (v1), last revised 28 Jan 2022 (this version, v2)]

Abstract: In this paper, we present InSeGAN, an unsupervised 3D generative adversarial network (GAN) for segmenting (nearly) identical instances of rigid objects in depth images. Using an analysis-by-synthesis approach, we design a novel GAN architecture to synthesize a multiple-instance depth image with independent control over each instance. InSeGAN takes in a set of code vectors (e.g., random noise vectors), each encoding the 3D pose of an object that is represented by a learned implicit object template. The generator has two distinct modules. The first module, the instance feature generator, uses each encoded pose to transform the implicit template into a feature map representation of each object instance. The second module, the depth image renderer, aggregates all of the single-instance feature maps output by the first module and generates a multiple-instance depth image. A discriminator distinguishes the generated multiple-instance depth images from the distribution of true depth images. To use our model for instance segmentation, we propose an instance pose encoder that learns to take in a generated depth image and reproduce the pose code vectors for all of the object instances. To evaluate our approach, we introduce a new synthetic dataset, "Insta-10", consisting of 100,000 depth images, each with 5 instances of an object from one of 10 classes. Our experiments on Insta-10, as well as on real-world noisy depth images, show that InSeGAN achieves state-of-the-art performance, often outperforming prior methods by large margins.
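
The two-module generator described in the abstract can be illustrated with a toy data-flow sketch. This is not the authors' implementation: the code dimension, feature-map shape, random linear maps (`pose_weights`, `render_weights`), and the use of a mean for aggregation are all assumptions chosen only to make the shapes and the per-instance control visible. The discriminator and the instance pose encoder are omitted.

```python
import numpy as np

rng = np.random.default_rng(0)

NUM_INSTANCES = 5            # the abstract states 5 instances per Insta-10 image
CODE_DIM = 8                 # hypothetical size of each pose code vector
FEAT_CH, H, W = 4, 16, 16    # hypothetical feature-map shape

# Hypothetical stand-in for the learned implicit object template.
template = rng.standard_normal((FEAT_CH, H, W))
# Hypothetical random linear maps standing in for learned network weights.
pose_weights = rng.standard_normal((CODE_DIM, FEAT_CH))
render_weights = rng.standard_normal(FEAT_CH)


def instance_feature_generator(code):
    """Module 1 (toy): transform the shared template using one pose code."""
    scale = np.tanh(code @ pose_weights)        # shape (FEAT_CH,)
    return template * scale[:, None, None]      # shape (FEAT_CH, H, W)


def depth_image_renderer(instance_features):
    """Module 2 (toy): aggregate single-instance maps, then project to depth."""
    # Assumption: aggregation is a mean over the instance axis.
    aggregated = instance_features.mean(axis=0)               # (FEAT_CH, H, W)
    return np.tensordot(render_weights, aggregated, axes=1)   # (H, W)


def generate(codes):
    """Map a set of pose codes to per-instance features and one depth image."""
    feats = np.stack([instance_feature_generator(z) for z in codes])
    return feats, depth_image_renderer(feats)


# One random code vector per object instance.
codes = rng.standard_normal((NUM_INSTANCES, CODE_DIM))
features, depth = generate(codes)

# Perturb only the first code: only the first instance's feature map changes.
codes_perturbed = codes.copy()
codes_perturbed[0] += 1.0
features_perturbed, depth_perturbed = generate(codes_perturbed)
```

In this sketch each code vector drives exactly one feature map, which mirrors the "independent control over each instance" property the abstract describes; in the actual model both modules are learned networks trained adversarially.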

Comments:	Accepted at ICCV 2021. Code & data @ this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2108.13865 [cs.CV]
	(or arXiv:2108.13865v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.13865

Submission history

From: Anoop Cherian
[v1] Tue, 31 Aug 2021 14:18:40 UTC (11,775 KB)
[v2] Fri, 28 Jan 2022 17:06:06 UTC (11,775 KB)
