SIGN: Spatial-information Incorporated Generative Network for Generalized Zero-shot Semantic Segmentation

Cheng, Jiaxin; Nandi, Soumyaroop; Natarajan, Prem; Abd-Almageed, Wael

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.12517 (cs)

[Submitted on 27 Aug 2021]

Title:SIGN: Spatial-information Incorporated Generative Network for Generalized Zero-shot Semantic Segmentation

Authors:Jiaxin Cheng, Soumyaroop Nandi, Prem Natarajan, Wael Abd-Almageed

View PDF

Abstract:Unlike conventional zero-shot classification, zero-shot semantic segmentation predicts a class label at the pixel level instead of the image level. When solving zero-shot semantic segmentation problems, the need for pixel-level prediction with surrounding context motivates us to incorporate spatial information using positional encoding. We improve standard positional encoding by introducing the concept of Relative Positional Encoding, which integrates spatial information at the feature level and can handle arbitrary image sizes. Furthermore, while self-training is widely used in zero-shot semantic segmentation to generate pseudo-labels, we propose a new knowledge-distillation-inspired self-training strategy, namely Annealed Self-Training, which can automatically assign different importance to pseudo-labels to improve performance. We systematically study the proposed Relative Positional Encoding and Annealed Self-Training in a comprehensive experimental evaluation, and our empirical results confirm the effectiveness of our method on three benchmark datasets.

Comments:	Accepted in ICCV 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.12517 [cs.CV]
	(or arXiv:2108.12517v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.12517

Submission history

From: Jiaxin Cheng [view email]
[v1] Fri, 27 Aug 2021 22:18:24 UTC (3,187 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2021-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jiaxin Cheng
Prem Natarajan
Wael Abd-Almageed

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:SIGN: Spatial-information Incorporated Generative Network for Generalized Zero-shot Semantic Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SIGN: Spatial-information Incorporated Generative Network for Generalized Zero-shot Semantic Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators