Learning to Downsample for Segmentation of Ultra-High Resolution Images

Jin, Chen; Tanno, Ryutaro; Mertzanidou, Thomy; Panagiotaki, Eleftheria; Alexander, Daniel C.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2109.11071 (cs)

[Submitted on 22 Sep 2021 (v1), last revised 16 Mar 2022 (this version, v2)]

Title:Learning to Downsample for Segmentation of Ultra-High Resolution Images

Authors:Chen Jin, Ryutaro Tanno, Thomy Mertzanidou, Eleftheria Panagiotaki, Daniel C. Alexander

View PDF

Abstract:Many computer vision systems require low-cost segmentation algorithms based on deep learning, either because of the enormous size of input images or limited computational budget. Common solutions uniformly downsample the input images to meet memory constraints, assuming all pixels are equally informative. In this work, we demonstrate that this assumption can harm the segmentation performance because the segmentation difficulty varies spatially. We combat this problem by introducing a learnable downsampling module, which can be optimised together with the given segmentation model in an end-to-end fashion. We formulate the problem of training such downsampling module as optimisation of sampling density distributions over the input images given their low-resolution views. To defend against degenerate solutions (e.g. over-sampling trivial regions like the backgrounds), we propose a regularisation term that encourages the sampling locations to concentrate around the object boundaries. We find the downsampling module learns to sample more densely at difficult locations, thereby improving the segmentation performance. Our experiments on benchmarks of high-resolution street view, aerial and medical images demonstrate substantial improvements in terms of efficiency-and-accuracy trade-off compared to both uniform downsampling and two recent advanced downsampling techniques.

Comments:	19 pages, 17 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
ACM classes:	I.4.6
Cite as:	arXiv:2109.11071 [cs.CV]
	(or arXiv:2109.11071v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2109.11071
Journal reference:	International Conference on Learning Representations, 2022

Submission history

From: Chen Jin [view email]
[v1] Wed, 22 Sep 2021 23:04:59 UTC (10,417 KB)
[v2] Wed, 16 Mar 2022 01:30:27 UTC (26,680 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Learning to Downsample for Segmentation of Ultra-High Resolution Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning to Downsample for Segmentation of Ultra-High Resolution Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators