GeoCLR: Georeference Contrastive Learning for Efficient Seafloor Image Interpretation

Yamada, Takaki; Prügel-Bennett, Adam; Williams, Stefan B.; Pizarro, Oscar; Thornton, Blair

doi:10.55417/fr.2022037


arXiv:2108.06421 (cs)

[Submitted on 13 Aug 2021 (v1), last revised 26 Jun 2022 (this version, v2)]



Abstract: This paper describes Georeference Contrastive Learning of visual Representation (GeoCLR) for efficient training of deep-learning Convolutional Neural Networks (CNNs). The method leverages georeference information by forming a similar image pair from images taken at nearby locations and contrasting it with an image pair whose images are far apart. The underlying assumption is that images gathered within a short distance of each other are more likely to have a similar visual appearance. This assumption is reasonably satisfied in seafloor robotic imaging applications, where image footprints are limited to edge lengths of a few metres and images are taken so that they overlap along a vehicle's trajectory, whereas seafloor substrates and habitats have patch sizes that are far larger. A key advantage of this method is that it is self-supervised and does not require any human input for CNN training. The method is also computationally efficient: results can be generated between dives during multi-day Autonomous Underwater Vehicle (AUV) missions using computational resources that would be accessible during most oceanic field trials. We apply GeoCLR to habitat classification on a dataset of ~86k images gathered using an AUV. We demonstrate how the latent representations generated by GeoCLR can be used to efficiently guide human annotation efforts, where the semi-supervised framework improves classification accuracy by an average of 10.2% compared to the state-of-the-art SimCLR using the same CNN and an equivalent number of human annotations for training.
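
The pairing idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the fixed distance threshold `radius`, the fallback to the image itself, and the use of a SimCLR-style NT-Xent loss are all assumptions made for illustration; the abstract does not specify the sampling rule, the loss, the augmentations, or the CNN. Embeddings are passed in as plain arrays so the example stays self-contained.

```python
import numpy as np


def sample_nearby_partner(positions, radius, rng):
    """For each image, pick the index of a random image within `radius`.

    `positions` holds one (x, y) georeference per image, in metres.
    An image with no neighbour inside the radius is paired with itself
    (hypothetical fallback, chosen only to keep the sketch simple).
    """
    positions = np.asarray(positions, dtype=float)
    n = len(positions)
    # Pairwise Euclidean distances between image positions, shape (n, n).
    dists = np.linalg.norm(positions[:, None, :] - positions[None, :, :], axis=-1)
    partners = np.arange(n)
    for i in range(n):
        candidates = np.flatnonzero((dists[i] <= radius) & (np.arange(n) != i))
        if candidates.size:
            partners[i] = rng.choice(candidates)
    return partners


def nt_xent_loss(z_a, z_b, temperature=0.5):
    """SimCLR-style NT-Xent loss; row i of z_a and row i of z_b form a positive pair."""
    z = np.concatenate([z_a, z_b], axis=0)
    z = z / np.linalg.norm(z, axis=1, keepdims=True)  # cosine similarity via unit vectors
    sim = z @ z.T / temperature
    np.fill_diagonal(sim, -np.inf)  # an embedding is never its own negative
    n = len(z_a)
    # Index of the positive partner for each of the 2n rows.
    pos_idx = np.concatenate([np.arange(n, 2 * n), np.arange(0, n)])
    # Numerically stable log-softmax over each row.
    row_max = sim.max(axis=1, keepdims=True)
    log_prob = sim - row_max - np.log(np.exp(sim - row_max).sum(axis=1, keepdims=True))
    return -log_prob[np.arange(2 * n), pos_idx].mean()
```

In use, the embeddings of each image and of its sampled nearby partner would be passed as `z_a` and `z_b`, so that physically close images are pulled together in the latent space while the remaining images in the batch act as negatives.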

Comments:	30 pages, 9 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.06421 [cs.CV]
	(or arXiv:2108.06421v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.06421
Journal reference:	Field Robotics 2 (2022) 1134-1155
Related DOI:	https://doi.org/10.55417/fr.2022037

Submission history

From: Takaki Yamada
[v1] Fri, 13 Aug 2021 22:42:34 UTC (6,332 KB)
[v2] Sun, 26 Jun 2022 14:15:48 UTC (6,560 KB)
