BlockCopy: High-Resolution Video Processing with Block-Sparse Feature Propagation and Online Policies

Verelst, Thomas; Tuytelaars, Tinne

doi:10.1109/ICCV48922.2021.00511

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.09376 (cs)

[Submitted on 20 Aug 2021 (v1), last revised 5 Aug 2022 (this version, v2)]

Title:BlockCopy: High-Resolution Video Processing with Block-Sparse Feature Propagation and Online Policies

Authors:Thomas Verelst, Tinne Tuytelaars

View PDF

Abstract:In this paper we propose BlockCopy, a scheme that accelerates pretrained frame-based CNNs to process video more efficiently, compared to standard frame-by-frame processing. To this end, a lightweight policy network determines important regions in an image, and operations are applied on selected regions only, using custom block-sparse convolutions. Features of non-selected regions are simply copied from the preceding frame, reducing the number of computations and latency. The execution policy is trained using reinforcement learning in an online fashion without requiring ground truth annotations. Our universal framework is demonstrated on dense prediction tasks such as pedestrian detection, instance segmentation and semantic segmentation, using both state of the art (Center and Scale Predictor, MGAN, SwiftNet) and standard baseline networks (Mask-RCNN, DeepLabV3+). BlockCopy achieves significant FLOPS savings and inference speedup with minimal impact on accuracy.

Comments:	Accepted for International Conference on Computer Vision (ICCV 2021)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.09376 [cs.CV]
	(or arXiv:2108.09376v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.09376
Related DOI:	https://doi.org/10.1109/ICCV48922.2021.00511

Submission history

From: Thomas Verelst [view email]
[v1] Fri, 20 Aug 2021 21:16:01 UTC (9,627 KB)
[v2] Fri, 5 Aug 2022 14:21:05 UTC (5,987 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:BlockCopy: High-Resolution Video Processing with Block-Sparse Feature Propagation and Online Policies

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:BlockCopy: High-Resolution Video Processing with Block-Sparse Feature Propagation and Online Policies

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators