Deep Anomaly Discovery From Unlabeled Videos via Normality Advantage and Self-Paced Refinement

Yu, Guang; Wang, Siqi; Cai, Zhiping; Liu, Xinwang; Xu, Chuanfu; Wu, Chengkun

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.01975 (cs)

[Submitted on 4 Aug 2021 (v1), last revised 21 Jun 2022 (this version, v3)]

Title:Deep Anomaly Discovery From Unlabeled Videos via Normality Advantage and Self-Paced Refinement

Authors:Guang Yu, Siqi Wang, Zhiping Cai, Xinwang Liu, Chuanfu Xu, Chengkun Wu

View PDF

Abstract:While classic video anomaly detection (VAD) requires labeled normal videos for training, emerging unsupervised VAD (UVAD) aims to discover anomalies directly from fully unlabeled videos. However, existing UVAD methods still rely on shallow models to perform detection or initialization, and they are evidently inferior to classic VAD methods. This paper proposes a full deep neural network (DNN) based solution that can realize highly effective UVAD. First, we, for the first time, point out that deep reconstruction can be surprisingly effective for UVAD, which inspires us to unveil a property named "normality advantage", i.e., normal events will enjoy lower reconstruction loss when DNN learns to reconstruct unlabeled videos. With this property, we propose Localization based Reconstruction (LBR) as a strong UVAD baseline and a solid foundation of our solution. Second, we propose a novel self-paced refinement (SPR) scheme, which is synthesized into LBR to conduct UVAD. Unlike ordinary self-paced learning that injects more samples in an easy-to-hard manner, the proposed SPR scheme gradually drops samples so that suspicious anomalies can be removed from the learning process. In this way, SPR consolidates normality advantage and enables better UVAD in a more proactive way. Finally, we further design a variant solution that explicitly takes the motion cues into account. The solution evidently enhances the UVAD performance, and it sometimes even surpasses the best classic VAD methods. Experiments show that our solution not only significantly outperforms existing UVAD methods by a wide margin (5% to 9% AUROC), but also enables UVAD to catch up with the mainstream performance of classic VAD.

Comments:	Accepted by CVPR 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.01975 [cs.CV]
	(or arXiv:2108.01975v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.01975

Submission history

From: Guang Yu [view email]
[v1] Wed, 4 Aug 2021 11:31:57 UTC (596 KB)
[v2] Tue, 23 Nov 2021 04:01:41 UTC (643 KB)
[v3] Tue, 21 Jun 2022 08:09:00 UTC (1,031 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Anomaly Discovery From Unlabeled Videos via Normality Advantage and Self-Paced Refinement

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Anomaly Discovery From Unlabeled Videos via Normality Advantage and Self-Paced Refinement

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators