Abstract
In Pose-based Video Anomaly Detection, prior art is rooted in the assumption that abnormal events can mostly be attributed to uncommon human behavior. Rather than relying on skeleton representations of humans, however, we investigate the potential of learning recurrent motion patterns of normal human behavior from 2D contours. The shift from human skeletons to contours retains all the advantages of pose-based methods, such as increased object anonymization, and is hypothesized to leave open the opportunity for future research to cover more object categories. We formulate the problem both as a regression task and as a classification task, and additionally explore two distinct data representation techniques for contours. To further reduce the computational complexity of Pose-based Video Anomaly Detection solutions, all methods in this study are based on shallow neural networks from the field of deep learning and are evaluated on the three most prominent benchmark datasets in Video Anomaly Detection and their human-related counterparts, six datasets in total. Our results indicate that this novel perspective on Pose-based Video Anomaly Detection marks a promising direction for future research.