Action Recognition in Real-World Ambient Assisted Living Environment

Abstract

A growing ageing population, together with older adults' preference to remain independent in their own homes, calls for proactive strategies to ensure their safety and support. Ambient Assisted Living (AAL) technologies have emerged to facilitate ageing in place by providing continuous monitoring and assistance within the home. Within AAL, action recognition plays a crucial role in interpreting human activities and detecting incidents such as falls, mobility decline, or unusual behaviours that may signal worsening health. However, action recognition in practical AAL applications faces several challenges, including occlusions, noisy data, and the need for real-time performance. Although progress has been made in accuracy, robustness to noise, and computational efficiency, balancing all three remains difficult. To address this, this paper introduces the Robust and Efficient Temporal Convolution Network (RE-TCN), which comprises three main elements: Adaptive Temporal Weighting (ATW), Depthwise Separable Convolutions (DSC), and data augmentation techniques. These elements aim to improve the model's accuracy, robustness to noise and occlusion, and computational efficiency in real-world AAL settings. In experiments on four benchmark datasets (NTU RGB+D 60, Northwestern-UCLA, SHREC'17, and DHG-14/28), RE-TCN outperforms existing models in accuracy and in robustness to noise and occlusion. The code is publicly available at: https://github.com/Gbouna/RE-TCN
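To illustrate why depthwise separable convolutions are commonly associated with computational efficiency, the following minimal sketch compares the weight count of a standard 1D temporal convolution with that of its depthwise separable counterpart (a per-channel temporal filter followed by a 1x1 pointwise convolution). The function names and the channel and kernel sizes are illustrative assumptions and are not taken from the RE-TCN implementation; bias terms are omitted.

```python
def standard_conv_params(c_in, c_out, kernel_size):
    """Weights in a standard 1D temporal convolution (bias omitted)."""
    return c_in * c_out * kernel_size


def depthwise_separable_params(c_in, c_out, kernel_size):
    """Weights in a depthwise separable 1D convolution (bias omitted).

    Depthwise step: one temporal filter of length kernel_size per input channel.
    Pointwise step: a 1x1 convolution mixing c_in channels into c_out channels.
    """
    depthwise = c_in * kernel_size
    pointwise = c_in * c_out
    return depthwise + pointwise


if __name__ == "__main__":
    # Hypothetical layer sizes, chosen only for illustration.
    c_in, c_out, k = 64, 64, 9
    std = standard_conv_params(c_in, c_out, k)
    dsc = depthwise_separable_params(c_in, c_out, k)
    print(f"standard: {std}, depthwise separable: {dsc}, ratio: {std / dsc:.2f}")
```

For these assumed sizes, the standard layer has 36864 weights and the separable one 4672, roughly an eightfold reduction. The saving grows with the kernel size and the number of output channels.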