论文标题
基于网格的人类行动识别的表示
A Grid-based Representation for Human Action Recognition
论文作者
论文摘要
视频中的人类行动识别(HAR)是计算机视觉中的基本研究主题。它主要由理解基于一系列视觉观察的序列所采取的动作。近年来,Har目睹了重大进展,尤其是随着深度学习模型的出现。但是,大多数现有的行动识别方法都取决于与此任务并不总是相关的信息,并且在融合时间信息的方式上受到限制。在本文中,我们提出了一种新型的人类行动识别方法,该方法将动作的最歧视性外观信息具有明确的关注,以对代表性姿势特征进行明确关注,并将其描述为新的紧凑型网格表示。我们在几个基准数据集上测试了我们的GRAR(基于网格的表述)方法,表明尽管有阶级内的外观变化和遮挡挑战,但我们的模型仍可以准确地识别人类的行为。
Human action recognition (HAR) in videos is a fundamental research topic in computer vision. It consists mainly in understanding actions performed by humans based on a sequence of visual observations. In recent years, HAR have witnessed significant progress, especially with the emergence of deep learning models. However, most of existing approaches for action recognition rely on information that is not always relevant for this task, and are limited in the way they fuse the temporal information. In this paper, we propose a novel method for human action recognition that encodes efficiently the most discriminative appearance information of an action with explicit attention on representative pose features, into a new compact grid representation. Our GRAR (Grid-based Representation for Action Recognition) method is tested on several benchmark datasets demonstrating that our model can accurately recognize human actions, despite intra-class appearance variations and occlusion challenges.