Attention-Based Temporal Encoding Network with Background-Independent Motion Mask for Action Recognition

Document detail

ID

oai:pubmedcentral.nih.gov:8024...

Topic

Research Article

Author

Weng, Zhengkui Jin, Zhipeng Chen, Shuangxi Shen, Quanquan Ren, Xiangyang Li, Wuzhao

Langue

Editor

Hindawi

Year

2021

listing date

12/12/2022

Keywords

network frames inside temporal segmenting motion

Metrics

Abstract

Convolutional neural network (CNN) has been leaping forward in recent years.

However, the high dimensionality, rich human dynamic characteristics, and various kinds of background interference increase difficulty for traditional CNNs in capturing complicated motion data in videos.

A novel framework named the attention-based temporal encoding network (ATEN) with background-independent motion mask (BIMM) is proposed to achieve video action recognition here.

Initially, we introduce one motion segmenting approach on the basis of boundary prior by associating with the minimal geodesic distance inside a weighted graph that is not directed.

Then, we propose one dynamic contrast segmenting strategic procedure for segmenting the object that moves within complicated environments.

Subsequently, we build the BIMM for enhancing the object that moves based on the suppression of the not relevant background inside the respective frame.

Furthermore, we design one long-range attention system inside ATEN, capable of effectively remedying the dependency of sophisticated actions that are not periodic in a long term based on the more automatic focus on the semantical vital frames other than the equal process for overall sampled frames.

For this reason, the attention mechanism is capable of suppressing the temporal redundancy and highlighting the discriminative frames.

Lastly, the framework is assessed by using HMDB51 and UCF101 datasets.

As revealed from the experimentally achieved results, our ATEN with BIMM gains 94.5% and 70.6% accuracy, respectively, which outperforms a number of existing methods on both datasets.

Weng, Zhengkui,Jin, Zhipeng,Chen, Shuangxi,Shen, Quanquan,Ren, Xiangyang,Li, Wuzhao, 2021, Attention-Based Temporal Encoding Network with Background-Independent Motion Mask for Action Recognition, Hindawi

Document

Open Open

Source

Articles recommended by ES/IODE AI

Computer Science

Optimal Cost Constrained Adversarial Attacks For Multiple Agent Systems

optimal

Annals of Indian Academy of ...

Platform Abstracts

sciences : sciences du vivan...

Αnalysis by DΝΑ barcοding οf the heterοgeneοus respοnse tο anticancer drugs by different subpοpulatiοns οf lung cancer cells;Analysis by DNA barcoding of the heterogeneous response to anticancer drugs by different subpopulations of lung cancer cells

growth spécifique au subpopulations anti-cancer barcoding barcode nsclc clonal tumor cellules cancer avons cell traitement cbnpc specific lung drugs treatment population response cells réponse barcode-tracker