GLORIA — GEOMAR Library Ocean Research Information Access


Shuffle-invariant Network for Action Recognition in Videos

Shi, Qinghongya ; Zhang, Hong-Bo ; Li, Zhe ; [et al.]

Association for Computing Machinery (ACM) ; 2022

In: ACM Transactions on Multimedia Computing, Communications, and Applications, Vol. 18, No. 3 (2022-08-31), p. 1-18


Abstract: The local key features in video are important for improving the accuracy of human action recognition. However, most end-to-end methods focus on global feature learning from videos, while few works consider the enhancement of the local information in a feature. In this article, we discuss how to automatically enhance the ability to discriminate the local information in an action feature and improve the accuracy of action recognition. To address these problems, we assume that the critical level of each region for the action recognition task is different and will not change with the region location shuffle. We therefore propose a novel action recognition method called the shuffle-invariant network. In the proposed method, the shuffled video is generated by regular region cutting and random confusion to enhance the input data. The proposed network adopts the multitask framework, which includes one feature backbone network and three task branches: local critical feature shuffle-invariant learning, adversarial learning, and an action classification network. To enhance the local features, the feature response of each region is predicted by a local critical feature learning network. To train this network, an L1-based critical feature shuffle-invariant loss is defined to ensure that the ordered feature response list of these regions remains unchanged after region location shuffle. Then, adversarial learning is applied to eliminate the noise caused by the region shuffle. Finally, the action classification network combines these two tasks to jointly guide the training of the feature backbone network and obtain more effective action features. In the testing phase, only the action classification network is applied to identify the action category of the input video. We verify the proposed method on the HMDB51 and UCF101 action datasets. Several ablation experiments are constructed to verify the effectiveness of each module. The experimental results show that our approach achieves state-of-the-art performance.
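
The region cutting, random shuffle, and L1-based shuffle-invariant loss described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of one permutation for all frames of a video, the mean-intensity "response" that stands in for the learned local critical feature network, and the exact form of the loss are all assumptions made for illustration.

```python
import random


def cut_regions(frame, grid):
    """Split a 2D frame (list of rows) into grid x grid blocks, row-major."""
    h, w = len(frame), len(frame[0])
    rh, rw = h // grid, w // grid
    return [
        [row[gx * rw:(gx + 1) * rw] for row in frame[gy * rh:(gy + 1) * rh]]
        for gy in range(grid)
        for gx in range(grid)
    ]


def join_regions(regions, grid):
    """Inverse of cut_regions: reassemble row-major blocks into one frame."""
    frame = []
    for gy in range(grid):
        band = regions[gy * grid:(gy + 1) * grid]
        for r in range(len(band[0])):
            frame.append([v for block in band for v in block[r]])
    return frame


def shuffle_video(video, grid, rng):
    """Apply one random region permutation to every frame (an assumption).

    Output region i is taken from input region perm[i].
    """
    perm = list(range(grid * grid))
    rng.shuffle(perm)
    shuffled = []
    for frame in video:
        regions = cut_regions(frame, grid)
        shuffled.append(join_regions([regions[p] for p in perm], grid))
    return shuffled, perm


def region_mean_response(video, grid):
    """Hypothetical stand-in for the learned response: mean value per region."""
    n = grid * grid
    totals, counts = [0.0] * n, [0] * n
    for frame in video:
        for i, block in enumerate(cut_regions(frame, grid)):
            for row in block:
                totals[i] += sum(row)
                counts[i] += len(row)
    return [t / c for t, c in zip(totals, counts)]


def shuffle_invariant_l1(resp_original, resp_shuffled, perm):
    """Mean absolute difference between the responses of matching regions.

    The original responses are first reordered to the shuffled positions,
    so the loss is zero when each region keeps its response after the move.
    """
    aligned = [resp_original[p] for p in perm]
    return sum(abs(a - b) for a, b in zip(aligned, resp_shuffled)) / len(perm)
```

With a response that depends only on region content, such as the mean used here, the loss is zero for any permutation. A learned network has no such guarantee, which is presumably why the abstract introduces the loss as a training signal.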

Type of Medium: Online Resource

ISSN: 1551-6857 , 1551-6865


DOI: 10.1145/3485665

RVK: ST 325

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2022

ZDB ID: 2182650-X


Improved algorithms for non-submodular function maximization problem

Liu, Zhicheng ; Jin, Jing ; Chang, Hong ; [et al.]

Elsevier BV ; 2022

In: Theoretical Computer Science, Vol. 931 (2022-09), p. 49-55


Type of Medium: Online Resource

ISSN: 0304-3975


DOI: 10.1016/j.tcs.2022.07.029

RVK: SQ 1100

Language: English

Publisher: Elsevier BV

Publication Date: 2022

ZDB ID: 193706-6 , 1466347-8
