Human pose estimation constrained by human structure information
7 March 2024
Proceedings Volume 13086, MIPPR 2023: Pattern Recognition and Computer Vision; 1308606 (2024) https://doi.org/10.1117/12.2690367
Event: Twelfth International Symposium on Multispectral Image Processing and Pattern Recognition (MIPPR2023), 2023, Wuhan, China
Abstract
We present a two-dimensional human pose estimation network constrained by human structure information (HSINet). HSINet fuses features at different scales and explicitly integrates human structure information to improve the precision of key point localization. The architecture comprises three modules: a feature extraction module, an encoding module, and a decoding module. The feature extraction module is based on the High-Resolution Net (HRNet) architecture. In contrast to HRNet, we remove redundant layers and use the Gated Attention Unit (GAU) to strengthen the combination of global and local features. The encoding module encodes the feature maps produced by the feature extraction module. Each feature map corresponds to one joint and is characterized by two feature vectors, one for the x axis and one for the y axis. Encoding these vectors with graph convolution introduces constraints based on human structure information. The decoding module then decodes the encoded feature maps into the coordinates of the key points. Experimental results on the COCO dataset show that the proposed method improves the precision of key point detection while reducing the number of parameters.
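The encode, constrain, decode pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no layer details, so everything below is an assumption made for illustration. In particular, the 4-joint chain skeleton `EDGES`, the use of row and column sums to form the x and y vectors, the weight-free graph mixing step with a symmetrically normalized adjacency matrix, and the soft-argmax decoder are all hypothetical stand-ins for whatever HSINet actually uses.

```python
import math

# Hypothetical 4-joint chain skeleton; the graph the paper uses for COCO
# key points is not given in the abstract.
EDGES = [(0, 1), (1, 2), (2, 3)]


def axis_vectors(heatmap):
    """Collapse an H x W map into an x-vector (length W) and a y-vector (length H)."""
    x_vec = [sum(row[c] for row in heatmap) for c in range(len(heatmap[0]))]
    y_vec = [sum(row) for row in heatmap]
    return x_vec, y_vec


def normalized_adjacency(num_joints, edges):
    """Return D^-1/2 (A + I) D^-1/2 for an undirected skeleton graph."""
    a = [[1.0 if i == j else 0.0 for j in range(num_joints)]
         for i in range(num_joints)]
    for i, j in edges:
        a[i][j] = a[j][i] = 1.0
    deg = [sum(row) for row in a]
    return [[a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(num_joints)]
            for i in range(num_joints)]


def graph_mix(adj, features):
    """One graph-convolution step without learned weights: out = adj @ features."""
    dim = len(features[0])
    return [[sum(adj[i][k] * features[k][d] for k in range(len(features)))
             for d in range(dim)] for i in range(len(adj))]


def soft_argmax(vec):
    """Decode a 1-D vector into a sub-pixel index via a softmax-weighted mean."""
    m = max(vec)
    exps = [math.exp(v - m) for v in vec]
    total = sum(exps)
    return sum(i * e for i, e in enumerate(exps)) / total


def estimate_keypoints(heatmaps, edges=EDGES):
    """Encode each joint map as (x, y) vectors, mix along the skeleton, decode."""
    adj = normalized_adjacency(len(heatmaps), edges)
    xs, ys = zip(*(axis_vectors(h) for h in heatmaps))
    xs, ys = graph_mix(adj, list(xs)), graph_mix(adj, list(ys))
    return [(soft_argmax(x), soft_argmax(y)) for x, y in zip(xs, ys)]
```

Calling `estimate_keypoints` on a list of per-joint maps returns one `(x, y)` pair per joint. Because each joint's vectors are averaged with those of its skeleton neighbours before decoding, an ambiguous response for one joint is pulled toward positions consistent with adjacent joints, which is the intuition behind a structural constraint of this kind.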
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Lamei Zou, Hao Wang, Jia Xie, Chengqing Wu, Luhan Lu, and Yichun Guo "Human pose estimation constrained by human structure information", Proc. SPIE 13086, MIPPR 2023: Pattern Recognition and Computer Vision, 1308606 (7 March 2024); https://doi.org/10.1117/12.2690367
KEYWORDS
Convolution
Pose estimation
Feature extraction
Feature fusion
Ablation
Autoregressive models
Action recognition