• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Chen, Jing (Chen, Jing.) [1] (Scholars:陈静) | Chen, Deying (Chen, Deying.) [2] | Jiang, Hao (Jiang, Hao.) [3] (Scholars:江灏) | Miao, Xiren (Miao, Xiren.) [4] (Scholars:缪希仁) | Yin, Cunyi (Yin, Cunyi.) [5]

Indexed by:

EI Scopus SCIE

Abstract:

Human sensing based on the low-resolution infrared sensor is widely used in hand gestures recognition, activity recognition, intrusion detection, etc. However, the information about humans acquired by the previous human sensing system using the infrared sensor is limited. In this paper, a human pose estimation system is proposed to realize the three-dimensional skeleton information acquisition by low-resolution infrared sensors. It is a difficult task to acquire human pose estimation with more rich human information from low-resolution infrared sensors. The system leverages the 8 x 8 pixels low-resolution infrared array sensor to collect the activity data and the Kinect v2 camera to capture the three-dimensional skeleton of the human body as annotations of the infrared data. The convolutional neural network-bidirectional gated recurrent unit model with attention mechanism (CNN-BiGRU-AM) model is employed for model training to effectively extract the characteristics of the infrared data from spatial and temporal dimensions. The attention mechanism (AM) can improve the ability of the model to capture important local information. The bone joint point data predicted by the model are utilized to draw the three-dimensional skeleton diagram. The k-means clustering algorithm is applied to eliminate the outliers that affect the overall visualization effect in the prediction. The accuracy and completeness of human pose estimation are measured by the euclidean distance between the real coordinates of the bone joint points obtained by Kinect v2 camera and the coordinates predicted by the model. The proportion of the number of predictions with euclidean distance less than a threshold 20 mm is 90.151%, representing the accuracy of human pose estimation. The experimental results show that three-dimensional skeleton information can be acquired accurately by the low-resolution infrared array sensor and the subtle difference within each activity can be observed through the 3D human pose to improve the effect of activity recognition.

Keyword:

Attention mechanism Bidirectional gated recurrent unit (BiGRU) Convolutional neural network (CNN) Low-resolution infrared array sensor Skeleton-based 3D human pose estimation

Community:

  • [ 1 ] [Chen, Jing]Fuzhou Univ, Coll Elect Engn & Automat, Fuzhou 350108, Peoples R China
  • [ 2 ] [Chen, Deying]Fuzhou Univ, Coll Elect Engn & Automat, Fuzhou 350108, Peoples R China
  • [ 3 ] [Jiang, Hao]Fuzhou Univ, Coll Elect Engn & Automat, Fuzhou 350108, Peoples R China
  • [ 4 ] [Miao, Xiren]Fuzhou Univ, Coll Elect Engn & Automat, Fuzhou 350108, Peoples R China
  • [ 5 ] [Yin, Cunyi]Fuzhou Univ, Coll Elect Engn & Automat, Fuzhou 350108, Peoples R China

Reprint 's Address:

  • [Jiang, Hao]Fuzhou Univ, Coll Elect Engn & Automat, Fuzhou 350108, Peoples R China;;

Show more details

Related Keywords:

Source :

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS

ISSN: 1868-8071

Year: 2023

Issue: 5

Volume: 15

Page: 2049-2062

3 . 1

JCR@2023

3 . 1 0 0

JCR@2023

JCR Journal Grade:2

CAS Journal Grade:3

Cited Count:

WoS CC Cited Count: 2

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Online/Total:52/9996758
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1