Affective computing is a key research topic in artificial intelligence which is applied to psychology and machines. It consists of the estimation and measurement of human emotions. A person’s body language is one of the most significant sources of information during job interview, and it reflects a deep psychological state that is often missing from other data sources. In our work, we combine two tasks of pose estimation and emotion classification for emotional body gesture recognition to propose a deep multi-stage architecture that is able to deal with both tasks. Our deep pose decoding method detects and tracks the candidate’s skeleton in a video using a combination of depthwise convolutional network and detection-based method for 2D pose reconstruction. Moreover, we propose a representation technique based on the superposition of skeletons to generate for each video sequence a single image synthesizing the different poses of the subject. We call this image: ‘history pose image’, and it is used as input to the convolutional neural network model based on the Visual Geometry Group architecture. We demonstrate the effectiveness of our method in comparison with other methods in the state of the art on the standard Common Object in Context keypoint dataset and Face and Body gesture video database.

Khalifa, I., Ejbali, R., Schettini, R., Zaied, M. (2022). Deep Multi-Stage Approach For Emotional Body Gesture Recognition In Job Interview. COMPUTER JOURNAL, 65(7), 1702-1716 [10.1093/comjnl/bxab011].

Deep Multi-Stage Approach For Emotional Body Gesture Recognition In Job Interview

Khalifa I.
;
Schettini R.;
2022

Abstract

Affective computing is a key research topic in artificial intelligence which is applied to psychology and machines. It consists of the estimation and measurement of human emotions. A person’s body language is one of the most significant sources of information during job interview, and it reflects a deep psychological state that is often missing from other data sources. In our work, we combine two tasks of pose estimation and emotion classification for emotional body gesture recognition to propose a deep multi-stage architecture that is able to deal with both tasks. Our deep pose decoding method detects and tracks the candidate’s skeleton in a video using a combination of depthwise convolutional network and detection-based method for 2D pose reconstruction. Moreover, we propose a representation technique based on the superposition of skeletons to generate for each video sequence a single image synthesizing the different poses of the subject. We call this image: ‘history pose image’, and it is used as input to the convolutional neural network model based on the Visual Geometry Group architecture. We demonstrate the effectiveness of our method in comparison with other methods in the state of the art on the standard Common Object in Context keypoint dataset and Face and Body gesture video database.
Articolo in rivista - Review Essay
deep multi-stage architecture; depthwise convolutional network; emotional body gesture recognition; heat map; history pose image; pose estimation;
English
19-apr-2021
2022
65
7
1702
1716
none
Khalifa, I., Ejbali, R., Schettini, R., Zaied, M. (2022). Deep Multi-Stage Approach For Emotional Body Gesture Recognition In Job Interview. COMPUTER JOURNAL, 65(7), 1702-1716 [10.1093/comjnl/bxab011].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/470940
Citazioni
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 0
Social impact