Facial action units recognition by de-expression residue learning

Jun He; Xiaocui Yu; Bo Sun; Yongkang Xiao

doi:10.1117/12.2539053

18 November 2019 Facial action units recognition by de-expression residue learning

Jun He, Xiaocui Yu, Bo Sun, Yongkang Xiao

Proceedings Volume 11187, Optoelectronic Imaging and Multimedia Technology VI; 1118719 (2019) https://doi.org/10.1117/12.2539053
Event: SPIE/COS Photonics Asia, 2019, Hangzhou, China

Abstract

Understanding human facial expressions is one of the key steps to achieving human-computer interaction. However, the facial expression is a combination of an expressive component called facial behavior and a neutral component of a person. The most commonly used taxonomy to describe facial behaviors is the Facial Action Coding System (FACS). FACS segments the visible effects of facial muscle activation into 30+ action units (AUs). So, we introduce a method to recognize AUs by extracting information of the expressive component through a de-expression learning procedure, called De-expression Residue Learning (DeRL). Firstly, we train a Generative Adversarial Network named cGAN to filter out the expressive information and generate the corresponding neutral face image. Then, we use the intermediate layers, which contains the action unit information, to recognition AUs. Our work alleviates problems of AUs recognition based on the pixel level difference, which is unreliable due to the variation between images i.e., rotation, translation and lighting condition changes, or the feature level difference, which is also unstable as the expression information may vary according to the identity information. As for experiments, we use the data augmentation method to avoid overfitting and trained deep network to recognition AUs on CK+ datasets. The results reveal that our work achieves more competitive performance than several other popular approaches.

Citation Download Citation

Jun He, Xiaocui Yu, Bo Sun, and Yongkang Xiao "Facial action units recognition by de-expression residue learning", Proc. SPIE 11187, Optoelectronic Imaging and Multimedia Technology VI, 1118719 (18 November 2019); https://doi.org/10.1117/12.2539053

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
7 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

CITATIONS

Cited by 1 scholarly publication.

Explore citations on Lens.org

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Databases

Image processing

Computer vision technology

RELATED CONTENT

Automated detection of retinal landmarks for the identification of clinically...
Proceedings of SPIE (March 21 2016)

Scene sketch generation using mixture of gradient kernels and adaptive...
Proceedings of SPIE (April 20 2016)

Medical image retrieval based on mutual correlation method
Proceedings of SPIE (August 07 2001)

Recognition of 3-D Scene with Partially Occluded Objects
Proceedings of SPIE (March 27 1987)

Using constraints to incorporate domain knowledge
Proceedings of SPIE (February 01 1992)

Explaining scene composition using kinematic chains of humans application...
Proceedings of SPIE (March 08 2011)

Content-based image retrieval using color features of partitioned images
Proceedings of SPIE (September 30 2011)

Subscribe to Digital Library

Receive Erratum Email Alert

Keywords/Phrases

Search In:

Publication Years