D-PAttNet: Dynamic patch-attentive deep network for action unit detection
Abstract
Facial action units (AUs) relate to specific local facial regions. Recent efforts in automated AU detection have focused on learning facial patch representations to detect specific AUs. These efforts have encountered three hurdles. First, they implicitly assume that facial patches are robust to head rotation; yet non-frontal rotation is common. Second, mappings between AUs and patches are defined a priori, which ignores co-occurrences among AUs. And third, the dynamics of AUs are either ignored or modeled sequentially rather than simultaneously as in human perception. Inspired by recent advances in human perception, we propose a dynamic patch-attentive deep network, called D-PAttNet, for AU detection that (i) controls for 3D head and face rotation, (ii) learns mappings of patches to AUs, and (iii) models spatiotemporal dynamics. The D-PAttNet approach significantly improves upon the existing state of the art.
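To make the three components concrete, the following is a minimal sketch in PyTorch of a patch-attentive network with temporal modeling. It is not the authors' implementation: the 3x3 patch grid, layer sizes, GRU-based temporal model, and the class name PatchAttentionNet are all illustrative assumptions, and pose normalization is assumed to have been applied to the input frames beforehand.

    # Minimal sketch (not the authors' code) of a patch-attentive network
    # with temporal modeling. Assumes pose-normalized face crops split into
    # a 3x3 grid of patches; all layer sizes are illustrative.
    import torch
    import torch.nn as nn

    class PatchAttentionNet(nn.Module):
        def __init__(self, num_aus=12, grid=3, feat_dim=64):
            super().__init__()
            self.grid = grid
            # Shared CNN encoder applied to each facial patch independently.
            self.encoder = nn.Sequential(
                nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
                nn.Conv2d(32, feat_dim, 3, stride=2, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            )
            # Learned attention scores let the network weight patches per
            # image, rather than fixing a patch-to-AU mapping a priori.
            self.attn = nn.Linear(feat_dim, 1)
            # GRU captures dynamics across video frames.
            self.gru = nn.GRU(feat_dim, feat_dim, batch_first=True)
            self.head = nn.Linear(feat_dim, num_aus)

        def forward(self, frames):
            # frames: (batch, time, 3, H, W), assumed pose-normalized.
            b, t, c, h, w = frames.shape
            g = self.grid
            # Carve each frame into a g x g grid of non-overlapping patches.
            patches = frames.unfold(3, h // g, h // g).unfold(4, w // g, w // g)
            patches = patches.permute(0, 1, 3, 4, 2, 5, 6).reshape(-1, c, h // g, w // g)
            feats = self.encoder(patches).view(b * t, g * g, -1)  # per-patch features
            weights = torch.softmax(self.attn(feats), dim=1)      # patch attention
            pooled = (weights * feats).sum(dim=1).view(b, t, -1)  # attended summary
            out, _ = self.gru(pooled)                             # temporal dynamics
            return self.head(out)                                 # per-frame AU logits

A forward pass on a dummy clip, e.g. PatchAttentionNet()(torch.randn(2, 8, 3, 96, 96)), yields per-frame AU logits of shape (2, 8, 12), one score per AU per frame.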
BibTeX
@article{Ertugrul-2019-119647,
  author  = {Itir Onal Ertugrul and Le Yang and Laszlo A. Jeni and Jeffrey F. Cohn},
  title   = {D-PAttNet: Dynamic patch-attentive deep network for action unit detection},
  journal = {Frontiers in Computer Science},
  year    = {2019},
  month   = {November},
  volume  = {1},
  number  = {11},
}