Dynamic Cascades with Bidirectional Bootstrapping for Spontaneous Facial Action Unit Detection - Robotics Institute Carnegie Mellon University

Dynamic Cascades with Bidirectional Bootstrapping for Spontaneous Facial Action Unit Detection

Y. Zhou, F. De la Torre, and J. Cohn
Journal Article, IEEE Transactions on Affective Computing, Vol. 2, No. 2, pp. 79 - 91, April, 2011

Abstract

A relatively unexplored problem in facial expression analysis is how to select the positive and negative samples with which to train classifiers for expression recognition. Typically, for each action unit (AU) or other expression, the peak frames are selected as positive class and the negative samples are selected from other AUs. This approach suffers from at least two drawbacks. One, because many state of the art classifiers, such as Support Vector Machines (SVMs), fail to scale well with increases in the number of training samples (e.g. for the worse case in SVM), it may be infeasible to use all potential training data. Two, it often is unclear how best to choose the positive and negative samples. If we only label the peaks as positive samples, a large imbalance will result between positive and negative samples, especially for infrequent AU. On the other hand, if all frames from onset to offset are labeled as positive, many may differ minimally or not at all from the negative class. Frames near onsets and offsets often differ little from those that precede them. In this paper, we propose Dynamic Cascades with Bidirectional Bootstrapping (DCBB) to address these issues. DCBB optimally selects positive and negative class samples in training sets. In experimental evaluations in non-posed video from the RU-FACS Database, DCBB yielded improved performance for action unit recognition relative to alternative approaches.

BibTeX

@article{Zhou-2011-120786,
author = {Y. Zhou and F. De la Torre and J. Cohn},
title = {Dynamic Cascades with Bidirectional Bootstrapping for Spontaneous Facial Action Unit Detection},
journal = {IEEE Transactions on Affective Computing},
year = {2011},
month = {April},
volume = {2},
number = {2},
pages = {79 - 91},
}