Query-Driven Approach to Face Clustering and Tagging
Abstract
In the era of big data, a traditional offline setting to processing image data is simply not tenable. We simply do not have the computational power to process every image with every possible tag; moreover, we will not have the manpower to clean up the potentially noisy results. In this paper, we introduce a query-driven approach to visual tagging, focusing on the application of face tagging and clustering. We integrate active learning with query-driven probabilistic databases. Rather than asking a user to provide manual labels so as to minimize the uncertainty of labels (face tags) across the entire data set, we ask the user to provide labels that minimize the uncertainty of his/her query result (e.g., “How many times did Bob and Jim appear together?”). We use a data-driven Gaussian process model of facial appearance to write the probabilistic estimates of facial identity into a probabilistic database, which can then support inference through query answering. Importantly, the database is augmented with contextual constraints (faces in the same image cannot be the same identity, while faces in the same track must be identical). Experiments on the real-world photo collections demonstrate the effectiveness of the proposed method.
BibTeX
@article{Zhang-2016-121104,author = {L. Zhang and X. Wang and D. Kalashnikov and S. Mehrotra and D. Ramanan},
title = {Query-Driven Approach to Face Clustering and Tagging},
journal = {IEEE Transactions on Image Processing},
year = {2016},
month = {October},
volume = {25},
number = {10},
pages = {4504 - 4513},
}