Computationally Efficient Scoring of Activity using Demographics and Connectivity of Entities - Robotics Institute Carnegie Mellon University

Computationally Efficient Scoring of Activity using Demographics and Connectivity of Entities

Artur W. Dubrawski, John K. Ostlund, Lujie Chen, and Andrew W. Moore
Journal Article, Journal on Information Technology and Management, Vol. 11, No. 2, pp. 77 - 89, June, 2010

Abstract

Consider a collection of entities, where each may have some demographic properties, and where the entities may be linked in some kind of, perhaps social, network structure. Some of these entities are “of interest”—we call them active. What is the relative likelihood of each of the other entities being active? AFDL, Activity from Demographics and Links, is an algorithm designed to answer this question in a computationally-efficient manner. AFDL is able to work with demographic data, link data (including noisy links), or both; and it is able to process very large datasets quickly. This paper describes AFDL’s feature extraction and classification algorithms, gives timing and accuracy results obtained for several datasets, and offers suggestions for its use in real-world situations.

Notes
A preliminary version of this work has been presented at the 2nd INFORMS Workshop on Artificial Intelligence and Data Mining WAID 2007, Seattle, WA, November 2007

BibTeX

@article{Dubrawski-2010-121664,
author = {Artur W. Dubrawski and John K. Ostlund and Lujie Chen and Andrew W. Moore},
title = {Computationally Efficient Scoring of Activity using Demographics and Connectivity of Entities},
journal = {Journal on Information Technology and Management},
year = {2010},
month = {June},
volume = {11},
number = {2},
pages = {77 - 89},
}