Classification in Very High Dimensional Problems with Handfuls of Examples - Robotics Institute Carnegie Mellon University

Classification in Very High Dimensional Problems with Handfuls of Examples

Conference Paper, Proceedings of Knowledge Discovery in Databases (PKDD '07), pp. 212 - 223, September, 2007

Abstract

Modern classification techniques perform well when the number of training examples exceed the number of features. If, however, the number of features greatly exceed the number of training examples, then these same techniques can fail. To address this problem, we present a hierarchical Bayesian framework that shares information between features by modeling similarities between their parameters. We believe this approach is applicable to many sparse, high dimensional problems and especially relevant to those with both spatial and temporal components. One such problem is fMRI time series, and we present a case study that shows how we can successfully classify in this domain with 80,000 original features and only 2 training examples per class.

BibTeX

@conference{Palatucci-2007-9807,
author = {Mark Palatucci and Tom Mitchell},
title = {Classification in Very High Dimensional Problems with Handfuls of Examples},
booktitle = {Proceedings of Knowledge Discovery in Databases (PKDD '07)},
year = {2007},
month = {September},
pages = {212 - 223},
publisher = {Springer-Verlag},
keywords = {classification, fmri, hierarchical bayes},
}