Loading view.
Seminar
Learning Visual, Audio, and Cross-Modal Correspondences
Newell-Simon Hall 3305Abstract: Today's machine perception systems rely heavily on supervision provided by humans, such as labels and natural language. I will talk about our efforts to make systems that, instead, learn from two ubiquitous sources of unlabeled data: visual motion and cross-modal sensory associations. I will begin by discussing our work on creating unified models for [...]