VASC Seminar
Learning Single-view 3D Reconstruction of Objects and Scenes
Abstract: In this talk, I will discuss the task of inferring 3D structure underlying an image, in particular focusing on two questions - a) how we can plausibly obtain supervisory signal for this task, and b) what forms of representation should we pursue. I will first show that we can leverage image-based supervision to learn [...]
Neuromorphic Event-based time oriented vision and Computation
Abstract: There has been significant research over the past two decades in developing new systems for spiking neural computation. The impact of neuromorphic concepts on recent developments in optical sensing, display and artificial vision is presented. State-of-the-art image sensors suffer from severe limitations imposed by their very principle of operation. These sensors acquire the visual [...]
Towards Goal-Driven Visually Grounded Dialog Agents
Abstract: Communication between human users and artificial intelligences is essential for human-AI cooperative tasks. For these collaborations to extend into real environments, artificial agents must be able to perceive their environment (visually, aurally, tactilely, etc.) and to communicate with humans about it in order to accomplish mutual goals. For example, a user might talk with [...]
On challenges in image generation
Abstract: Recent work has shown impressive success in automatically synthesizing new images with desired properties such as transferring painterly style, modifying facial expressions, increasing image resolution or manipulating the center of attention of the image. In this talk I will discuss two of the standing challenges in image synthesis and how we tackle them: - [...]
Learning Common Sense: a Grand Challenge for Academic AI Research
Abstract: In a world where Google, Facebook, and others possess massive proprietary data sets, and unprecedented computational power---how is a graduate student to make a dent in the universe? I’ll address this conundrum by re-visiting one of the holy grails of AI: acquiring, representing, and utilizing common-sense knowledge. Can we leverage modern methods including deep [...]
Multimodal, multilevel analysis of human behavior
Abstract: Computer analysis of human behavior is an interdisciplinary endeavor combining sensing technology, theoretical and empirical models of human behavior, pattern recognition and machine learning algorithms, and interaction sciences. The applications in this area range widely, from robotics to healthcare, from smart environments to multimedia, from security to humanitarian response. While human behaviors span different [...]
Object Detection and Tracking on Low Resolution Aerial Images
Abstract: Object tracking from an aerial platform poses a number of unique challenges including the small number of pixels representing the objects, large camera motion, and low temporal resolution. Because of these unique reasons, low resolution aerial image analysis needs to be tackled differently than the traditional image analysis both in terms of the sensors, [...]
Data-Driven Learning Towards Perceptual Organization
Abstract: Computer vision has advanced rapidly with deep learning, achieving above human performance on some classification benchmarks. At the core of the state-of-the-art approaches for image classification, object detection, and semantic/instance segmentation is sliding window classification, engineered for computational efficiency. Such piecemeal analysis of visual perception often has trouble getting details right and fails miserably [...]
Deep Representation Learning with Induced Structural Priors
Abstract: With the support of big-data and big-compute, deep learning has reshaped the landscape of research and applications in artificial intelligence. Whilst traditional hand-guided feature engineering in many cases is simplified, the deep network architectures become increasingly more complex. A central question is if we can distill the minimal set of structural priors that can [...]
Lifelong Learning via Curiosity and Self-supervision
Abstract: Humans demonstrate remarkable ability to generalize their knowledge and skills to new unseen scenarios. One of the primary reasons is that they are continually learning by acting in the environment and adapting to novel circumstances. This is in sharp contrast to our current machine learning algorithms which are incredibly narrow in only performing the [...]