MSR Thesis Defense
MSR Student
Robotics Institute,
Carnegie Mellon University

Vision-Language Models for Hand-Object Interaction Prediction

Rashid Auditorium - 4401 Gates and Hillman Centers

Abstract: How can we predict future interaction trajectories of human hands in a scene given high-level colloquial task specifications in the form of natural language? In this paper, we extend the classic hand trajectory prediction task to two tasks involving explicit or implicit language queries. Our proposed tasks require extensive understanding of human daily activities [...]