Creative Tools: In Press, In Submission, and In Progress
Abstract: It's been a while since I've had a chance to show the rest of the RI what I and my various collaborators have been working on. So this talk will be an informal and rapid-fire tour through some of the freshest results from my lab, including work that is in press, in submission, and in [...]
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Abstract: Recent advances in GPU-based parallel simulation have enabled practitioners to collect large amounts of data and train complex control policies using deep reinforcement learning (RL), on commodity GPUs. However, such successes for RL in robotics have been limited to tasks sufficiently simulated by fast rigid-body dynamics. Simulation techniques for soft bodies are comparatively several [...]
Is Data All You Need?: Large Robot Action Models and Good Old Fashioned Engineering
Abstract: Enthusiasm has been skyrocketing for humanoids based on recent advances in "end-to-end" large robot action models. Initial results are promising, and several collaborative efforts are underway to collect the needed demonstration data. But is data really all you need? Although end-to-end Large Vision, Language, Action (VLA) Models have potential to generalize and reliably solve [...]
Informative Path Planning Toward Autonomous Real-World Applications
Abstract: Gathering information from the physical world is critical for applications such as scientific research, environmental monitoring, search and rescue, defense, and disaster response. Autonomous robots provide significant advantages for information gathering, particularly in situations where human access is constrained, hazardous, or impractical. By leveraging intelligent algorithms, these robots can efficiently collect data, enhancing decision-making [...]
The New Era of Video Generation
Abstract: Traditional video production is slow, expensive, and requires specialized skills. Founded by CMU alumni, HeyGen is an AI-native video platform designed to revolutionize the video creation process by making visual storytelling accessible to all. We've successfully grown to more than 20M users, and tens of millions revenue in less than one year, with recognition [...]
Robot Safety Beyond Collision-Avoidance
Abstract: It is common to equate robot safety with “collision avoidance”, but in unstructured open-world environments, a robot’s representation of safety should be much more nuanced. For example, the household manipulator should understand that pouring coffee too fast will cause the liquid to overflow or pulling a mug too quickly from a cupboard will cause [...]
Sensing the Unseen: Dexterous Tool Manipulation Through Touch and Vision
Abstract: Dexterous tool manipulation is a dance between tool motion, deformation, and force transmission choreographed by the robot's end-effector. Take for example the use of a spatula. How should the robot reason jointly over the tool’s geometry and forces imparted to the environment through vision and touch? In this talk, I will present our recent [...]
Carnegie Mellon University
Enabling Collaboration between Creators and Generative Models
Abstract: Generative models have made visual content creation as little effort as writing a short text description. Meanwhile, these models also spark concerns among artists, designers, and photographers about job security and data ownership. This leads to many questions: Will generative models make creators’ jobs obsolete? Should creators stop sharing their work publicly? How can creators [...]
RI Seminar with Nikolay Atanasov
Physical Intelligence and Cognitive Biases Toward AI
Abstract: When will robots be able to clean my house, dishes, and take care of laundry? While we source labor primarily from automated machines in factories, the penetration of physical robots in our daily lives has been slow. What are the challenges in realizing these intelligent machines capable of human level skill? Isn’t AI advanced [...]