Enhancing Robot Perception and Interaction Through Structured Domain Knowledge - Robotics Institute Carnegie Mellon University

Master's Thesis, Tech. Report CMU-RI-TR-24-28, June 2024

Abstract

Despite advances in deep learning driven by increased computational power and large datasets, significant challenges remain. These include difficulty handling novel entities, limited mechanisms for human experts to update a model's knowledge, and a lack of interpretability, all of which must be addressed in human-centric applications such as assistive robotics. To address these issues, we propose leveraging structured information sources, such as knowledge graphs, to supply additional domain knowledge that makes deep learning models more robust and reliable. Integrating these knowledge sources through neurosymbolic architectures, which combine neural networks with symbolic reasoning, can improve model interpretability, generalization, and flexibility. This approach enables AI systems to better understand complex scenes and human actions, ultimately leading to more reliable and transparent performance in real-world scenarios. Our work highlights the potential of augmenting neural networks with additional domain knowledge. In particular, we demonstrate the benefit of this approach on two tasks: learning novel objects in a sample-efficient manner, and anticipating actions from short video contexts in a human-robot collaborative setting.

BibTeX

@mastersthesis{Bhagat-2024-141327,
author = {Sarthak Bhagat},
title = {Enhancing Robot Perception and Interaction Through Structured Domain Knowledge},
year = {2024},
month = {June},
school = {Carnegie Mellon University},
address = {Pittsburgh, PA},
number = {CMU-RI-TR-24-28},
keywords = {domain knowledge, concept learning, few-shot learning, action anticipation, video understanding, robot learning, knowledge graphs},
}