Incremental Reconstruction of 3D Scenes from Multiple Complex Images - Robotics Institute Carnegie Mellon University

Incremental Reconstruction of 3D Scenes from Multiple Complex Images

M. Herman and Takeo Kanade
Journal Article, Artificial Intelligence, Vol. 30, No. 3, pp. 289 - 341, December, 1986

Abstract

The 3D Mosaic system is a vision system that incrementally reconstructs complex 3D scenes from a sequence of images obtained from multiple viewpoints. The system encompasses several levels of the vision process, starting with images and ending with symbolic scene descriptions. This paper describes the various components of the system, including stereo analysis, monocular analysis, and constructing and updating the scene model. In addition, the representation of the scene model is described. This model is intended for tasks such as matching, display generation, planning paths through the scene, and making other decisions about the scene environment. Examples showing how the system is used to interpret complex aerial photographs of urban scenes are presented.

Each view of the scene, which may be either a single image or a stereo pair, undergoes analysis which results in a 3D wire-frame description that represents portions of edges and vertices of objects. The model is a surface-based description constructed from the wire frames. With each successive view, the model is incrementally updated and gradually becomes more accurate and complete. Task-specific knowledge, involving block-shaped objects in an urban scene, is used to extract the wire frames and construct and update the model.

The model is represented as a graph in terms of symbolic primitives such as faces, edges, vertices, and their topology and geometry. This permits the representation of partially complete, planar-faced objects. Because incremental modifications to the model must be easy to perform, the model contains mechanisms to (1) add primitives in a manner such that constraints on geometry imposed by these additions are propagated throughout the model, and (2) modify and delete primitives if discrepancies arise between newly derived and current information. The model also contains mechanisms that permit the generation, addition, and deletion of hypotheses for parts of the scene for which there is little data.

BibTeX

@article{Herman-1986-15653,
author = {M. Herman and Takeo Kanade},
title = {Incremental Reconstruction of 3D Scenes from Multiple Complex Images},
journal = {Artificial Intelligence},
year = {1986},
month = {December},
volume = {30},
number = {3},
pages = {289 - 341},
}