Multi-reference object pose indexing and 3-D modeling from video using volume feedback

TitleMulti-reference object pose indexing and 3-D modeling from video using volume feedback
Publication TypeConference Paper
Year of Publication2004
AuthorsAvanaki, A. N., B. Hamidzadeh, F. Kossentini, and R. Ward
Conference NameCircuits and Systems, 2004. ISCAS '04. Proceedings of the 2004 International Symposium on
PaginationIII-893 - III-896 Vol.3
Date Publishedmay.
Keywords3D modelling, 3D object reconstruction, 3D object tracking, content-based retrieval, feature matching, flat depth map, human head, image matching, image reconstruction, image sequences, image texture, motion estimation, multireference object pose indexing, point tracking, pose estimates, pose index, rigid object, shape-from-silhouette volume reconstruction, video sequence, video sequences, volume feedback
Abstract

A system for 3-D reconstruction of a rigid object from monocular video sequences is introduced. Initially an object pose is estimated in each image by locating similar (unknown) texture assuming flat depth map for all images. Shape-from-silhouette as stated in R. Szeliski (1993) is then applied to construct a 3-D model which is used to obtain better pose estimates using a model-based method. Before repeating the process by building a new 3-D model, pose estimates are adjusted to reduce error by maximizing a quality measure for shape-from-silhouette volume reconstruction. Translation of the object in the input sequence is compensated in two stages. The volume feedback is terminated when the updates in pose estimates become small. The final output is a pose index (the last set of pose estimates) and a 3-D model of the object. Good performance of the system is shown by experiments on a real video sequence of a human head. Our method has the following advantages: (1) No model is assumed for the object. (2) Feature points are neither detected nor tracked, thus no problematic feature matching or lengthy point tracking are required. (3) The method generates a high level pose index for the input images, these can be used for content-based retrieval. Our method can also be applied to 3-D object tracking in video.

URLhttp://dx.doi.org/10.1109/ISCAS.2004.1328891
DOI10.1109/ISCAS.2004.1328891

a place of mind, The University of British Columbia

Electrical and Computer Engineering
2332 Main Mall
Vancouver, BC Canada V6T 1Z4
Tel +1.604.822.2872
Fax +1.604.822.5949
Email:

Emergency Procedures | Accessibility | Contact UBC | © Copyright 2020 The University of British Columbia