Predictive RD optimized motion estimation for very low bit-rate video coding

TitlePredictive RD optimized motion estimation for very low bit-rate video coding
Publication TypeJournal Article
Year of Publication1997
AuthorsKossentini, F., Y. - W. Lee, M. J. T. Smith, and R. K. Ward
JournalSelected Areas in Communications, IEEE Journal on
Pagination1752 -1763
Date Publisheddec.
Keywordscode standards, compression performance, computation-performance tradeoffs, DCT-based video coding framework, discrete cosine transforms, dynamic vector quantization, finite state machines, finite-state machine modeling method, H.263 framework, H.263 TMN5 coder, intra-DCT, Lagrangian, linear prediction techniques, linear predictive coding, local statistics, motion estimation, motion field, motion vector search, optimisation, predictive rate-distortion, predictive RD optimized motion estimation, probabilistic models, probability, rate distortion theory, search matching measure, statistical mean, telecommunication standards, transform coding, vector quantisation, very low bit rates, very low bit-rate video coding, video coder, video coding, weighted mean

Predictive rate-distortion (RD) optimized motion estimation techniques are studied and developed for very low bit-rate video coding. Four types of predictors are studied: mean, weighted mean, median, and statistical mean. The weighted mean is obtained using conventional linear prediction techniques. The statistical mean is obtained using a finite-state machine modeling method based on dynamic vector quantization. By employing prediction, the motion vector search can then be constrained to a small area. The effective search area is reduced further by varying its size based on the local statistics of the motion field, through using a Lagrangian as the search matching measure and imposing probabilistic models during the search process. The proposed motion estimation techniques are analyzed within a simple DCT-based video coding framework, where an RD criterion is used for alternating among three coding modes for each 8 times;8 block: motion only, motion-compensated prediction and DCT, and intra-DCT. Experimental results indicate that our techniques yield very good computation-performance tradeoffs. When such techniques are applied to an RD optimized H.263 framework at very low bit rates, the resulting H.263 compliant video coder is shown to outperform the H.263 TMN5 coder in terms of compression performance and computations simultaneously


a place of mind, The University of British Columbia

Electrical and Computer Engineering
2332 Main Mall
Vancouver, BC Canada V6T 1Z4
Tel +1.604.822.2872
Fax +1.604.822.5949

Emergency Procedures | Accessibility | Contact UBC | © Copyright 2021 The University of British Columbia