Fast video segmentation using encoding cost data

Ricardo L. de Queiroz, Gozde Bozdagi, Husrev Taha Sencar

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper presents a simple and effective pre-processing method, developed for the segmentation of MPEG compressed video sequences. The proposed method for scene-cut detection only involves computing the number of bits spent for each frame (encoding cost data), thus avoiding decoding the bitstream. The information is separated into I-, P-, B- frames, thus forming 3 vectors, which are independently processed by a new peak detection algorithm, based on overcomplete filter banks and on joint thresholding, using a confidence number. Each processed vector yields a set of candidate frame numbers, i.e., 'hints' of positions where scene-cuts may have occurred. The 'hints' for all frame types are recombined into one frame sequence and clustered into scene cuts. The algorithm was not designed to distinguish among types of cuts, but rather to indicate its position and duration. Experimental results show that the proposed algorithm is effective in detecting abrupt scene changes, as well as gradual transitions. For precision- demanding applications, the algorithm can be used with a low confidence factor, just to select the frames, which are worth being investigated by a more complex algorithm. The algorithm is not particularly tailored to MPEG and can be applied to most video compression techniques.
Original languageEnglish
Title of host publicationProceedings of SPIE - The International Society for Optical Engineering
Pages699-709
Number of pages11
Volume3656
Publication statusPublished - Dec 1998
Externally publishedYes
EventProceedings of the 1999 7th Conference of the Storage and Retrieval for Image and Video Databases VII - San Jose, Ca, USA
Duration: 26 Jan 199929 Jan 1999

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
PublisherSPIE
ISSN (Print)0277-786X

Conference

ConferenceProceedings of the 1999 7th Conference of the Storage and Retrieval for Image and Video Databases VII
CitySan Jose, Ca, USA
Period26/01/9929/01/99

Fingerprint

Dive into the research topics of 'Fast video segmentation using encoding cost data'. Together they form a unique fingerprint.

Cite this