Violent Scene Detection Using a Super Descriptor Tensor Decomposition

Muhammad Rizwan Khokher, Abdesselam Bouzerdoum, Son Lam Phung

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Citations (Scopus)

Abstract

This article presents a new method for violent scene detection using super descriptor tensor decomposition. Multi-modal local features comprising auditory and visual features are extracted from Mel-frequency cepstral coefficients (including first and second order derivatives) and refined dense trajectories. There is usually a large number of dense trajectories extracted from a video sequence; some of these trajectories are unnecessary and can affect the accuracy. We propose to refine the dense trajectories by selecting only discriminative trajectories in the region of interest. Visual descriptors consisting of oriented gradient and motion boundary histograms are computed along the refined dense trajectories. In traditional bag-of-visual-words techniques, the feature descriptors are concatenated to form a single large feature vector for classification. This destroys the spatio-Temporal interactions among features extracted from multi-modal data. To address this problem, a super descriptor tensor decomposition is proposed. The extracted feature descriptors are first encoded using super descriptor vector method. Then the encoded features are arranged as tensors so as to retain the spatio-Temporal structure of the features. To obtain a compact set of features for classification, the TUCKER-3 decomposition is applied to the super descriptor tensors, followed by feature selection using Fisher feature ranking. The obtained features are fed to a support vector machine classifier. Experimental evaluation is performed on violence detection benchmark dataset, MediaEval VSD2014. The proposed method outperforms most of the state-of-The-Art methods, achieving MAP2014 scores of 60.2% and 67.8% on two subsets of the dataset.

Original languageEnglish
Title of host publication2015 International Conference on Digital Image Computing
Subtitle of host publicationTechniques and Applications, DICTA 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781467367950
DOIs
Publication statusPublished - 2015
Externally publishedYes
EventInternational Conference on Digital Image Computing: Techniques and Applications, DICTA 2015 - Adelaide, Australia
Duration: 23 Nov 201525 Nov 2015

Publication series

Name2015 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2015

Conference

ConferenceInternational Conference on Digital Image Computing: Techniques and Applications, DICTA 2015
Country/TerritoryAustralia
CityAdelaide
Period23/11/1525/11/15

Keywords

  • Refined dense trajectories
  • Super descriptor vector
  • Support vector machines
  • Tensor decomposition
  • Violent scene detection

Fingerprint

Dive into the research topics of 'Violent Scene Detection Using a Super Descriptor Tensor Decomposition'. Together they form a unique fingerprint.

Cite this