Part-Based Feature Aggregation Method for Dynamic Scene Recognition

Xiaoming Peng, Abdesselam Bouzerdoum

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

Existing methods for dynamic scene recognition mostly use global features extracted from the entire video frame or a video segment. In this paper, a part-based method is proposed for aggregating local features from multiple video frames. A pre-trained Fast R-CNN model is used to extract local convolutional layer features from the regions of interest (ROIs) of training images. These features are then clustered to locate representative parts. A set cover problem is formulated to select the discriminative parts, which are further refined by fine-tuning the Fast R-CNN. Local convolutional layer features and fully-connected layer features are extracted using the fine-tuned Fast R-CNN model, and then aggregated separately from a video segment to form two feature representations. They are concatenated into a global feature representation. Experimental results show that the proposed method outperforms several state-of-the-art features on two dynamic scene datasets.

Original languageEnglish
Title of host publication2019 Digital Image Computing
Subtitle of host publicationTechniques and Applications, DICTA 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728138572
DOIs
Publication statusPublished - Dec 2019
Event2019 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2019 - Perth, Australia
Duration: 2 Dec 20194 Dec 2019

Publication series

Name2019 Digital Image Computing: Techniques and Applications, DICTA 2019

Conference

Conference2019 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2019
Country/TerritoryAustralia
CityPerth
Period2/12/194/12/19

Keywords

  • deep neural networks
  • dynamic scene recognition
  • feature aggregation
  • video classification

Fingerprint

Dive into the research topics of 'Part-Based Feature Aggregation Method for Dynamic Scene Recognition'. Together they form a unique fingerprint.

Cite this