Local visual microphones: Improved sound extraction from silent video

Mohammad Amin Shabani, Laleh Samadfam, Mohammad Amin Sadeghi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Sound waves cause small vibrations in nearby objects. A few techniques exist in the literature that can extract sound from video. In this paper we study local vibration patterns at different image locations. We show that different locations in the image vibrate differently. We carefully aggregate local vibrations and produce a sound quality that improves state-of-the-art. We show that local vibrations could have a time delay because sound waves take time to travel through the air. We use this phenomenon to estimate sound direction. We also present a novel algorithm that speeds up sound extraction by two to three orders of magnitude and reaches real-time performance in a 20KHz video.

Original languageEnglish
Title of host publicationBritish Machine Vision Conference 2017, BMVC 2017
PublisherBMVA Press
ISBN (Electronic)190172560X, 9781901725605
Publication statusPublished - 2017
Externally publishedYes
Event28th British Machine Vision Conference, BMVC 2017 - London, United Kingdom
Duration: 4 Sept 20177 Sept 2017

Publication series

NameBritish Machine Vision Conference 2017, BMVC 2017

Conference

Conference28th British Machine Vision Conference, BMVC 2017
Country/TerritoryUnited Kingdom
CityLondon
Period4/09/177/09/17

Fingerprint

Dive into the research topics of 'Local visual microphones: Improved sound extraction from silent video'. Together they form a unique fingerprint.

Cite this