Konuşma ve müzi̇k i̇çeren sesleri̇n ayriştirilmasi

Translated title of the contribution: Classification of audios containing speech and music

Erkam Uzun*, Hüsrev Taha Sencar

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

We propose an automated technique that uses perceptual and non-perceptual audio quality measures for discrimination of speech and music signals with high accuracy. Deployed audio quality measures used for characterization of audio are obtained via de-noising of the original audio. The underlying idea of the approach is that de-noising operation affects speech and music signals in a different and consistent manner and these differences can be captured by the audio quality metrics. Obtained quality measures are then used in conjunction with a machine learning classifier to statistically model speech and music signals. To determine the accuracy of the proposed method, tests have been performed on different datasets with and without audio compression.

Translated title of the contributionClassification of audios containing speech and music
Original languageTurkish
Title of host publication2012 20th Signal Processing and Communications Applications Conference, SIU 2012, Proceedings
DOIs
Publication statusPublished - 2012
Externally publishedYes
Event2012 20th Signal Processing and Communications Applications Conference, SIU 2012 - Fethiye, Mugla, Turkey
Duration: 18 Apr 201220 Apr 2012

Publication series

Name2012 20th Signal Processing and Communications Applications Conference, SIU 2012, Proceedings

Conference

Conference2012 20th Signal Processing and Communications Applications Conference, SIU 2012
Country/TerritoryTurkey
CityFethiye, Mugla
Period18/04/1220/04/12

Fingerprint

Dive into the research topics of 'Classification of audios containing speech and music'. Together they form a unique fingerprint.

Cite this