Unsupervised Recognition and Clustering of Speech Overlaps in Spoken Conversations

Research output: Contribution to conferencePaperpeer-review

12 Citations (Scopus)

Abstract

We are interested in understanding speech overlaps and their function in human conversations. Previous studies on speech overlaps have relied on supervised methods, small corpora and controlled conversations. The characterization of overlaps based on timing, semantic and discourse function requires an analysis over a very large feature space. In this study, the corpus of overlapped speech segments was automatically extracted from human-human spoken conversations using a large vocabulary Automatic Speech Recognizer (ASR) and a turn segmenter. Each overlap instance is automatically projected onto a high dimensional space of acoustic and lexical features. Then, we used unsupervised clustering to find the distinct and well-separated clusters in terms of acoustic and lexical features. We have evaluated recognition and clustering algorithms over a large set of real human-human spoken conversations. The clusters have been comparatively evaluated in terms of feature distributions and their contribution to the automatic classification of the clusters.

Original languageEnglish
Pages62-66
Number of pages5
Publication statusPublished - 2014
Externally publishedYes
Event2nd International Workshop on Speech, Language and Audio in Multimedia, SLAM 2014 - Penang, Malaysia
Duration: 11 Sept 201412 Sept 2014

Conference

Conference2nd International Workshop on Speech, Language and Audio in Multimedia, SLAM 2014
Country/TerritoryMalaysia
CityPenang
Period11/09/1412/09/14

Keywords

  • Discourse
  • Human Conversation
  • Language understanding
  • Overlapping Speech

Fingerprint

Dive into the research topics of 'Unsupervised Recognition and Clustering of Speech Overlaps in Spoken Conversations'. Together they form a unique fingerprint.

Cite this