Abstract
We are interested in understanding speech overlaps and their function in human conversations. Previous studies on speech overlaps have relied on supervised methods, small corpora and controlled conversations. The characterization of overlaps based on timing, semantic and discourse function requires an analysis over a very large feature space. In this study, the corpus of overlapped speech segments was automatically extracted from human-human spoken conversations using a large vocabulary Automatic Speech Recognizer (ASR) and a turn segmenter. Each overlap instance is automatically projected onto a high dimensional space of acoustic and lexical features. Then, we used unsupervised clustering to find the distinct and well-separated clusters in terms of acoustic and lexical features. We have evaluated recognition and clustering algorithms over a large set of real human-human spoken conversations. The clusters have been comparatively evaluated in terms of feature distributions and their contribution to the automatic classification of the clusters.
Original language | English |
---|---|
Pages | 62-66 |
Number of pages | 5 |
Publication status | Published - 2014 |
Externally published | Yes |
Event | 2nd International Workshop on Speech, Language and Audio in Multimedia, SLAM 2014 - Penang, Malaysia Duration: 11 Sept 2014 → 12 Sept 2014 |
Conference
Conference | 2nd International Workshop on Speech, Language and Audio in Multimedia, SLAM 2014 |
---|---|
Country/Territory | Malaysia |
City | Penang |
Period | 11/09/14 → 12/09/14 |
Keywords
- Discourse
- Human Conversation
- Language understanding
- Overlapping Speech