Bangla grapheme to phoneme conversion using conditional random fields

Shammur Absar Chowdhury*, Firoj Alam, Naira Khan, Sheak R.H. Noori

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

4 Citations (Scopus)

Abstract

Integrated into handheld devices, toys, kiosks, and call centers, Text to Speech (TTS) and Speech Recognition (SR) have become widely used applications in everyday life. One of the core components of these applications is Grapheme to Phoneme (G2P) conversion: the mapping of the written form to the spoken form, i.e. of one sequence to another. In Natural Language Processing (NLP), this is typically treated as a sequence-to-sequence labeling task. The task, however, is language dependent and has primarily been implemented for English and similar resource-rich languages. In comparison, very little has been done for digitally under-resourced languages such as Bangla (ethnonym: Bangla; exonym: Bengali). The current state-of-the-art Bangla Grapheme to Phoneme conversion is limited to rule-based and lexicon-based approaches, whose development requires significant input from linguistic experts. In this paper, we propose a data-driven machine learning approach for Bangla G2P conversion. We evaluate the existing rule-based approaches and design a machine learning model using Conditional Random Fields (CRFs). To train the models, we use only character-level contextual features, because extracting hand-crafted features requires specialized knowledge. We evaluate the systems on two publicly available datasets and obtain promising results, with phoneme error rates of 1.51% and 14.88% on the CRBLP and Google pronunciation lexicons, respectively.
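The two technical ingredients named in the abstract, character-level contextual features and phoneme error rate, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the window size of 2, the `<BOS>`/`<EOS>` padding symbols, and the feature-key format are all hypothetical, and the error rate is assumed to follow the common definition (total edit distance divided by the total number of reference phonemes), which the abstract does not spell out. The sketch also assumes graphemes are already segmented into a list, with a CRF toolkit consuming one feature dictionary per grapheme.

```python
from typing import Dict, List, Sequence


def char_window_features(graphemes: Sequence[str], i: int, window: int = 2) -> Dict[str, str]:
    """Context features for position i: the grapheme itself plus its
    neighbours up to `window` positions away (hypothetical feature layout)."""
    feats = {"bias": "1"}
    for offset in range(-window, window + 1):
        j = i + offset
        if j < 0:
            value = "<BOS>"  # padding before the start of the word
        elif j >= len(graphemes):
            value = "<EOS>"  # padding after the end of the word
        else:
            value = graphemes[j]
        feats[f"g[{offset:+d}]"] = value
    return feats


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def phoneme_error_rate(refs: List[Sequence[str]], hyps: List[Sequence[str]]) -> float:
    """Assumed definition: 100 * total edit distance / total reference phonemes."""
    total_errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total_ref = sum(len(r) for r in refs)
    return 100.0 * total_errors / total_ref
```

Calling `char_window_features` once per position of a word yields a list of dictionaries, the input shape that typical CRF toolkits accept; `phoneme_error_rate` then compares predicted and reference phoneme sequences over a whole lexicon.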

Original language: English
Title of host publication: 20th International Conference of Computer and Information Technology, ICCIT 2017
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1-6
Number of pages: 6
ISBN (Electronic): 9781538611500
Publication status: Published - 2 Jul 2017
Event: 20th International Conference of Computer and Information Technology, ICCIT 2017 - Dhaka, Bangladesh
Duration: 22 Dec 2017 to 24 Dec 2017

Publication series

Name: 20th International Conference of Computer and Information Technology, ICCIT 2017
Volume: 2018-January

Conference

Conference: 20th International Conference of Computer and Information Technology, ICCIT 2017
Country/Territory: Bangladesh
City: Dhaka
Period: 22/12/17 to 24/12/17

Keywords

  • Bangla
  • Conditional Random Fields
  • Grapheme to Phoneme (G2P)
  • Pronunciation Generation
