Neural vs Statistical Machine Translation: Revisiting the Bangla-English Language Pair

Md Arid Hasan, Firoj Alam, Shammur Absar Chowdhury, Naira Khan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

9 Citations (Scopus)

Abstract

Machine translation systems facilitate communication and access to information by breaking down language barriers. Machine translation is a well-researched area of Natural Language Processing (NLP), especially for resource-rich languages (e.g., the language pairs in the Europarl parallel corpus). Beyond these languages, there is also work on other language pairs, including the Bangla-English language pair. In the current study, we revisit both Statistical Machine Translation (SMT) and Neural Machine Translation (NMT) approaches using well-known, publicly available corpora for the Bangla-English (Bangla to English) language pair. We report how the performance of the models differs based on the data and modeling techniques; we also compare our results with those of Google's machine translation system. Our findings, across different corpora, indicate that NMT-based approaches outperform SMT systems. Our results also outperform existing baselines by a large margin.

Original language: English
Title of host publication: 2019 International Conference on Bangla Speech and Language Processing, ICBSLP 2019
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781728152417
Publication status: Published - Sept 2019
Event: 2019 International Conference on Bangla Speech and Language Processing, ICBSLP 2019 - Sylhet, Bangladesh
Duration: 27 Sept 2019 to 28 Sept 2019

Publication series

Name: 2019 International Conference on Bangla Speech and Language Processing, ICBSLP 2019

Conference

Conference: 2019 International Conference on Bangla Speech and Language Processing, ICBSLP 2019
Country/Territory: Bangladesh
City: Sylhet
Period: 27/09/19 to 28/09/19

Keywords

  • Bangla-to-English
  • Bidirectional LSTM
  • Machine Translation
  • Neural Machine Translation
  • Statistical Machine Translation
