QCRI's live speech translation system

Fahim Imaduddin Dalvi, Yifan Zhang, Sameer Khurana, Nadir Khan Durrani, Hassan Sajjad, Ahmed Abdelali, Hamdy Mubarak, Ahmed Ali, Stephan Vogel

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

Abstract

This paper presents QCRI’s Arabic-to-English live speech translation system. It features modern web technologies to capture live audio, and broadcasts Arabic transcriptions and English translations simultaneously. Our Kaldi-based ASR system uses the Time Delay Neural Network (TDNN) architecture, while our Machine Translation (MT) system uses both phrase-based and neural frameworks. Although our neural MT system is slower than the phrase-based system, it produces significantly better translations and is memory efficient.
Original language: English
Title of host publication: Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics
Publication status: Published - 2017
