Sahsoh@qalb-2015 shared task: A rule-based correction method of common arabic native and non-native speakers' errors

Wajdi Zaghouani, Taha Zerrouki, Amar Balla

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

This paper describes our participation in the QALB-2015 Automatic Correction of Arabic Text shared task. We employed various tools and external resources to build a rule based correction method. Hand written linguistic rules were added by using existing lexicons and regular expressions. We handled specific errors with dedicated rules reserved for nonnative speakers. The system is simple as it does not employ any sophisticated machine learning methods and it does not correct punctuation errors. The system achieved results comparable to other approaches when the punctuation errors are ignored with an F1 of 66.9% for native speakers' data and an F1 of 31.72% for the non-native speakers' data.

Original languageEnglish
Title of host publication2nd Workshop on Arabic Natural Language Processing, ANLP 2015 - held at 53rd Annual Meeting of the Association for Computational Linguistics, ACL 2015 - Proceedings
EditorsNizar Habash, Stephan Vogel, Kareem Darwish
PublisherAssociation for Computational Linguistics (ACL)
Pages155-160
Number of pages6
ISBN (Electronic)9781941643587
Publication statusPublished - 2015
Externally publishedYes
Event2nd Workshop on Arabic Natural Language Processing, ANLP 2015 - Beijing, China
Duration: 30 Jul 2015 → …

Publication series

Name2nd Workshop on Arabic Natural Language Processing, ANLP 2015 - held at 53rd Annual Meeting of the Association for Computational Linguistics, ACL 2015 - Proceedings

Conference

Conference2nd Workshop on Arabic Natural Language Processing, ANLP 2015
Country/TerritoryChina
CityBeijing
Period30/07/15 → …

Fingerprint

Dive into the research topics of 'Sahsoh@qalb-2015 shared task: A rule-based correction method of common arabic native and non-native speakers' errors'. Together they form a unique fingerprint.

Cite this