Article title

Diacritic-aware Yorùbá spell checker

Identifiers
Title variants
Publication languages
EN
Abstracts
EN
Spell checking and correction is still in its infancy for the Yorùbá language; existing tools cannot be applied directly, as Yorùbá uses diacritics extensively both to distinguish phonemes and to mark tone. A model was formulated as a parallel combination of a unigram language model and a diacritic model, forming a dictionary sub-model that can be used by the error-detection and candidate-generation modules. The candidate-generation module was implemented as a reverse Levenshtein edit-distance algorithm. The system was evaluated using detection accuracy (calculated from precision and recall) and suggestion accuracy (SA) as metrics. Our experimental setups compared the performance of the component subsystems when used alone and when combined into a unified model. The detection accuracies of the different models range from 93.23% to 95.01%, and the suggestion accuracies range from 26.94% to 72.10%. The results indicate that each of the sub-models in the dictionary plays a different role.
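The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that "reverse Levenshtein" means generating every string one edit away from the input and keeping those found in the dictionary, and that the diacritic model can be approximated by an index from diacritic-stripped forms to their marked variants. The toy corpus, its counts, and all function names are hypothetical.

```python
import unicodedata
from collections import Counter, defaultdict

# Toy corpus; words and frequencies are illustrative assumptions only.
CORPUS = ["ọmọ", "ọmọ", "ilé", "ilé", "ilé", "owó", "owó", "òwò", "ọkọ"]


def nfc(word):
    """Normalize to NFC so visually identical strings compare equal."""
    return unicodedata.normalize("NFC", word)


def strip_diacritics(word):
    """Remove tone marks and under-dots by dropping combining characters."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Unigram model: word -> frequency.
UNIGRAMS = Counter(nfc(w) for w in CORPUS)

# Diacritic model (assumed form): bare skeleton -> set of marked variants.
DIACRITIC_INDEX = defaultdict(set)
for known in UNIGRAMS:
    DIACRITIC_INDEX[strip_diacritics(known)].add(known)

# Characters used for substitutions and insertions.
ALPHABET = sorted({ch for w in UNIGRAMS for ch in w})


def is_known(word):
    """Error detection: a word is flagged when it is not in the dictionary."""
    return nfc(word) in UNIGRAMS


def edits1(word):
    """All strings exactly one edit (delete, swap, replace, insert) away."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = {a + b[1:] for a, b in splits if b}
    swaps = {a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1}
    replaces = {a + c + b[1:] for a, b in splits if b for c in ALPHABET}
    inserts = {a + c + b for a, b in splits for c in ALPHABET}
    return deletes | swaps | replaces | inserts


def candidates(word):
    """Merge diacritic variants and one-edit neighbours, most frequent first."""
    word = nfc(word)
    found = set(DIACRITIC_INDEX.get(strip_diacritics(word), ()))
    found |= {e for e in edits1(word) if e in UNIGRAMS}
    found.discard(word)
    return sorted(found, key=lambda w: (-UNIGRAMS[w], w))
```

In this sketch, an input typed without diacritics (for example `owo`) is resolved mainly through the diacritic index, while an ordinary typing slip (for example an extra trailing letter) is resolved through the edit-distance step; ranking by unigram frequency is one simple way to order the suggestions.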
Keywords
EN
Publisher
Journal
Year
Volume
Pages
31–51
Physical description
Bibliography: 25 items, figures, tables.
Authors
  • Obafemi Awolowo University, Ile-Ife, Nigeria
  • Kings University, Gbongan-Osogbo Road, Ode-Omu, Nigeria
  • Obafemi Awolowo University, Ile-Ife, Nigeria
  • Obafemi Awolowo University, Ile-Ife, Nigeria
  • Obafemi Awolowo University, Ile-Ife, Nigeria
  • Obafemi Awolowo University, Ile-Ife, Nigeria
Bibliography
  • [1] Ahmed F., De Luca E., Nürnberger A.: Revised N-Gram based Automatic Spelling Correction Tool to Improve Retrieval Effectiveness, Polibits, vol. 40, pp. 39–48, 2009. doi: 10.17562/PB-40-6.
  • [2] Arndt E., Foorman B.: Second Graders as Spellers: What Types of Errors Are They Making?, Assessment for Effective Intervention, vol. 36(1), pp. 57–67, 2010. doi: 10.1177/1534508410380135.
  • [3] Asahiah F., Onífádé T., Adégùnlẹhìn A.: Spelling Error Patterns in Typed Yorùbá Text Documents, International Journal of Information Engineering and Electronic Business, vol. 6, pp. 28–30, 2020.
  • [4] Asahiah F.O., Ọdéjobí O.A., Adagunodo E.R.: Restoring Tone-Marks in Standard Yorùbá Electronic Text: Improved Model, Computer Science, vol. 18(3), pp. 301–315, 2017. doi: 10.7494/csci.2017.18.3.2128.
  • [5] Awoyale Y.: The LDC Corpus Catalog (Global Yoruba Lexical Database v. 1.0), Web, 2008. Retrieved December 04, 2020 from http://www.languagearchives.org/item/oai:www.ldc.upenn.edu:LDC2008L03.
  • [6] Balogun T.A.: An Endangered Nigerian Indigenous Language: The Case of Yorùbá Language, African Nebula, vol. 6, pp. 70–83, 2013.
  • [7] Bassil Y., Alwani M.: Context-sensitive Spelling Correction Using Google Web 1T 5-Gram Information, Computer and Information Science, vol. 5(3), 2012. doi: 10.5539/cis.v5n3p37.
  • [8] Colesnicov A.: The Roumanian spelling checker ROMSP: the project overview, Computer Science Journal of Moldova, vol. 3, pp. 40–54, 1995.
  • [9] Coulmas F.: Writing systems: An introduction to their linguistic analysis, Cambridge University Press, 2003.
  • [10] Daffern T., Mackenzie N.M., Hemmings B.: Predictors of writing success: How important are spelling, grammar and punctuation?, Australian Journal of Education, vol. 61(1), pp. 75–87, 2017. doi: 10.1177/0004944116685319.
  • [11] Enikuomehin A.O.: A computerized identification system for verb sorting and arrangement in a natural language: Case study of the Nigerian Yoruba Language, European Journal of Computer Science and Information Technology, vol. 3(1), pp. 43–52, 2015.
  • [12] Federal Government of Nigeria: Official Gazette of Federal Republic of Nigeria, Web, 2007. Retrieved March 12, 2021 from http://www.population.gov.ng/pop_figure.pdf.
  • [13] Federal Ministry of Education, Nigeria: The 1974 Revised Official Orthography for the Yoruba Language (Joint Consultative Committee on Education), The Committee, 1974.
  • [14] Hema P., Sunitha C.: Spell Checker for Non Word Error Detection: Survey, International Journal of Advanced Research in Computer Science and Software Engineering, vol. 5(3), pp. 30–38, 2015.
  • [15] Hok P.: Khmer Spell Checker, Master’s thesis, Australian National University, 2005.
  • [16] Kaur A., Singh P., Rani S.: Spell Checking and Error Correcting System for text paragraphs written in Punjabi Language using Hybrid approach, International Journal of Latest Trends in Engineering and Technology (IJLTET), vol. 2(4), pp. 156–160, 2014.
  • [17] Mishra R., Kaur N.: A Survey of Spelling Error Detection and Correction Techniques, The International Journal of Computer Trends and Technology, vol. 4(3), pp. 372–374, 2013.
  • [18] Moats L.: Spelling: Development, disability, and instruction, York Press, Baltimore, 1995.
  • [19] Naseem T.: A Hybrid Approach for Urdu Spell Checking, Ph.D. thesis, National University of Computer & Emerging Sciences, 2004.
  • [20] Padhy H.H., Mohanty S.: Designing hybrid approach Spell checker for Oriya, International Journal of Latest Trends in Engineering and Technology (IJLTET), vol. 2(4), pp. 156–159, 2013.
  • [21] Rajashekara M.S., Vadiraj M., Sachin D., Ramakanth K.P.: A Non-Word Kannada Spell Checker Using Morphological Analyzer And Dictionary Lookup Method, International Journal of Engineering Sciences and Emerging Technologies, vol. 2(2), pp. 45–52, 2012.
  • [22] Renkema J.: Improving the Quality of Governmental Documents: A Combined Academic and Professional Approach. In: W. Cheng, K.C.C. Kong (eds.), Professional Communication. Collaboration between Academics and Practitioners, pp. 173–190, Hong Kong University Press, 2008.
  • [23] Salifou L., Naroua H.: Design of A Spell Corrector For Hausa Language, International Journal of Computational Linguistics (IJCL), vol. 5(2), pp. 14–26, 2014.
  • [24] Sawyer D., Wade S., Kim J.: Spelling errors as a window on variations in phonological deficits among students with dyslexia, Annals of Dyslexia, vol. 49, pp. 135–159, 1999.
  • [25] Wells J.: Orthographic diacritics and multilingual computing, Language Problems & Language Planning, vol. 24(3), pp. 249–272, 2000. Retrieved July 12, 2010 from http://www.phon.ucl.ac.uk/home/wells/dia/diacritics-revised.htm.
Notes
Record prepared with funding from MEiN, agreement no. SONP/SP/546092/2022, under the programme "Społeczna odpowiedzialność nauki" (Social Responsibility of Science), module: Popularisation of science and promotion of sport (2022-2023).
Document type
Bibliography
YADDA identifier
bwmeta1.element.baztech-96caa422-8350-47eb-9079-a909fd566d49