PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

Two template matching approaches to Arabic, Amharic and Latin isolated characters recognition

Autorzy
Wybrane pełne teksty z tego czasopisma
Identyfikatory
Warianty tytułu
Języki publikacji
EN
Abstrakty
EN
With the establishment of commercial OCR systems for Latin text, recent research efforts have been directed at the design of recognition systems for non-Latin scripts, such as Japanese, Cyrillic, Chinese, Hindi, Tibetan, and in particular Arabic. The Unicode 4.0 standard supports 50 scripts that are used across the world, and many, such as Amharic (Ethiopic), have attracted virtually no attention from researchers. An extensive literature review reveals no papers which report on an OCR system for Amharic. This paper describes a normalised technique which can be used for recognition of isolated Arabic, Amharic and Latin characters. Two approaches are considered for identifying the characters by comparing them to a series of templates and using a signature template scheme. The degrees of similarity between pairs of Amharic, Arabic and typical Latin characters are presented in the confusion matrix, and the performance of the two approaches is compared for each of these three character sets.
Rocznik
Strony
213--232
Opis fizyczny
Bibliogr. 27 poz., rys.
Twórcy
autor
  • Centre for Computational Intelligence, De Montfort University, The Gateway, Leicester, LE1 9BH, England
autor
  • Dept. of Computing Information Systems, Uniwersity of Luton, Park Square, LU1 3JU, England
Bibliografia
  • [1] Bender M. et al.: Language in Ethiopia. London: Oxford University Press, 1976.
  • [2] Fu K. S.: Syntactic models in pattern recognition and applications. Pattern Recognition in Practice, ed. Gelsema E.S, 1980.
  • [3] Gerard A. S.: African Language Literatures: An introduction to the literary history of sub-Saharan Africa. Washington DC: Three Continents Press, Inc, 1981.
  • [4] Tappert C.C., Suen C.Y., Wakahara T.: On-line handwriting recognition - a survey. Proc. 9th ICPR Int. Conf. on Pattern Recognition ICPR 9, Rome, Italy, IEEE, New York, N.Y., USA, 1123-1132, 1988.
  • [5] El-Wakil M.S., Shoukry A. A.: On-line recognition of handwritten isolated Arabic characters. Pattern Recognition, 22(2), 97-106, 1989.
  • [6] Mori S., Suen C.Y. Yamamoto K.: Historical review of OCR research and development. Proc. IEEE 80, 1029-1058, 1992.
  • [7] Al-Yousefi H., Upda S.S.: Recognition of Arabic characters. IEEE Trans. on PAMI, 14(8), 853-857, 1992.
  • [8] Abuhaiba I.S.I., Mahmoud S.A., Green R.J.: Recognition of handwritten cursive Arabic characters. IEEE Trans. on PAMI, 16(6), 664-672, 1994.
  • [9] Amin A., Al-Sadoun H.B.: Hand printed Arabic character recognition system. Proc. 12th IAPR Int. Conf. on Pattern Recognition, volume 2, 1994.
  • [10] Al-Badr B., Mahmoud S.A.: Survey and bibliography of Arabic optical text recognition. Signal Processing, 41(1), 49-77, 1995.
  • [11] Cowell J.: Syntactic pattern recognizer for vehicle identification numbers. Image and Vision Computing, 13(1), 13-19, 1995.
  • [12] Encyclopaedia Britannica CD2000. www.britannica.co.uk. ; 1997
  • [13] Amin A.: Off-line Arabic character recognition - the state of the art [review]. Pattern Recognition, 31(5), 517-530, 1998.
  • [14] Bushofa B. M. F., Spann M.: Segmentation and recognition of Arabic characters by structural classification. Image and Vision Computing, 15(3), 167-179, 1998.
  • [15] Cowell J., Hussain F.: A Multi-Stage Algorithm for Character Recognition. Proc.: GITIS 2000 Conf., Chamber of Commerce & Industry, Dubai, UAE, 140-146, 2000.
  • [16] Cowell J., and Hussain F.: The Confusion Matrix - identifying Conflicts in Arabic and Latin Character Recognition. Proc. CGIM 2000, Las Vegas, 2000.
  • [17] Hussain F., Cowell J.: Character recognition of Arabic and Latin Script. Proc. IV2000 Conf., London, 2000.
  • [18] Hussain F., Cowell J.: A Generic Recognition Algorithm for Latin and Arabic Scripts. 3 Workshop on Information & Computer Science, KFUPM , Saudi Arabia, Oct., 2000
  • [19] Plamondon R., Srihari S.N.: On-line and off-line handwriting recognition: a comprehensive survey. IEEE Trans. on PAMI, 22(1), 63-84, 2000.
  • [20] Cowell J., Hussain F .: Resolving Conflicts in Arabic and Latin Character Recognition. EG2001, UCL London, 2001.
  • [21] Cowell J., Hussain F.: Extracting Features from Arabic Characters. Proc. CGIM2001, Hawaii, 2001.
  • [22] Kinser J.: Image signatures: Ontology and classification. Proc. IASTED Computer Graphics and Imaging Conf., CGIM2001, Hawaii, USA, 2001.
  • [23] Yaregal A.: Optical Character Recognition of Amharic Text: an Integrated Approach. (Masters thesis) School of Information studies for Africa, Addis Ababa University, Addis Ababa, 2001.
  • [24] Hussain F., Cowell J.: A fast signature based algorithm for recognition of isolated Arabic characters. IASTED Conf. on Visualisation, Imaging and Image Processing, VIIP September, Malaga, Spain, 2002.
  • [25] Xi J., Hu J., Wu L.: Page segmentation of Chinese newspapers. Pattern Recognition, 35(12), 2695-2704, 2002.
  • [26] The Unicode Consortium : The Unicode Standard, Version 4.0.0, defined by: The Unicode Standard, Version 4.0, Boston, MA , Addison-Wesley, 2003. ISBN 0-321-18578-1
  • [27] Zhao S., Chi Z., Shi P., and Yan H.: Two-stage segmentation of unconstrained handwritten Chinese characters. Pattern Recognition, 36(1), 145-156, 2003.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BWA1-0011-0013
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.