Porównanie modeli LeNet-5, AlexNet i GoogLeNet w rozpoznawaniu pisma ręcznego

Michalski, Bartosz; Plechawska-Wójcik, Małgorzata

doi:10.35784/jcsi.2919

Artykuł - szczegóły

Tytuł artykułu

Porównanie modeli LeNet-5, AlexNet i GoogLeNet w rozpoznawaniu pisma ręcznego

Autorzy

Michalski Bartosz , Plechawska-Wójcik Małgorzata

Treść / Zawartość

Pełne teksty:

2919-Article Text-13064-1-10-20220610.pdf

Pobierz

Identyfikatory

DOI

10.35784/jcsi.2919

Warianty tytułu

Comparison of LeNet-5, AlexNet and GoogLeNet models in handwriting recognition

Języki publikacji

Abstrakty

Celem badania było porównanie dokładności rozpoznawania pisma odręcznego oraz czasu potrzebnego na klasyfikację danych ze zbiorów testowych. Do badań wykorzystano architektury Lenet-5, AlexNet i GoogLeNet. Wszystkie wybra-ne architektury są modelami konwolucyjnych sieci neuronowych. Badania przeprowadzono z wykorzystaniem baz obrazów odręczenie pisanych cyfr MNIST i odręcznie pisanych liter EMNIST. Po wykonaniu badań stwierdzono, że największą dokładnością wykazał się model GoogLeNet, a najmniejszą LeNet-5. Natomiast najmniej czasu na wykona-nie zadania potrzebował model LeNet-5, a najwięcej GoogLeNet. Na podstawie otrzymanych wyników stwierdzono, że zwiększanie złożoności modelu wpływa pozytywnie na dokładność klasyfikacji obiektów, ale znacznie zwiększa zapo-trzebowanie na zasoby komputera.

The aim of the study was to compare the accuracy of handwriting recognition and the time needed to classify data from the test sets. The Lenet-5, AlexNet and GoogLeNet architectures were used for the research. All selected architectures are models of convolutional neural networks. The research was carried out with the use of image databases, handwritten digits MNIST and handwritten letters EMNIST. After the tests, it was found that the GoogLeNet model showed the highest accuracy, and the LeNet-5 the lowest. However, the LeNet-5 model needed the least time to complete the task, and GoogLeNet the most. On the basis of the obtained results, it was found that increasing the complexity of the model positively influences the accuracy of object classification, but significantly increases the demand for computer re-sources.

Słowa kluczowe

konwolucyjne sieci neuronowe klasyfikacja pisma odręcznego

convolutional neural networks handwriting classification

Wydawca

Wydawnictwo Politechniki Lubelskiej

Czasopismo

Journal of Computer Sciences Institute

Rocznik

2022

Tom

Vol. 23

Strony

145--151

Opis fizyczny

Bibliogr. 19 poz., rys., tab.

Twórcy

autor

Michalski Bartosz

Department of Computer Science, Lublin University of Technology Poland

autor

Plechawska-Wójcik Małgorzata

m.plechawska@pollub.pl

Department of Computer Science, Lublin University of Technology Poland

Bibliografia

1. D. O. Hebb, The organisation of behaviour: a neuropsychological theory. New York: Science Editions (1949).
2. F. Rosenblatt, The perceptron: a probabilistic model for information storage and organization in the brain. Psychological review 65(6) (1958) 386.
3. Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard., W. Hubbard, L. D. Jackel, Backpropagation applied to handwritten zip code recognition. Neural computation 1(4) (1989) 541-551.
4. O. Russakovsky, J. Deng, H. Su, et al. ImageNet Large Scale Visual Recognition Challenge. Int J Comput Vis 115 (2015) 211–252. https://doi.org/10.1007/s11263-015-0816-y
5. Ü. Budak, A. Şengür, U. Halici, Deep convolutional neural networks for airport detection in remote sensing images. 26th Signal Processing and Communications Applications Conference (SIU) (2018) 1-4, doi: 10.1109/SIU.2018.8404195.
6. M. J. Aitkenhead, A. J. S. McDonald. A neural network face recognition system. Engineering Applications of Artificial Intelligence 16(3) (2003) 167-176.
7. D. S. Maitra, U. Bhattacharya, S. K. Parui, CNN based common approach to handwritten character recognition of multiple scripts. 13th International Conference on Document Analysis and Recognition (ICDAR) (2015) 1021-1025, doi: 10.1109/ICDAR.2015.7333916.
8. K. Nygren, Stock prediction–a neural network approach. Royal Instiute of Technology (2004) 1-34.
9. S. S. Baboo, I. K. Shereef, An efficient weather forecasting system using artificial neural network. International journal of environmental science and development 1(4) (2010) 321.
10. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, L. Fei-Fei, Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition (2009) 248–255.
11. Y. LeCun, C. Cortes, The MNIST database of handwritten digits (2005).
12. G. Cohen, S. Afshar, J. Tapson, A. van Schaik, EMNIST: an extension of MNIST to handwritten letters (2017) arXiv:1702.05373.
13. Y. Lecun, L. Bottou, Y. Bengio and P. Haffner, Gradient-based learning applied to document recognition in Proceedings of the IEEE 86(11) (1998) 2278-2324, doi: 10.1109/5.726791.
14. Krizhevsky, I. Sutskever, G. E. Hinton, Imagenet classification with deep convolutional neural networks. In F. Pereira, C. Burges, L. Bottou, K. Weinberger, eds., Advances in Neural Information Processing Systems 25. Curran Associates (2012) 1097–1105. arXiv:1803.01164
15. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, Going Deeper with Convolutions (2014) arXiv:1409.4842.
16. K. O'Shea, R. Nash, An introduction to convolutional neural networks (2015) arXiv preprint arXiv:1511.08458.
17. Grother, P. J, NIST special database 19. Handprinted forms and characters database, National Institute of Standards and Technology (1995).
18. W. S. McCulloch, W. Pitts, A logical calculus of the ideas immanent in nervous activity. The bulletin of mathematical biophysics 5(4) (1943) 115-133.
19. E. Lukasik, M. Charytanowicz, M. Milosz, M. Tokovarov, M. Kaczorowska, D. Czerwinski, T. Zientarski, Recognition of handwritten Latin characters with diacritics using CNN. Bulletin of the Polish Academy of Sciences. Technical Sciences 69(1) (2021).

Uwagi

Opracowanie rekordu ze środków MEiN, umowa nr SONP/SP/546092/2022 w ramach programu "Społeczna odpowiedzialność nauki" - moduł: Popularyzacja nauki i promocja sportu (2022-2023).

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-b4b3b57d-8ce3-4e7f-a765-44d6c5333486