PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

Voice Command Recognition Using Hybrid Genetic Algorithm

Wybrane pełne teksty z tego czasopisma
Identyfikatory
Warianty tytułu
Języki publikacji
EN
Abstrakty
EN
Speech recognition is a process of converting the acoustic signal into a set of words, whereas voice command recognition consists in the correct identification of voice commands, usually single words. Voice command recognition systems are widely used in the military, control systems, electronic devices, such as cellular phones, or by people with disabilities (e.g., for controlling a wheelchair or operating a computer system). This paper describes the construction of a model for a voice command recognition system based on the combination of genetic algorithms (GAs) and K-nearest neighbour classifier (KNN). The model consists of two parts. The first one concerns the creation of feature patterns from spoken words. This is done by means of the discrete Fourier transform and frequency analysis. The second part constitutes the essence of the model, namely the design of the supervised learning and classification system. The technique used for the classification task is based on the simplest classifier – K-nearest neighbour algorithm. GAs, which have been demonstrated as a good optimization and machine learning technique, are applied to the feature extraction process for the pattern vectors. The purpose and main interest of this work is to adapt such a hybrid approach to the task of voice command recognition, develop an implementation and to assess its performance. The complete model of the system was implemented in the C++ language, the implementation was subsequently used to determine the relevant parameters of the method and to improve the approach in order to obtain the desired accuracy. Different variants of GAs were surveyed in this project and the influence of particular operators was verified in terms of the classification success rate. The main finding from the performed numerical experiments indicates the necessity of using genetic algorithms for the learning process. In consequence, a highly accurate recognition system was obtained, providing 94.2% correctly classified patterns. The hybrid GA/KNN approach constituted a significant improvement over the simple KNN classifier. Moreover, the training time required for the GA to learn the given set of words was found to be on a level that is acceptable for the efficient functioning of the voice command recognition system.
Rocznik
Strony
377--396
Opis fizyczny
Bibliogr. 18 poz., rys., tab.
Twórcy
autor
  • Faculty of Technical Physics and Applied Mathematics, Gdansk University of Technology, Narutowicza 11/12, 80-233 Gdansk, Poland, martapad@gmail.com
Bibliografia
  • [1] Rabiner L and Biing-Hwang J 1993 Fundamentals of Speech Recognition, Prentice Hall, Englewood Cliffs, NJ
  • [2] Goldberg D E 1989 Genetic Algorithms in Search, Optimization and Machine Learning, Addison Wesley, New York
  • [3] Kelly J D and Davis L 1991 Proc. 4 th Int. Conf. Genetic Algorithms and their Applications (ICGA), San Mateo, CA, pp. 377–383
  • [4] Pei M, Goodman E D, Punch W F and Ding Y 1998 Proc. IDEAL98, Hong Kong, pp. 371–384
  • [5] Jain R and Mazumdar J 2003 Australasian Physical and Engineering Sciences in Medicine 26 (1) 6
  • [6] He H, Hawkins S, Graco W and Yao X 2000 J. Adv. Comp. Intelligence and Intelligent Informatics 4 (2) 130
  • [7] Mitchell M 1996 An Introduction to Genetic Algorithms, MIT Press, Cambridge
  • [8] Holland J H 1975 Adaptation in Natural and Artificial Systems, University of Michigan Press, Ann Arbor
  • [9] Koza J 1992 Genetic Programming: On the Programming of Computers by Means of Natural Selection, MIT Press
  • [10] Cover and Hart P 1967 IEEE Trans. on Information Theory 13 (1) 21
  • [11] Shakhnarovich G, Darrell T and Indyk P 2006 Nearest-Neighbor Methods in Learning and Vision, MIT Press
  • [12] Beyer K, Goldstein J, Ramakrishnan R and Shaft U 1999 Lecture Notes in Computer Science 1540 217
  • [13] Bot M C J 2001 Proc. 4 th European Conf. Genetic Programming, Como, Italy, pp. 256–267
  • [14] Shannon C E 1949 Proc. IRE 37 (1), pp. 10–21
  • [15] Titze I R 1994 Principles of Voice Production, Prentice Hall
  • [16] Baker J E 1987 Proc. 2 nd Int. Conf. Genetic Algorithms and their Application, Hillsdale, pp. 14–21
  • [17] Gavrilova M L 2002 J. Supercomputing 22 (1) 87
  • [18] Bandyopadhyay S, Murthy C A and Pal S K 1995 Pattern Classification with Genetic Algorithms 16 (8) 801
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BPG8-0042-0058
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.