Optimization of Speech Recognition by Clustering of Phones

Nowak, A.; Wakulicz-Deja, A.; Bachliński, S.

Powiadomienia systemowe

Sesja wygasła!
Sesja wygasła!

Artykuł - szczegóły

Tytuł artykułu

Optimization of Speech Recognition by Clustering of Phones

Autorzy

Nowak A. , Wakulicz-Deja A. , Bachliński S.

Wybrane pełne teksty z tego czasopisma

https://fi.episciences.org/

Identyfikatory

Warianty tytułu

Języki publikacji

Abstrakty

Optimization of the speech recognition process is aiming at achieving short time of classification (speech to text system), while preserving the content of speech signal description, and all necessary details of speech signal in considered application. The goal of parametrization of the human's speech is to eliminate of those physical features of speech signal, that do not bring any useful information (e.g., frequency of laryngeal tone, timbre of voice). The purpose of the parametrization of a speech signal is to minimize the volume of information that is to be analyzed. Our experiments suggest that using the cluster analysis method with agglomerative hierarchical technique is very helpful in finding relationships between speech phones. It lets us accelerate the process of speech recognition, simply because it is not necessary to analyze each phone separately and comparing it with an unclassified object. This principle has been carried to hidden Markov models. To organize those models we use the cluster analysis method with hierarchical techniques. Each model represents a single sequence of speech (probably the phone sequence). At the "top" of the structure we have models of phones in the most general context. When we go thru this structure to the bottom, there are models of phones in particular context. By the context we understand the juxtaposition the different phones.

Słowa kluczowe

Wydawca

IOS Press

Czasopismo

Fundamenta Informaticae

Rocznik

2006

Tom

Vol. 72, nr 1-3

Strony

283--293

Opis fizyczny

tab., wykr., bibliogr. 12 poz.

Twórcy

autor

Nowak A.

autor

Wakulicz-Deja A.

autor

Bachliński S.

Institute of Computer Science University of Silesia Będzińska 39, 41-200 Sosnowiec, Poland, nowak@us.edu.pll

Bibliografia

[1] Bachliński S., (2003) PHONOLAB - Sound analysis with ant system for speech recognition task,Master thesis, Silesian University, Poland, [in Polish]
[2] Czyżewski A. (1998) Digital sound, Academic Outbuilding Publishing EXIT,Warsaw, Poland, [in Polish]
[3] Deller J. (2000) Discrete-Time Processing of Speech Signals, IEEE Press
[4] Everitt B.S. (1993) Cluster Analysis (3rd edition), Edward Arnold / Halsted Press, London
[5] Huang X., Acero A., Hon H. W. (2001) Spoken Language Processing - A Guide to Theory, Algorithm, and System Development, Prentice Hall, Upper Saddle River, New Jersey, USA
[6] Jain A.K., Dubes R.C. (1988) Algorithms for clustering data, Prentice Hall, New Yersey
[7] Kaufman L., Rousseeuw P.J. (1990) Finding Groups in Data: An Introduction to Cluster Analysis, JohnWiley Sons, New York
[8] Nowak A., Wakulicz-Deja A. (2005) The concept of the hierarchical clustering algorithms for rules based systems, Intelligent Information Systems 2005 - New Trends in Intelligent Information Processing and Web Mining, Gdansk, Poland
[9] Tadeusiewicz R. (1988) Speech signals, Communication Publisher,Warsaw, Poland, [in Polish]
[10] (2005) The HTK Book (for HTK 3.3), Cambridge University Engineering Department, England.
[11] Koronacki J., ´Cwik J. (2005) Statistical Learning Systems, WNT, Warsaw, Poland, [in Polish]
[12] Stąpor K. (2005) The automatic classification of objects, EXIT,Warsaw, Poland, [in Polish]

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-article-BUS2-0010-0069