PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Powiadomienia systemowe
  • Sesja wygasła!
  • Sesja wygasła!
Tytuł artykułu

Optimization of Speech Recognition by Clustering of Phones

Wybrane pełne teksty z tego czasopisma
Identyfikatory
Warianty tytułu
Języki publikacji
EN
Abstrakty
EN
Optimization of the speech recognition process is aiming at achieving short time of classification (speech to text system), while preserving the content of speech signal description, and all necessary details of speech signal in considered application. The goal of parametrization of the human's speech is to eliminate of those physical features of speech signal, that do not bring any useful information (e.g., frequency of laryngeal tone, timbre of voice). The purpose of the parametrization of a speech signal is to minimize the volume of information that is to be analyzed. Our experiments suggest that using the cluster analysis method with agglomerative hierarchical technique is very helpful in finding relationships between speech phones. It lets us accelerate the process of speech recognition, simply because it is not necessary to analyze each phone separately and comparing it with an unclassified object. This principle has been carried to hidden Markov models. To organize those models we use the cluster analysis method with hierarchical techniques. Each model represents a single sequence of speech (probably the phone sequence). At the "top" of the structure we have models of phones in the most general context. When we go thru this structure to the bottom, there are models of phones in particular context. By the context we understand the juxtaposition the different phones.
Słowa kluczowe
Wydawca
Rocznik
Strony
283--293
Opis fizyczny
tab., wykr., bibliogr. 12 poz.
Twórcy
autor
  • Institute of Computer Science University of Silesia Będzińska 39, 41-200 Sosnowiec, Poland, nowak@us.edu.pll
Bibliografia
  • [1] Bachliński S., (2003) PHONOLAB - Sound analysis with ant system for speech recognition task,Master thesis, Silesian University, Poland, [in Polish]
  • [2] Czyżewski A. (1998) Digital sound, Academic Outbuilding Publishing EXIT,Warsaw, Poland, [in Polish]
  • [3] Deller J. (2000) Discrete-Time Processing of Speech Signals, IEEE Press
  • [4] Everitt B.S. (1993) Cluster Analysis (3rd edition), Edward Arnold / Halsted Press, London
  • [5] Huang X., Acero A., Hon H. W. (2001) Spoken Language Processing - A Guide to Theory, Algorithm, and System Development, Prentice Hall, Upper Saddle River, New Jersey, USA
  • [6] Jain A.K., Dubes R.C. (1988) Algorithms for clustering data, Prentice Hall, New Yersey
  • [7] Kaufman L., Rousseeuw P.J. (1990) Finding Groups in Data: An Introduction to Cluster Analysis, JohnWiley Sons, New York
  • [8] Nowak A., Wakulicz-Deja A. (2005) The concept of the hierarchical clustering algorithms for rules based systems, Intelligent Information Systems 2005 - New Trends in Intelligent Information Processing and Web Mining, Gdansk, Poland
  • [9] Tadeusiewicz R. (1988) Speech signals, Communication Publisher,Warsaw, Poland, [in Polish]
  • [10] (2005) The HTK Book (for HTK 3.3), Cambridge University Engineering Department, England.
  • [11] Koronacki J., ´Cwik J. (2005) Statistical Learning Systems, WNT, Warsaw, Poland, [in Polish]
  • [12] Stąpor K. (2005) The automatic classification of objects, EXIT,Warsaw, Poland, [in Polish]
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BUS2-0010-0069
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.