Ocena parametrów analizy akustycznej w detekcji patologii mowy

Panek, D.

doi:10.12915/pe.2014.05.29

Artykuł - szczegóły

Tytuł artykułu

Ocena parametrów analizy akustycznej w detekcji patologii mowy

Autorzy

Panek D.

Wybrane pełne teksty z tego czasopisma

http://pe.org.pl/

Identyfikatory

DOI

10.12915/pe.2014.05.29

Warianty tytułu

Parameters evaluation of acoustic analysis in speech pathology detection

Języki publikacji

Abstrakty

Diagnostyka stanu dróg głosowych wymaga stworzenia wektora, który składa się z różnych parametrów akustycznych, co może pomóc w szybkiej oraz automatycznej detekcji patologii głosu. W niniejszym artykule przedstawiono wektor cech złożony z 31 parametrów. Parametry mowy zostały wyodrębnione w dziedzinie czasu, częstotliwości oraz cepstralnej. Wybór parametrów niezbędnych w ocenie patologii głosu został potwierdzony w analizie głównych składowych, jądrowej analizie głównych składowych (kernel PCA) oraz liniowej analizie dyskryminacyjnej (LDA).

The diagnosis of the current state of the vocal tract requires the creation of a feature vector that consists of various acoustic parameters, which can help in rapid and automatic detection of voice pathologies. Vector consisting of 31 parameters was done in this project. Speech parameters were extracted in the time, frequency and cepstral domain. Essential parameters were selected and analysed using principal component analysis, kernel principal component analysis and linear discriminant analysis.

Słowa kluczowe

analiza akustyczna patologiczny sygnał mowy analiza głównych składowych liniowa analiza dyskryminacyjna

acoustic analysis pathological speech signal principal component analysis linear discriminant analysis

Wydawca

Wydawnictwo SIGMA-NOT

Czasopismo

Przegląd Elektrotechniczny

Rocznik

2014

Tom

R. 90, nr 5

Strony

126--129

Opis fizyczny

Bibliogr. 18 poz., schem., tab., wykr.

Twórcy

autor

Panek D.

dpanek@agh.edu.pl

AGH Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie, Katedra Metrologii i Elektroniki, al. Mickiewicza 30, 30-060 Kraków

Bibliografia

[1] Gogh C.D.L., Festen J.M., Verdonck-de Leeuw I.M., Parker A.J.,Acoustical analysis of tracheoesophageal voice, Speech Communication, Vol. 47, Issue 1-2 (2005), 160-168
[2] Martinez D., Lleida E., Ortega A., Miguel A., Villalba J., Voice Pathology Detection on the Saarbruecken Voice Database with Calibration and Fusion of Scores Using MultiFocal Toolkit, Advances in Speech and Language Technologies for Iberian Languages Communications in Computer and Information Science, vol. 328 (2012), 99-109
[3] Orozco-Arroyave J.R., Murillo-Rendon S., Alvares-Meza A.M., Arias-Londono J.D., Delgado-Trejos E., Vargas-Bonilla J.F., Castellanos-DOmingues C.G., Automatic Selection of Acoustic and Non-linear Dynamic Features in Voice Signals for Hypernasality Detection, Interspeech (2011), 529-532
[4] Bishop Ch.M., Pattern Recognition and Machine Learning, Springer Science (2006), Singapur, 559-599
[5] Vetter R., Virag N., Renevey P., Vesin J.M., Single channel speech enhancement using principal component analysis and MDL subspace selection, Eurospeech (1999), ISCA
[6] Yang J., Yang J.-Y, Why can LDA be performed in PCA transformed space?, Pattern Recogn. 36(2) (2003), 563-566
[7] Liu C., Gabor-based kernel PCA with fractional power polynomial models for face recognition, IEEE Trans, Pattern Anal. Mach. Intell., 26(5) (2004), 572-581
[8] Gramacki J., Gramacki A., Wybrane metody redukcji wymiarowości danych oraz ich wizualizacje, XIV Konferencja PLOUG (2008), Szczyrk
[9] Scholkopf B., Smola A., Muller K.R., Kernel principal component analysis, Artufucial Neural Networks - iCANN'97, Berlin (1997), 583-588
[10] Scholkopf B., Smola A., Muller K.R., Nonlinear component analysis as a kernel eigenvalue problem, Neural Comput., 10(5) (1998), 1299-1319
[11] Wu J., Wang J., Liu L., Feature extraction via KPCA for classification of gair patterns, ScienceDirect, Human Movement Science 26(2007), 393-411
[12] Balakrishnama S., Ganapathiraju A., Linear Discriminant Analysis - a brief tutorial, Institute for Signal and Information Processing, Department of Electrical and Computer Engineering, Mississipi State University (2014)
[13] Wu Y., Principal Component Analysis and Linear Discriminant Analysis, Electrical Engineering and Computer Science, Northwestern University, Evanston, wykład (2014)
[14] Young X., Zhang D., Song F., Yang J-Y., Jing Z., Li M., A method for speeding up feature extraction based on KPCA, Neurocomputing 70 (2007), 1056-1061
[15] Epstein M.A., Payri B.G., The effects of vowel quality and pitch on spectral and glottal flow measurements of the voice source, Lecture, University of California, Los Angeles, http://www.linguistics.ucla.edu/people/grads/melissa/effects_of _vowel_quality.PDF (2014)
[16] Maciel C.D., Pereira J.S., Identifying healthy and pathologically affected voice signals, IEEE Signal Processing Magazine (2010), 120-123
[17] Osowski S., Sieci neuronowe w ujęciu algorytmicznym, WNT, (1996)
[18] Engel Z.W., Klaczynski M., Wszolek W., A Vibroacoustic Model of Selected Human Larynx Diseases, International Journal of Occupational Safety and Ergonomics (JOSE), 13(4) (2007), 367-379

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-7870ff5e-e8f0-47cd-81b9-b2fc8a19c7f2