Cepstral separation difference: A novel approach for speech impairment quantification in Parkinson's disease

Khan, T.; Westin, J.; Dougherty, M.

doi:10.1016/j.bbe.2013.06.001

Artykuł - szczegóły

Tytuł artykułu

Cepstral separation difference: A novel approach for speech impairment quantification in Parkinson's disease

Autorzy

Khan T. , Westin J. , Dougherty M.

Wybrane pełne teksty z tego czasopisma

Identyfikatory

DOI

10.1016/j.bbe.2013.06.001

Warianty tytułu

Języki publikacji

Abstrakty

This paper introduces a novel approach, Cepstral Separation Difference (CSD), for quantification of speech impairment in Parkinson's disease (PD). CSD represents a ratio between the magnitudes of glottal (source) and supra-glottal (filter) log-spectrums acquired using the source-filter speech model. The CSD-based features were tested on a database consisting of 240 clinically rated running speech samples acquired from 60 PD patients and 20 healthy controls. The Guttmann (μ₂) monotonic correlations between the CSD features and the speech symptom severity ratings were strong (up to 0.78). This correlation increased with the increasing textual difficulty in different speech tests. CSD was compared with some non-CSD speech features (harmonic ratio, harmonic-to-noise ratio and Mel-frequency cepstral coefficients) for speech symptom characterization in terms of consistency and reproducibility. The high intra-class correlation coefficient (>0.9) and analysis of variance indicates that CSD features can be used reliably to distinguish between severity levels of speech impairment. Results motivate the use of CSD in monitoring speech symptoms in PD.

Słowa kluczowe

Parkinson's disease speech processing dysarthria acoustic analysis speech cepstrum

choroba Parkinsona przetwarzanie mowy dyzartria analiza akustyczna

Wydawca

Nałęcz Institute of Biocybernetics and Biomedical Engineering of the Polish Academy of Sciences
Elsevier

Czasopismo

Biocybernetics and Biomedical Engineering

Rocznik

2014

Tom

Vol. 34, no. 1

Strony

25--34

Opis fizyczny

Bibliogr. 37 poz., rys., tab., wykr.

Twórcy

autor

Khan T.

tkh@du.se

School of Technology and Business Studies, Computer Engineering, Dalarna University, Falun, Sweden; School of Innovation, Design and Technology, Malardalen University, Vasteras, Sweden

autor

Westin J.

jwe@du.se

School of Technology and Business Studies, Computer Engineering, Dalarna University, Falun, Sweden

autor

Dougherty M.

mdo@du.se

School of Technology and Business Studies, Computer Engineering, Dalarna University, Falun, Sweden

Bibliografia

[1] Olanow CW, Stern MB, Sethi K. The scientific and clinical basis for the treatment of Parkinson's disease. Neurology 2009;72:1–36.
[2] Ruggiero C, Sacile R, Giacomini M. Home telecare. J Telemed Telecare 1999;5:11–7.
[3] Ho A, Iansek R, Marigliani C, Bradshaw J, Gates S. Speech impairment in a large sample of patients with Parkinson's disease. Behav Neurol 1998;11:131–7.
[4] Solomon N. Speech breathing in Parkinson's disease. J Speech Lang Hear Res 1993;36:294–310.
[5] Gelzinis A, Verikas A, Bacauskiene M. Automated speech analysis applied to laryngeal disease categorization. Comput Methods Programs Biomed 2008;91:36–47.
[6] Rusz J, Cmejla R, Ruzickova H, Ruzicka E. Quantitative acoustic measurements for characterization of speech and voice disorders in early treated Parkinson's disease. J Acoust Soc Am 2011;129:350–67.
[7] Londono JDA, Llorente JIG, Lechon NS, Ruiz VO, Dominguez GC. Automatic detection of pathological voices using complexity measures, noise parameters, and mel-cepstral coefficients. IEEE Trans Biomed Eng 2011;58:370–8.
[8] Ganchev T, Fakotakis N, Kokkinakis G. Comparative evaluation of various MFCC implementations on the speaker verification task. In: 10 International Conference on SPECOM. 2005. pp. 191–4. 1.
[9] Llorente JIG, Gomez-Vilda P, Blanco-Velasco M. Dimensionality reduction of a pathological voice quality assessment system based on Gaussian mixture models and short-term cepstral parameters. IEEE Trans Biomed Eng 2006;53(10):1943–53.
[10] Fraile R, Saenz-Lechon N, Llorente JIG, Osma-Ruiz V, Fredouille C. Automatic detection of laryngeal pathologies in records of sustained vowels by means of mel-frequency cepstral coefficient parameters and differentiation of patients by sex. Folia Phoniatr Logop 2009;61:146–52.
[11] Midi I, Dogan M, Koseoglu M, Can G, Sehitoglu MA, Gunal DI. Voice abnormalities and their relation with motor dysfunction in Parkinson's disease. Acta Neurol Scand 2008;117:26–34.
[12] Kent RD, Weismer G, Kent JF, Vorperia HK, Duffy JR. Acoustic studies of Dysarthric speech: methods, progress, and potential. J Commun Disord 1999;32:141–80.
[13] Thomson L, Lin E, Robb MP. The impact of breathiness on the intelligibility of speech. In: Proceedings of 8th APCSLH, Christchurch, New Zealand, January 11–14; 2011.
[14] Krom DG. A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals. JHSR 1993;36:254–66.
[15] Hammaberg B, Fritzell B, Gauffin J, Sundberg J. Perceptual and acoustic correlates of abnormal voice qualities. Acta Oto-Laryngol 1980;90:441–51.
[16] Rusz J, Cmejla R, Ruzickova H, Ruzicka E. Objectification of dysarthria in Parkinson's disease using Bayes Theorem. In: Proceedings of 10th WSEAS, Athens, Greece; 2011. pp. 165–9.
[17] Paja MS, Falk TH. Automated dysarthria severity classification for improved objective intelligibility assessment of spastic dysarthric speech. In: Proceedings of 13th Annual Conference of ISCA, Portland, USA; 2012.
[18] Llorente JIG, Fraile I, Lechón RS, Ruiz NO, Vilda VGP. Automatic detection of voice impairments from text-dependent running speech. BSPC 2009;4(3):176–82.
[19] Goetz CG, Stebbins GT, Wolff D, DeLeeuw W, Bronte-Stewart H, Elble R, et al. Testing objective measures of motor impairment in early Parkinson's disease: feasibility study of an at-home testing device. Mov Disord 2009;24:551–6.
[20] Silbert N, Jong KD. Focus, prosodic context, and phonological feature specification: patterns of variation in fricative production. J Acoust Soc Am 2008;5:2769–79.
[21] Fahn S, Elton R, the UPDRS Development Committee. Unified Parkinson's disease rating scale. Recent Dev Parkinson's Dis 1987;2:153–63.
[22] Flanagan JL, Ishizaka K, Shipley KL. Synthesis of speech from a dynamic model of the vocal cords and vocal tract. Bell Syst Tech J 1975;54:485–506.
[23] Murdoch BE, editor. Dysarthria: a physiological approach to assessment and treatment. Cheltenham UK: Stanley Thornes; 1998.
[24] Murphy P. Source-filter comparison of measurements of fundamental frequency perturbation and amplitude perturbation for synthesized voice signals. J Voice 2008;22:125–37.
[25] Bogert BP, Healy MJR, Tukey JW, Rosenblatt M. The quefrency analysis of time series for echoes: Cepstrum, pseudo auto-covariance, cross-cepstrum and saphe cracking. In: Symposium on Time Series Analysis, New York, USA; 1963.
[26] Kim S, Eriksson T, Kang HG. On the time variability of vocal tract for speaker recognition. In: 8th ICSLP, Korea, October 4–8; 2004.
[27] Leo B. Acoustics. New York: American Institute of Physics; 1986.
[28] Tsanas A, Little MA, McSherry EP, Spielman J, Ramig LO. Novel speech signal processing algorithms for high-accuracy classification of Parkinson's disease. IEEE Trans Biomed Eng 2012;59:1264–71.
[29] Guttman L. A basis for scaling qualitative data. Am Sociol Rev 1944;9:139–50.
[30] Berger YG. A jackknife variance estimator for uni-stage stratified samples with unequal probabilities. Biometrika 2007;94:953–64.
[31] McGraw KO, Wong SP. Forming inferences about some IntraClass correlation coefficients. Psychol Methods 1996;1:30–46.
[32] Box GEP, Cox D. An analysis of transformations. J Roy Statist Soc Ser 1964;B26:211–52.
[33] Titze IR. Principles of voice production, 2nd ed, Iowa City: National Center for Voice Speech; 2000.
[34] Rothenberg M. A new inverse-filtering technique for deriving the glottal air flow waveform during voicing. J Acoust Soc Am 1973;53:1632–45.
[35] McNeil MR, editor. Clinical management of sensorimotor speech disorders. New York: Thieme Medical Publishers; 2009.
[36] Oppenheim AV, Schafer RW. Homomorphic analysis of speech. IEEE Trans Audio Electroacoust 1968;16: 221–6.
[37] Memedi M, Westin J, Nyholm D, Dougherty M, Groth T. A web application for follow-up of results from a mobile device test battery for Parkinson's disease patients. Comput Methods Programs Biomed 2011; 104:219–26.

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-b7f21361-657c-491c-82e4-b6a4359e0197