Identyfikatory
Warianty tytułu
Języki publikacji
Abstrakty
Speech emotion recognition is deemed to be a meaningful and intractable issue among a number of do- mains comprising sentiment analysis, computer science, pedagogy, and so on. In this study, we investigate speech emotion recognition based on sparse partial least squares regression (SPLSR) approach in depth. We make use of the sparse partial least squares regression method to implement the feature selection and dimensionality reduction on the whole acquired speech emotion features. By the means of exploiting the SPLSR method, the component parts of those redundant and meaningless speech emotion features are lessened to zero while those serviceable and informative speech emotion features are maintained and selected to the following classification step. A number of tests on Berlin database reveal that the recogni- tion rate of the SPLSR method can reach up to 79.23% and is superior to other compared dimensionality reduction methods.
Wydawca
Czasopismo
Rocznik
Tom
Strony
465--470
Opis fizyczny
Bibliogr. 39 poz., rys., tab.
Twórcy
autor
- The School of Information Science and Engineering, Southeast University No.2 Sipailou, Nanjing, P.R.China
autor
- Research Center for Learning Science, Southeast University No.2 Sipailou, Nanjing, P.R.China
autor
- Research Center for Learning Science, Southeast University No.2 Sipailou, Nanjing, P.R.China
autor
- Research Center for Learning Science, Southeast University No.2 Sipailou, Nanjing, P.R.China
Bibliografia
- 1. Ayadi M.E., Kamel M.S., Karray F. (2011), Survey on speech emotion recognition: features, classification schemes, and databases, Pattern Recognition, 44, 572-587.
- 2. Belhumeur P.N., Hespanha P. (1997), Eigenfaces vs. Fisherfaces: recognition using class specific linear projection, IEEE Trans. Pattern Anal. Mach. Intell, 19, 711-720.
- 3. Baek J.S., Kim M. (2004), Face Recognition using paritial least squares components, Pattern Recognition, 37, 1303-1306.
- 4. Burkhardt F. et al. (2005), A database of german emotional speech, Interspeech, 1517-1520.
- 5. Bishop C. (2006), Pattern recognition and machine learning, Springer, United States of America.
- 6. Chen L.F., Mark H.Y., Ko M.T. (2000), A new LDA-based face recognition system which can solve the small sample size problem, Pattern Recognition, 33, 1713-1726.
- 7. Cen L., Ser W., Liang Z. (2008), Speech emotion recognition using canonical correlation analysis and probabilistic neural network, International Conference on Machine Learning and Applications, 859-862.
- 8. Cai D., He X.F., Han J.W. (2007), Spectral Regression: A Unified Approach for Sparse Subspace Learning, IEEE International Conference on Data Mining, 73-82.
- 9. Cao K.A., Rossouwy D., Grani C.R., Besse P. (2008), A sparse PLS for variable selection when integrating omics data, Statistical Applications in Genetics and Molecular Biology, 7, 35-45.
- 10. Cao K.A., Martin P.G.P., Robert-Granié C., Besse P. (2009), Sparse canonical methods for biological data integration: application to a cross-platform study, BMC Bioinformatics, 10.
- 11. Cao K.A., Boitard S., C.R., Besse P. (2011), Sparse PLS Discriminant Analysis: biologically relevant feature selection and graphical displays for multiclass problems, Technical report, University of Queensland.
- 12. Cao K.A., Gall G.L. (2011), Integration and variable selection of ‘omics’ data sets with PLS: a survey, Journal de la Société Fran,caise de Statistique, 152, 77-96.
- 13. Chen L.J., Mao X., Xue Y.L, Cheng L.L. (2012), Speech emotion recognition: features and classification models, Digital Signal Processing, 2, 154-1160.
- 14. Chun H., Keles S. (2010), Sparse partial least squares regression for simultaneous dimension reduction and variable selection, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 72, 3-25.
- 15. Chung D., Keles S. (2010), Sparse Partial Least Squares Classification for High Dimensional Data, Statistical Applications in Genetics and Molecular Biology, 9.
- 16. Gu W.Y. (2010), A Study on Speech Emotion Recognition, master thesis, Research Center for Learning Science, Southeast University (in chinese).
- 17. Hotelling H. (1936), Relations between two sets of variates, Biometrika, 28, 312-377.
- 18. Huang D.Y., Zhu Y.W., Wu D.J., Yu R.S. (2012), Detecting Intelligibility by Linear Dimensionality Reduction and Normalized Voice Quality Hierarchical Features, Interspeech, 546-549.
- 19. Jin Y., Zheng W.M., Zhao L., Yan J.J. (2013), Generalized maximal margin discriminant analysis for speech emotion recognition, Electrical review.
- 20. Kembhavi A., Harwood D., Davis L.S. (2011), Vehicle detection using partial least squares, IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 1250-1265.
- 21. Krishnan A., Williams L.J., McIntosh A.R., Abdi H. (2011), Partial Least Squares (PLS) methods for neuroimaging: A tutorial and review, 56, 455-475.
- 22. Lai Z.H., Wan M.H., Jin Z., Yang J. (2012), Sparse two-dimensional local discriminant projections for feature extraction, Neurocomputing, 74, 629-637.
- 23. Long N., Gianola D., Rosa G.J.M., Weigel K.A. (2011), Dimension reduction and variable selection for genomic selection: application to predicting milk yield in Holsteins, Journal of Animal Breeding and Genetics, 128, 147-257.
- 24. Ma L.L. (2010), The Research and Application of Facial Expression Recognition based on Semantic Feature, master thesis, Research Center for Learning Science, Southeast University (in chinese).
- 25. Manne R. (1987), Analysis of Two Partial-Least-Squares Algorithms for Multivariate, Calibration Chemometrics and Intelligent Laboratory Systems, 2, 187-197.
- 26. McWilliams B., Montana G. (2010), Sparse partial least squares regression for on-Line variable selection with multivariate data streams, Statistical Analysis and Data Mining, 3, 170-193.
- 27. Parkhomenko E., Tritchler D., Beyene J. (2009), Sparse canonical correlation analysis with application to genomic data integration, Statistical Applications in Genetics and Molecular Biology, 8, 55-61.
- 28. Qiao Z., Zhou L., Huang J.Z. (2009), Sparse linear discriminant analysis with application to high dimensional low sample size data, International Journal of Applied Mathematics, 39, 48-60.
- 29. Rosopal R., Kamer N. (2008), Overview and recent advances in partial least squares. subspace latent structure and feature selection, Statistical and Optimization Perspectives Workshop, 3940, 34-51.
- 30. Shen H., Huang J. (2008), Sparse principal component analysis via regularized low rank matrix approximation, Journal of Multivariate Analysis, 99, 1015-1034.
- 31. Ser W., Cen L., Yu Z.L. (2008), A Hybrid PNNGMM classification scheme for speech emotion recognition, International Conference on Pattern Recognition, 1-4.
- 32. Schwartz W.R., Kembhavi A., Harwood D., Davis L.S. (2009), Human detection using partial least squares analysis, IEEE International Conference on Computer Vision, 24-31.
- 33. Turk M., Pentland A.P. (1991), Face recognition using eigenfaces, IEEE Conference on Computer Vision and Pattern Recognition, 586-591.
- 34. Wu S.Q., Tiago H., Falk, Chan W.Y. (2011), Automatic speech emotion recognition using modulation spectral features, Speech Communication, 53, 768-785.
- 35. Yan J.J., Zheng W.M., Zhou X.Y., Zhao Zhi.J. (2012), Sparse Two-dimensional Canonical Correlation Analysis via Low Rank Matrix Approximation for Feature Extraction, IEEE Signal Processing Letters, 19, 51-54.
- 36. Yan J.J., Zheng W.M., Xin M.H., Qiu Wei (2013), Bimodal Emotion Recognition based on Body Gesture and Facial Expression, Journal of Image and Graphics, 18, 1101-1106 (in chinese).
- 37. Zou H., Hastie T., Tibshirani R. (2006), Sparse principal component analysis, Journal of Computational and Graphical Statistics, 15, 265-286.
- 38. Zhou X.Y., Zheng W.M., Xin M.H. (2012), Improving CCA via spectral components selection for facial expression recognition, IEEE International Symposium on Circuits and Systems (ISCAS), 1696-1699.
- 39. Zheng W.M., Zhou X.Y. (2012), Speech Emotion Recognition Based on Kernel Reduced-rank Regression, International Conference on Pattern Recognition (ICPR 2012), 1972-1976.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-fedf941e-c2a0-4060-b635-54cfba219d0e