Current statistical methods and technologies used for speaker identification via dynamic formant frequency often involve classic multivariate analyses that must meet a number of criteria in order to be considered trustworthy. The authors propose more advanced classification techniques, including artificial neural networks. Owing to iterative learning algorithms, neural networks can be trained to detect highly complex, nonlinear relations hidden in input data. This study specifically considers feed-forward multilayer perceptron and radial basic function network models. The investigation involves an analysis of the Polish vowel (stressed or unstressed) in selected contexts described by the four lowest formant frequencies. Results indicate high accuracy of neural networks as a speaker identification tool reaching up to 100%. In addition, the authors have determined that the accuracy of classification is similar when based on a single context to when input data are aggregated over several different contexts.
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.