High accuracy and octave error immune pitch detection algorithms

Dziubiński, M.; Kostek, B.

Artykuł - szczegóły

Tytuł artykułu

High accuracy and octave error immune pitch detection algorithms

Autorzy

Dziubiński M. , Kostek B.

Wybrane pełne teksty z tego czasopisma

Identyfikatory

Warianty tytułu

Języki publikacji

Abstrakty

The aim of this paper is to present a method improving pitch estimation accuracy, showing high performance for both synthetic harmonic signals and musical instrument sounds. This method employs an Artificial Neural Network of a feed-forward type. In addition, octave error optimized pitch detection algorithm, based on spectral analysis is introduced. The proposed algorithm is very effective for signals with strong harmonic, as well as nearly sinusoidal contents. Experiments were performed on a variety of musical instrument sounds and sample results exemplifying main issues of both engineered algorithms are shown.

Słowa kluczowe

Wydawca

Instytut Podstawowych Problemów Techniki PAN
Komitet Akustyki PAN
Polskie Towarzystwo Akustyczne

Czasopismo

Archives of Acoustics

Rocznik

2004

Tom

Vol. 29, no. 1

Strony

3--23

Opis fizyczny

Bibliogr. 21 poz., tab., wykr.

Twórcy

autor

Dziubiński M.

kido@sound.eti.pg.gda.pl

Multimedia Systems Department Gdańsk University of Technology, Narutowicza 11/12, 80-952 Gdańsk, Poland

autor

Kostek B.

Multimedia Systems Department Gdańsk University of Technology, Narutowicza 11/12, 80-952 Gdańsk, Poland

Bibliografia

[1] W. HESS, Pitch determination of speech signal processing, Springer-Verlag, New York 1983.
[2] A. M. NOLL, Cepstrum pitch determination, J. Acoust. Soc. Am., 14, 293–309 (1967).
[3] L. R. RABINER, On the use of autocorrelation analysis for pitch detection, IEEE Trans. on ASSP, 25, 24–33 (1977).
[4] X. QUIAN, R. KIMARESAN, A variable frame pitch estimator and test results, IEEE Int. Conf. On Acoustics, Speech, and Signal Processing, 1, Atlanta GA, 228–231, May (1996).
[5] D. TALKIN, A robust algorithm for pitch tracking (RAPT), Speech Coding And Synthesis, pp. 495-518, Elsevier, 1995.
[6] G. S. YING, L. H. JAMIESON, C. D. MICHELL, A probabilistic approach to AMDF pitch detection, http://purcell.ecn.purdue.edu/~speechg.
[7] Y. MEDAN, E. YAIR, D. CHAZAN, An accurate pitch detection algorithm, 9-th Int. Conference on Pattern Recognition, Rome, Italy, 1, 476–80, November (1988).
[8] W. ZHANG, G. XU, Y. WANG, Pitch estimation based on circular AMDF, ICASSP 1, 341–344 (2002).
[9] X. MEI, J. PAN, S. SUN, Efficient algorithms for speech pitch estimation, Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Proc. Hong Kong, pp. 421-424, (2001).
[10] B. KOSTEK, A. CZYŻEWSKI, Representing musical instrument sounds for their automatic classification, J. Audio Eng. Soc., 49, 9, 768–785 (2001).
[11] J. D. WIZE, J. R. CAPRIO, T. W. PARKS, Maximum-likelihood pitch estimation, IEEE Trans. Of ASSP, 24, 418–423, October (1976).
[12] J. HU, S. XU, J. CHEN, A modified pitch detection algorithm, IEEE Communications Letters, 5, 2 (2001).
[13] K. KASI, S. A. ZAHORIAN, Yet another algorithm for pitch tracking, ICASSP, 1, 361–364 (2002).
[14] N. KUNIEDA, T. SHIMAMURA, J. SUZUKI, Robust method of measurement of fundamental frequency by ACOLS-autocorrelation of log spectrum, IEEE Int. Conf. On Acoustics, Speech, and Signal Processing, 1, Atlanta, GA, 232–235, May (1996).
[15] L. JANER, Modulated gaussian wavelet transform based speech analyser pitch detection algorithm, Proc. EUROSPEECH, 1, 401–404 (1995).
[16] R. J. MCAULAY, T. F. QUATIERI, Pitch estimation and voicing detection based on a sinusoidal speech model, ICASSP, 1, 249–252 (1990).
[17] L. R. RABINER, M. J. CHENG, A. E. ROSENBERG, C. A. MCGOGENAL, A comparative performance study of several pitch detection algorithms, IEEE Trans. on Acoustics, Speech and Signal Proc., ASSP-24, 5, October (1976).
[18] C. A. MCGOGENAL, L. R. RABINER, A. E. ROSENBERG, A subjective evaluation of pitch detection methods using LPC synthesized speech, IEEE Trans. on Acoustics, Speech and Signal Proc., ASSP-25, 3, June (1977).
[19] R. AHN, W. H. HOLMES, An improved harmonic-plus-noise decomposition method and its application in pitch determination, Proc. IEEE Workshop on Speech Coding for Telecommunications, Pocono Manor, Pennsylvania, pp. 41-42, (1997).
[20] C. D’ALESSANDRO, B. YEGNANARAYANA, V. DARSINOS, Decomposition of speech signals into deterministic and stochastic components, ICASSP, 1, 760–763 (1995).
[21] S. OSOWSKI, Artificial neural networks in algorithmic approach [in Polish], WNT, Warsaw 1996.

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-article-BAT3-0004-0001