PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

High accuracy and octave error immune pitch detection algorithms

Identyfikatory
Warianty tytułu
Języki publikacji
EN
Abstrakty
EN
The aim of this paper is to present a method improving pitch estimation accuracy, showing high performance for both synthetic harmonic signals and musical instrument sounds. This method employs an Artificial Neural Network of a feed-forward type. In addition, octave error optimized pitch detection algorithm, based on spectral analysis is introduced. The proposed algorithm is very effective for signals with strong harmonic, as well as nearly sinusoidal contents. Experiments were performed on a variety of musical instrument sounds and sample results exemplifying main issues of both engineered algorithms are shown.
Słowa kluczowe
Rocznik
Strony
3--23
Opis fizyczny
Bibliogr. 21 poz., tab., wykr.
Twórcy
  • Multimedia Systems Department Gdańsk University of Technology, Narutowicza 11/12, 80-952 Gdańsk, Poland
autor
  • Multimedia Systems Department Gdańsk University of Technology, Narutowicza 11/12, 80-952 Gdańsk, Poland
Bibliografia
  • [1] W. HESS, Pitch determination of speech signal processing, Springer-Verlag, New York 1983.
  • [2] A. M. NOLL, Cepstrum pitch determination, J. Acoust. Soc. Am., 14, 293–309 (1967).
  • [3] L. R. RABINER, On the use of autocorrelation analysis for pitch detection, IEEE Trans. on ASSP, 25, 24–33 (1977).
  • [4] X. QUIAN, R. KIMARESAN, A variable frame pitch estimator and test results, IEEE Int. Conf. On Acoustics, Speech, and Signal Processing, 1, Atlanta GA, 228–231, May (1996).
  • [5] D. TALKIN, A robust algorithm for pitch tracking (RAPT), Speech Coding And Synthesis, pp. 495-518, Elsevier, 1995.
  • [6] G. S. YING, L. H. JAMIESON, C. D. MICHELL, A probabilistic approach to AMDF pitch detection, http://purcell.ecn.purdue.edu/~speechg.
  • [7] Y. MEDAN, E. YAIR, D. CHAZAN, An accurate pitch detection algorithm, 9-th Int. Conference on Pattern Recognition, Rome, Italy, 1, 476–80, November (1988).
  • [8] W. ZHANG, G. XU, Y. WANG, Pitch estimation based on circular AMDF, ICASSP 1, 341–344 (2002).
  • [9] X. MEI, J. PAN, S. SUN, Efficient algorithms for speech pitch estimation, Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Proc. Hong Kong, pp. 421-424, (2001).
  • [10] B. KOSTEK, A. CZYŻEWSKI, Representing musical instrument sounds for their automatic classification, J. Audio Eng. Soc., 49, 9, 768–785 (2001).
  • [11] J. D. WIZE, J. R. CAPRIO, T. W. PARKS, Maximum-likelihood pitch estimation, IEEE Trans. Of ASSP, 24, 418–423, October (1976).
  • [12] J. HU, S. XU, J. CHEN, A modified pitch detection algorithm, IEEE Communications Letters, 5, 2 (2001).
  • [13] K. KASI, S. A. ZAHORIAN, Yet another algorithm for pitch tracking, ICASSP, 1, 361–364 (2002).
  • [14] N. KUNIEDA, T. SHIMAMURA, J. SUZUKI, Robust method of measurement of fundamental frequency by ACOLS-autocorrelation of log spectrum, IEEE Int. Conf. On Acoustics, Speech, and Signal Processing, 1, Atlanta, GA, 232–235, May (1996).
  • [15] L. JANER, Modulated gaussian wavelet transform based speech analyser pitch detection algorithm, Proc. EUROSPEECH, 1, 401–404 (1995).
  • [16] R. J. MCAULAY, T. F. QUATIERI, Pitch estimation and voicing detection based on a sinusoidal speech model, ICASSP, 1, 249–252 (1990).
  • [17] L. R. RABINER, M. J. CHENG, A. E. ROSENBERG, C. A. MCGOGENAL, A comparative performance study of several pitch detection algorithms, IEEE Trans. on Acoustics, Speech and Signal Proc., ASSP-24, 5, October (1976).
  • [18] C. A. MCGOGENAL, L. R. RABINER, A. E. ROSENBERG, A subjective evaluation of pitch detection methods using LPC synthesized speech, IEEE Trans. on Acoustics, Speech and Signal Proc., ASSP-25, 3, June (1977).
  • [19] R. AHN, W. H. HOLMES, An improved harmonic-plus-noise decomposition method and its application in pitch determination, Proc. IEEE Workshop on Speech Coding for Telecommunications, Pocono Manor, Pennsylvania, pp. 41-42, (1997).
  • [20] C. D’ALESSANDRO, B. YEGNANARAYANA, V. DARSINOS, Decomposition of speech signals into deterministic and stochastic components, ICASSP, 1, 760–763 (1995).
  • [21] S. OSOWSKI, Artificial neural networks in algorithmic approach [in Polish], WNT, Warsaw 1996.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BAT3-0004-0001
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.