PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

Automatic Audio Content Identification

Autorzy
Wybrane pełne teksty z tego czasopisma
Identyfikatory
Warianty tytułu
Języki publikacji
EN
Abstrakty
EN
This paper addresses the problem of automatic audio content identification. In order to determine regions of speech, music and silence in audio stream, the fusion of feature contours and their envelopes has been used. Additionally, a voicing detector and four class music genre identification stage have been incorporated into classification system. To minimize boundary errors of different audio regions, a smoothed envelope of feature contours has been proposed. Experimental results show that using proposed scheme, makes it possible to achieve acceptable classification rates for audio data segmentation. In result, this approach can be applied to the content type dependent multimedia processing.
Rocznik
Strony
485--491
Opis fizyczny
Bibliogr. 16 poz., wykr.
Twórcy
autor
  • West Pomeranian University of Technology, ul. Żołnierska 49, 71-210 Szczecin, Poland, tmaka@wi.zut.edu.pl
Bibliografia
  • 1. A. I. Al-Shoshan: Speech and Music Classification and Separation: A Review, J. King Saud Univ., Eng. Sci., vol. 19 (1), pp. 95-133, Riyadh, 2006
  • 2. K. El-Maleh, M. Klein, G. Petrucci, P. Kabal: Speech/Music Discrimination for Multimedia Applications, IEEE Int. Conf. on Acoustics, Speech, and Signal Processing: ICASSP'00, 2000
  • 3. S. Z. Li: Content-Based Audio Classification and Retrieval Using the Nearest Feature Line Method, IEEE Transactions On Speech And Audio Processing, vol. 8, no. 5, Spetember, 2000
  • 4. S. Yuan-Yuan and W. Xue and S. Bin: Several Features for Discrimination Between Vocal Sounds and Other Environmental Sounds, XII European Signal Processing Conference - EUSIPCO'04, September 6-10, Austria, 2004
  • 5. S. A. Ramprashad: A multimode transform predictive coder (MTCP) for speech and audio, IEEE Workshop on Speech Coding for Telecommunication, pp. 10-12, Finland, 1999
  • 6. B. Bessette, R. Lefebvre, R. Salami: Universal Speech/Audio Coding Using Hybrid ACELP/TCX Techniques, IEEE Int. Conf. on Acoustics, Speech and Signal Processing: ICASSP'05, 2005
  • 7. L. Tancerel, S. Ragot, R. Lefebvre: Speech/Music Discrimination for Universal Audio Coding, 20th Biennal Symposium on Communications, Queen's University, Kingston, Canada, May 28-31, 2000
  • 8. M. Hans and R. W. Schafer: Lossless Compression of Digital Audio, IEEE Signal Processing Maga zine, July, 2001
  • 9. A. M. Peinado and J. C. Segura: Speech Recognition Over Digital Channels: Robustness and Standards, John Wiley & Sons, Ltd., 2006
  • 10. P. R. Cook: Real Sound Synthesis for Interactive Applications, A K Peters, Ltd., 2002
  • 11. J. O. Smith III, Introduction To Digital Filters With Audio Applications, W3K Publishing, 2008
  • 12. A. M. Kondoz, Digital Speech - Coding for Low Bit Rate Communication Systems, John Wiley & Sons, Ltd., 2004
  • 13. J. J. Aucouturier and F. Pachet, Representing Music Genre: A State of the Art, Journal of New Music Research, vol. 32, no. 1, pp. 83-93, 2003
  • 14. G. Tzanetakis and P. Cook, Musical Genre Classification of Audio Signals, IEEE Transactions On Speech And Audio Processing, vol. 10, no. 5, July, 2002
  • 15. J. Krimphoff and S. McAdams and S. Winsberg: Characterization of the timbre of complex sounds. 2. Acoustic analysis and psychophysical quantification, J. de Physique, 4(C5), pp. 625-628, 1994
  • 16. C. C. Chang and C. J Lin: LIBSVM: a library for support vector machines, Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm, 2001
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-BWA0-0041-0019
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.