Automatic Audio Content Identification

Mąka, T.

Artykuł - szczegóły

Tytuł artykułu

Automatic Audio Content Identification

Autorzy

Mąka T.

Wybrane pełne teksty z tego czasopisma

http://ijet.pl/index.php/ijet

Identyfikatory

Warianty tytułu

Języki publikacji

Abstrakty

This paper addresses the problem of automatic audio content identification. In order to determine regions of speech, music and silence in audio stream, the fusion of feature contours and their envelopes has been used. Additionally, a voicing detector and four class music genre identification stage have been incorporated into classification system. To minimize boundary errors of different audio regions, a smoothed envelope of feature contours has been proposed. Experimental results show that using proposed scheme, makes it possible to achieve acceptable classification rates for audio data segmentation. In result, this approach can be applied to the content type dependent multimedia processing.

Słowa kluczowe

audio classification feature contours envelope of feature contours music genre identification SVM

Wydawca

Polish Academy of Sciences, Committee of Electronics and Telecommunication

Czasopismo

Electronics and Telecommunications Quarterly

Rocznik

2009

Tom

Vol. 55, No 3

Strony

485--491

Opis fizyczny

Bibliogr. 16 poz., wykr.

Twórcy

autor

Mąka T.

West Pomeranian University of Technology, ul. Żołnierska 49, 71-210 Szczecin, Poland, tmaka@wi.zut.edu.pl

Bibliografia

1. A. I. Al-Shoshan: Speech and Music Classification and Separation: A Review, J. King Saud Univ., Eng. Sci., vol. 19 (1), pp. 95-133, Riyadh, 2006
2. K. El-Maleh, M. Klein, G. Petrucci, P. Kabal: Speech/Music Discrimination for Multimedia Applications, IEEE Int. Conf. on Acoustics, Speech, and Signal Processing: ICASSP'00, 2000
3. S. Z. Li: Content-Based Audio Classification and Retrieval Using the Nearest Feature Line Method, IEEE Transactions On Speech And Audio Processing, vol. 8, no. 5, Spetember, 2000
4. S. Yuan-Yuan and W. Xue and S. Bin: Several Features for Discrimination Between Vocal Sounds and Other Environmental Sounds, XII European Signal Processing Conference - EUSIPCO'04, September 6-10, Austria, 2004
5. S. A. Ramprashad: A multimode transform predictive coder (MTCP) for speech and audio, IEEE Workshop on Speech Coding for Telecommunication, pp. 10-12, Finland, 1999
6. B. Bessette, R. Lefebvre, R. Salami: Universal Speech/Audio Coding Using Hybrid ACELP/TCX Techniques, IEEE Int. Conf. on Acoustics, Speech and Signal Processing: ICASSP'05, 2005
7. L. Tancerel, S. Ragot, R. Lefebvre: Speech/Music Discrimination for Universal Audio Coding, 20th Biennal Symposium on Communications, Queen's University, Kingston, Canada, May 28-31, 2000
8. M. Hans and R. W. Schafer: Lossless Compression of Digital Audio, IEEE Signal Processing Maga zine, July, 2001
9. A. M. Peinado and J. C. Segura: Speech Recognition Over Digital Channels: Robustness and Standards, John Wiley & Sons, Ltd., 2006
10. P. R. Cook: Real Sound Synthesis for Interactive Applications, A K Peters, Ltd., 2002
11. J. O. Smith III, Introduction To Digital Filters With Audio Applications, W3K Publishing, 2008
12. A. M. Kondoz, Digital Speech - Coding for Low Bit Rate Communication Systems, John Wiley & Sons, Ltd., 2004
13. J. J. Aucouturier and F. Pachet, Representing Music Genre: A State of the Art, Journal of New Music Research, vol. 32, no. 1, pp. 83-93, 2003
14. G. Tzanetakis and P. Cook, Musical Genre Classification of Audio Signals, IEEE Transactions On Speech And Audio Processing, vol. 10, no. 5, July, 2002
15. J. Krimphoff and S. McAdams and S. Winsberg: Characterization of the timbre of complex sounds. 2. Acoustic analysis and psychophysical quantification, J. de Physique, 4(C5), pp. 625-628, 1994
16. C. C. Chang and C. J Lin: LIBSVM: a library for support vector machines, Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm, 2001

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-article-BWA0-0041-0019