A Nonnegative Subspace Approach for Packet Loss Concealment

Huang, J.; Zhang, Y.; Zhang, X.; Xia, Z.

Artykuł - szczegóły

Tytuł artykułu

A Nonnegative Subspace Approach for Packet Loss Concealment

Autorzy

Huang J. , Zhang Y. , Zhang X. , Xia Z.

Wybrane pełne teksty z tego czasopisma

http://pe.org.pl/

Identyfikatory

Warianty tytułu

Metoda nieujemnej podprzestrzeni stosowana w przypadku straty pakietu

Języki publikacji

Abstrakty

This paper presents a nonnegative subspace approach for packet loss concealment problem. The magnitude spectrogram of speech signal is projected onto nonnegative subspace using nonnegative matrix factorization algorithm. Consequently, packet loss concealment problem is transformed to linear interpolation of the projective coefficients in nonnegative subspace. Simulation examples, objective tests show that packet loss concealment in the nonnegative subspace results in improved perceptual quality of speech compared to popular packet loss concealment algorithms.

Zaprezentowano metodę subprzestrzeni dla rozwiązania problemu straty pakietu. Spektrogram amplitudowy sygnału mowy .jest poddawany projekcji do nieujemnej podprzestrzeni przy wykorzystaniu macierzy faktoryzacji. W rezultacie problem staje się możliwy do liniowej interpolacji. Osiągnięto dostrzegalną poprawę jakości przetwarzania sygnału mowy.

Słowa kluczowe

electrical technology electrical power engineering packet loss concealment nonnegative subspace nonnegative matrix factorization

elektrotechnika elektroenergetyka strata pakietu nieujemna przestrzeń

Wydawca

Wydawnictwo SIGMA-NOT

Czasopismo

Przegląd Elektrotechniczny

Rocznik

2011

Tom

R. 87, nr 10

Strony

350--354

Opis fizyczny

Bibliogr. 16 poz., il., tabl., wykr.

Twórcy

autor

Huang J.

autor

Zhang Y.

autor

Zhang X.

autor

Xia Z.

Postgraduate Team 2, Haifu Xiang 1, Baixia District, Nanjing, China, 210007, hjj954@gmail.com

Bibliografia

[1] C. A. R?dbro, M. N. Murthi, S. V. Andersen, and S. H. Jensen, “Hidden Markov model-based packet loss concealment for voice over IP,” IEEE Trans. Audio, Speech, Lang. Process., 14(2006), No. 5, 1609-1623
[2] D. J. Goodman, G. B. Lockhart, O. J. Wasem, and W. C. Wang, “Waveform substitution techniques for recovering missing speech segments in packet voice communications,” IEEE Trans. Acoustics, Speech, Signal Process., 34(1986), No. 5, 1440-1448
[3] Y. J. Liang, N. Färber, and B. Girod, “Adaptive playout scheduling and loss concealment for voice communication over ip networks,” IEEE Trans. Multimedia, 5(2003), No.2, 532-543
[4] Esfandiar Zavarehei and Saeed Vaseghi, “Interpolation of Lost Speech Segments Using LP-HNM Model With Codebook Post-Processing” IEEE Trans. Multimedia, 10(2008), No.3, 493-502
[5] D.D. Lee and H.S. Seung, “Learning the Parts of Objects by Nonnegative Matrix Factorization,” Nature, 401(1999), No. 6755, 788-791
[6] D. D. Lee and H. S. Seung, “Algorithms for nonnegative matrix factorization,” Neural Inf. Process. Syst., 2001, 556-562
[7] T.Virtanen, “Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria,” IEEE Trans. Audio, Speech, Lang. Process., 15(2007), No. 3, 1066-1074
[8] E. Vincent, N. Bertin, and R. Badeau, “Harmonic and inharmonic nonnegative matrix factorization for polyphonic pitch transcription,” Proc. ICASSP, 2008, 109-112
[9] Alexey Ozerov and Cédric Févotte, “Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation,” IEEE Trans. Audio, Speech, Lang. Process., 18(2010), No. 3, 550-563
[10] Nancy Bertin, Roland Badeau, and Emmanuel Vincent, “Enforcing harmonicity and smoothness in bayesian nonnegative matrix factorization applied to polyphonic music transcription,” IEEE Trans. Audio, Speech, Lang. Process., 18(2010), No. 3, 538-549
[11] Romain Hennequin, Roland Badeau and Bertrand David, “NMF with time-frequency activations to model non stationary audio events,” Proc. ICASSP, 2010, 445-448
[12] Asari, H., Pearlmutter, B. A., and Zador, A. M., “Sparse representations for the cocktail party problem”. J Neurosci. 26(2006), No.28, 7477–7490
[13] Zhe Chen, Cichocki, A., Rutkowski, T.M., “Constrained non- Negative Matrix Factorization Method for EEG Analysis in Early Detection of Alzheimer Disease,” Proc. ICASSP, 2006, 109-112
[14] Stone J. V. “Blind source separation using temporal predictability,” Neural Computation, 13(2001), 1559-1574
[15] Appendix I: A High Quality Low-Complexity Algorithm for Packet Loss Concealment with G.711. ITU-T Recommendation- G.711, 1999.
[16] Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for End-to-End Speech Quality Assessment of Narrowband Telephone Networks and Speech Codecs, ITU-T Recommendation-862, 2001.

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-article-PWA7-0054-0006