PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

A Nonnegative Subspace Approach for Packet Loss Concealment

Wybrane pełne teksty z tego czasopisma
Identyfikatory
Warianty tytułu
PL
Metoda nieujemnej podprzestrzeni stosowana w przypadku straty pakietu
Języki publikacji
EN
Abstrakty
EN
This paper presents a nonnegative subspace approach for packet loss concealment problem. The magnitude spectrogram of speech signal is projected onto nonnegative subspace using nonnegative matrix factorization algorithm. Consequently, packet loss concealment problem is transformed to linear interpolation of the projective coefficients in nonnegative subspace. Simulation examples, objective tests show that packet loss concealment in the nonnegative subspace results in improved perceptual quality of speech compared to popular packet loss concealment algorithms.
PL
Zaprezentowano metodę subprzestrzeni dla rozwiązania problemu straty pakietu. Spektrogram amplitudowy sygnału mowy .jest poddawany projekcji do nieujemnej podprzestrzeni przy wykorzystaniu macierzy faktoryzacji. W rezultacie problem staje się możliwy do liniowej interpolacji. Osiągnięto dostrzegalną poprawę jakości przetwarzania sygnału mowy.
Rocznik
Strony
350--354
Opis fizyczny
Bibliogr. 16 poz., il., tabl., wykr.
Twórcy
autor
autor
autor
autor
  • Postgraduate Team 2, Haifu Xiang 1, Baixia District, Nanjing, China, 210007, hjj954@gmail.com
Bibliografia
  • [1] C. A. R?dbro, M. N. Murthi, S. V. Andersen, and S. H. Jensen, “Hidden Markov model-based packet loss concealment for voice over IP,” IEEE Trans. Audio, Speech, Lang. Process., 14(2006), No. 5, 1609-1623
  • [2] D. J. Goodman, G. B. Lockhart, O. J. Wasem, and W. C. Wang, “Waveform substitution techniques for recovering missing speech segments in packet voice communications,” IEEE Trans. Acoustics, Speech, Signal Process., 34(1986), No. 5, 1440-1448
  • [3] Y. J. Liang, N. Färber, and B. Girod, “Adaptive playout scheduling and loss concealment for voice communication over ip networks,” IEEE Trans. Multimedia, 5(2003), No.2, 532-543
  • [4] Esfandiar Zavarehei and Saeed Vaseghi, “Interpolation of Lost Speech Segments Using LP-HNM Model With Codebook Post-Processing” IEEE Trans. Multimedia, 10(2008), No.3, 493-502
  • [5] D.D. Lee and H.S. Seung, “Learning the Parts of Objects by Nonnegative Matrix Factorization,” Nature, 401(1999), No. 6755, 788-791
  • [6] D. D. Lee and H. S. Seung, “Algorithms for nonnegative matrix factorization,” Neural Inf. Process. Syst., 2001, 556-562
  • [7] T.Virtanen, “Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria,” IEEE Trans. Audio, Speech, Lang. Process., 15(2007), No. 3, 1066-1074
  • [8] E. Vincent, N. Bertin, and R. Badeau, “Harmonic and inharmonic nonnegative matrix factorization for polyphonic pitch transcription,” Proc. ICASSP, 2008, 109-112
  • [9] Alexey Ozerov and Cédric Févotte, “Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation,” IEEE Trans. Audio, Speech, Lang. Process., 18(2010), No. 3, 550-563
  • [10] Nancy Bertin, Roland Badeau, and Emmanuel Vincent, “Enforcing harmonicity and smoothness in bayesian nonnegative matrix factorization applied to polyphonic music transcription,” IEEE Trans. Audio, Speech, Lang. Process., 18(2010), No. 3, 538-549
  • [11] Romain Hennequin, Roland Badeau and Bertrand David, “NMF with time-frequency activations to model non stationary audio events,” Proc. ICASSP, 2010, 445-448
  • [12] Asari, H., Pearlmutter, B. A., and Zador, A. M., “Sparse representations for the cocktail party problem”. J Neurosci. 26(2006), No.28, 7477–7490
  • [13] Zhe Chen, Cichocki, A., Rutkowski, T.M., “Constrained non- Negative Matrix Factorization Method for EEG Analysis in Early Detection of Alzheimer Disease,” Proc. ICASSP, 2006, 109-112
  • [14] Stone J. V. “Blind source separation using temporal predictability,” Neural Computation, 13(2001), 1559-1574
  • [15] Appendix I: A High Quality Low-Complexity Algorithm for Packet Loss Concealment with G.711. ITU-T Recommendation- G.711, 1999.
  • [16] Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for End-to-End Speech Quality Assessment of Narrowband Telephone Networks and Speech Codecs, ITU-T Recommendation-862, 2001.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-article-PWA7-0054-0006
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.