Article title

Low Delay Sparse and Mixed Excitation CELP Coders for Wideband Speech Coding

Publication languages
EN
Abstracts
EN
Code Excited Linear Prediction (CELP) algorithms are proposed for compressing speech in an 8 kHz band at a switched or variable bit rate, with an algorithmic delay not exceeding 2 ms. Two structures of low-delay CELP coders are analyzed: sparse excitation CELP and mixed excitation CELP. Sparse excitation is based on MP-MLQ and multilayer models. The mixed excitation CELP algorithm stems from the narrowband G.728 standard. In contrast to the G.728 LD-CELP coder, the mixed excitation codebook consists of pseudorandom vectors and sequences obtained with Long-Term Prediction (LTP). Variable rate coding consists in maximizing the vector dimension while maintaining the required speech quality. Good speech quality (MOS = 3.9 according to the PESQ algorithm) is obtained at an average bit rate of 33.5 kbit/s.
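The two ideas named in the abstract, a mixed codebook (pseudorandom vectors plus LTP-derived sequences) and variable-rate coding by maximizing the vector dimension, can be illustrated with a minimal sketch. It is not the authors' coder: the function names, the codebook size, the lag range, the SNR criterion, and the candidate dimensions are all hypothetical, and the perceptual weighting and synthesis filtering of a real CELP coder are omitted. The candidate dimensions are capped at 32 samples, which corresponds to 2 ms under an assumed 16 kHz sampling rate.

```python
import numpy as np

def ltp_candidates(past, dim, lags):
    """LTP-style candidates: past excitation delayed by each lag (hypothetical helper)."""
    out = []
    for lag in lags:
        start = len(past) - lag
        seg = past[start:start + dim]
        # If the lag is shorter than the vector, extend the segment periodically.
        reps = int(np.ceil(dim / len(seg)))
        out.append(np.tile(seg, reps)[:dim])
    return np.array(out)

def best_match(target, codebook):
    """Return (index, gain, error energy) of the best gain-scaled codebook vector."""
    corr = codebook @ target
    energy = np.sum(codebook ** 2, axis=1) + 1e-12
    score = corr ** 2 / energy            # error reduction achieved by each vector
    i = int(np.argmax(score))
    err = max(float(np.sum(target ** 2) - score[i]), 0.0)
    return i, float(corr[i] / energy[i]), err

def choose_dimension(signal, pos, past, dims, snr_db_min,
                     n_random=64, lags=range(20, 148), seed=0):
    """Pick the largest vector dimension whose best match meets the SNR target.

    Assumption: plain waveform SNR stands in for the quality criterion.
    Every lag must lie between 1 and len(past).
    """
    result = None
    for dim in sorted(dims, reverse=True):
        target = signal[pos:pos + dim]
        # Seeded generator: the same pseudorandom vectors can be rebuilt by a decoder.
        rng = np.random.default_rng(seed + dim)
        codebook = np.vstack([rng.standard_normal((n_random, dim)),
                              ltp_candidates(past, dim, lags)])
        i, gain, err = best_match(target, codebook)
        snr = 10.0 * np.log10(np.sum(target ** 2) / (err + 1e-12))
        result = (dim, i, gain, float(snr))
        if snr >= snr_db_min:
            return result
    return result  # no dimension met the target: fall back to the smallest one
```

With a fixed codebook size, the index and gain bits are spent once per vector, so a longer vector lowers the bit rate per sample. That is why the sketch tries the largest dimension first and shortens the vector only when the quality target is missed.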
Pages
69–76
Physical description
Bibliography: 22 items, diagrams, tables, charts.
Authors
  • Warsaw University of Technology, Institute of Telecommunications, Poland
Bibliography
  • [1] Chen Juin-Hwey and J. Thyssen, “The Broadvoice Speech Coding Algorithm”. IEEE International Conference on Acoustics, Speech and Signal Processing – ICASSP2007, pp.537-540, DOI 10.1109/ICASSP.2007.366968
  • [2] ETSI. “3GPP TS 26.441 EVS codec”, 2014.
  • [3] ITU-T, “Recommendation G.722.2, Wideband coding of speech at around 16 kbit/s using Adaptive Multi-Rate Wideband (AMR-WB)”, 2003.
  • [4] ITU-T, “Recommendation G.722.1, Low-complexity coding at 24 and 32 kbit/s for hands-free operation in systems with low frame loss”, 2005.
  • [5] ITU-T, “Recommendation G.729.1: G.729-based embedded variable bitrate coder: An 8-32 kbit/s scalable wideband coder bitstream interoperable with G.729”, 2006.
  • [6] ITU-T, “Recommendation G.718, Frame error robust narrow-band and wideband embedded variable bit-rate coding of speech and audio from 8-32 kbit/s”, 2008.
  • [7] ITU-T, “Recommendation G.722, 7 kHz audio-coding within 64 kbit/s”, 2012.
  • [8] ITU-T, “Recommendation G.711.1: Wideband embedded extension for ITU-T G.711 pulse code modulation”, 2012.
  • [9] J.M. Valin, T.B. Terriberry, C. Montgomery and G. Maxwell, “A High-Quality Speech and Audio Codec With Less Than 10 ms Delay”. IEEE Trans. on Audio, Speech and Language Processing, vol. 18, no. 1, Jan. 2010, DOI 10.1109/TASL.2009.2023186
  • [10] K. Vos, K. V. Sorensen, S. S. Jensen and J.M. Valin, “Voice coding with Opus”, 135th AES Convention, 2013.
  • [11] Z. Kurtisi, X. Gu and L. Wolf, “Enabling network-centric music performance in wide-area networks”. Communications of the ACM, 49 (11), 2006, pp. 52–54, DOI 10.1145/1167838.1167862
  • [12] J. Stachurski, “Embedded CELP with adaptive codebooks in enhancement layers and multi-layer gain optimization”, Proc. ICASSP 2009, pp.4133-4136, DOI 10.1109/ICASSP.2009.4960538
  • [13] ITU-T, “Recommendation G.728, Coding of speech at 16 kbit/s using low-delay code excited linear prediction”, 2012.
  • [14] F. K. Chen, G. M. Chen, B. K. Su and Y. R. Tsai, “Unified pulse replacement search algorithms for algebra codebooks of speech code”, IET Signal Proc., 2010, Vol. 4, Iss. 6, pp. 658-665, DOI 10.1049/iet-spr.2009.0216
  • [15] P. Dymarski, R. Romaniuk, “Sparse Signal Modeling in a Scalable CELP Coder”, Proc. 21st European Signal Processing Conf. EUSIPCO 2013, Marrakech, Morocco, We-P.1.1, ISBN 978-1-4799-3687-8
  • [16] P. Dymarski, R. Romaniuk, “Modified Sphere Decoding Algorithms and their applications to some sparse approximation problems”, Proc. 22nd European Signal Processing Conf. EUSIPCO 2014, Lisbon, DOI 10.5281/zenodo.43826
  • [17] R. Rose and T. Barnwell “The self-excited vocoder - an alternate approach to toll quality at 4800 bps”. IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP '86.
  • [18] P. Dymarski and N. Moreau, “Mixed excitation CELP Coder”. Proc. European Conference on Speech Communication and Technology (EUROSPEECH'89), Paris 1989
  • [19] ITU-T, “Recommendation G.723.1, Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s”, 2006.
  • [20] ITU-T, “Recommendation P.862: Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs”, 2001.
  • [21] K. Kim, “Wideband LD-CELP coder” – BS thesis WEiTI, Warsaw University of Technology, supervisor P. Dymarski, 2019
  • [22] G. Kim, “Wideband speech coding using CELP algorithm” – BS thesis WEiTI, Warsaw University of Technology, supervisor P. Dymarski, 2019
Document type
Bibliography
YADDA identifier
bwmeta1.element.baztech-31a53443-7a81-4972-8535-b2945e07bfec