Audio Compression using a Modified Vector Quantization algorithm for Mastering Applications

Prince, Shajin; D, Bini; Kirubaraj, Alfred A.; Immanuel, Samson J.; M, Surya

doi:10.24425/ijet.2023.144363

Artykuł - szczegóły

Tytuł artykułu

Audio Compression using a Modified Vector Quantization algorithm for Mastering Applications

Autorzy

Prince Shajin , D Bini , Kirubaraj Alfred A. , Immanuel Samson J. , M Surya

Treść / Zawartość

Pełne teksty:

Pobierz

Identyfikatory

DOI

10.24425/ijet.2023.144363

Warianty tytułu

Języki publikacji

Abstrakty

Audio data compression is used to reduce the transmission bandwidth and storage requirements of audio data. It is the second stage in the audio mastering process with audio equalization being the first stage. Compression algorithms such as BSAC, MP3 and AAC are used as standards in this paper. The challenge faced in audio compression is compressing the signal at low bit rates. The previous algorithms which work well at low bit rates cannot be dominant at higher bit rates and vice-versa. This paper proposes an altered form of vector quantization algorithm which produces a scalable bit stream which has a number of fine layers of audio fidelity. This modified form of the vector quantization algorithm is used to generate a perceptually audio coder which is scalable and uses the quantization and encoding stages which are responsible for the psychoacoustic and arithmetical terminations that are actually detached as practically all the data detached during the prediction phases at the encoder side is supplemented towards the audio signal at decoder stage. Therefore, clearly the quantization phase which is modified to produce a bit stream which is scalable. This modified algorithm works well at both lower and higher bit rates. Subjective evaluations were done by audio professionals using the MUSHRA test and the mean normalized scores at various bit rates was noted and compared with the previous algorithms.

Słowa kluczowe

vector quantization scalable perceptual coder audio mastering bit stream

Wydawca

Polish Academy of Sciences, Committee of Electronics and Telecommunication

Czasopismo

International Journal of Electronics and Telecommunications

Rocznik

2023

Tom

Vol. 69, No. 2

Strony

287--292

Opis fizyczny

Bibliogr. 20 poz., rys., tab., wykr.

Twórcy

autor

Prince Shajin

shajinprince@gmail.com

Karunya Institute of Technology and Sciences, Coimbatore, India

autor

D Bini

binivlsies@gmail.com

Karunya Institute of Technology and Sciences, Coimbatore, India

autor

Kirubaraj Alfred A.

alfred@karunya.edu

Karunya Institute of Technology and Sciences, Coimbatore, India

autor

Immanuel Samson J.

samsonimmanuel@karunya.edu

Karunya Institute of Technology and Sciences, Coimbatore, India

autor

M Surya

suryamp14@gmail.com

Karunya Institute of Technology and Sciences, Coimbatore, India, Roever Engineering College, Perambalur, India

Bibliografia

[1] Sinha D and C. Sundberg, “Unequal error protection (UEP) for perceptual audio coders,” IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP), 1999, pp. 2423-2326. https://doi.org/10.1109/ICASSP.1999.760616
[2] Mondal, U.K, “Achieving lossless compression of audio by encoding its constituted components (LCAEC),” Innovations Syst Softw Eng Vol 15, 2019, pp.75-85. https://doi.org/10.1007/s11334-018-0321-x
[3] Huang, H. Shu, and R. Yu, “Lossless Audio Compression in The New IEEE Standard for Advanced Audio Coding,” IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP), 2014, pp. 6934 - 6938. https://doi.org/10.1109/ICASSP.2014.6854944
[4] M. Sandler and D. Black, “Scalable audio coding for compression and loss resilient streaming,” IEEE Proceeding. -Visual. Image Signal Processing., Vol. 153, No. 3, 2006, pp. 331-339. https://doi.org/10.1049/ip-vis:20050054
[5] Srivatsan Kandadai & Charles D. Creusere, “Scalable Audio Compression at Low Bitrates,” IEEE Transactions on Audio, Speech, and Language Processing. Vol.16, No.5, 2008, pp. 969-979. https://doi.org/10.1109/TASL.2008.925881
[6] Pramila Srinivasan and Leah H. Jamieson, “High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling,” IEEE Transactions on Signal Processing, Vol. 46, No.4, 1998, pp.1085 - 1093. https://doi.org/0.1109/78.668558
[7] Manas Arora, Neha Maurya, “Audio Compression in MPEG Technology,” International Journal of Scientific and Research Publications. Vol.3, No.12, 2013, pp.1-4.
[8] D. Pan, “A tutorial on MPEG/audio compression,” IEEE Multimedia. Vol. 2, No.2, 1995, pp.60-74. https://doi.org/10.1109/93.388209
[9] Moreno-Alvarado R.G, Mauricio Martinez-Garcia, Mariko Nakano and Héctor M. Pérez, “DCT-Compressive Sampling of Multifrequency sparse audio signals,” IEEE Latin-America Conference on Communications, 2014. https://doi.org/10.1109/LATINCOM.2014.7041859
[10] Subbarao V. Wunnava, and Craig Chin, “Multilevel Data Compression Techniques for Transmission of Audio over Networks. Proceedings,” IEEE South east Conference, 2001, pp.234 - 238. https://doi.org/10.1109/SECON.2001.923121
[11] Florin Ghido, “An Asymptotically Optimal Predictor for Stereo Lossless Audio Compression,” Proceedings of the Data Compression Conference, 2003. https://doi.org/0.1109/DCC.2003.1194048
[12] Rongshan Yu and Chi Chung Ko, “Lossless Compression of Digital Audio Using Cascaded RLS-LMS Prediction.” IEEE Transactions on Audio, Speech, and Language Processing, Vol.11, No.6, 2003, pp.532 - 537. https://doi.org/10.1109/TSA.2003.818111
[13] Teddy Surya Gunawan, M. Khalif Mat Zain, Fathiah Abdul Muin and Mira Kartiwi, “Investigation of Lossless Audio Compression using IEEE 1857.2 Advanced Audio Coding,” Indonesian Journal of Electrical Engineering and Computer Science Vol.6, No.2, 2017, pp.422 - 430. https://doi.org/10.11591/ijeecs.v6.i2.pp422-430
[14] Anthony Griffin, Toni Hirvonen, Christos Tzagkarakis, Athanasios Mouchtaris and Panagiotis Tsakalides, “Single-Channel and Multi-Channel Sinusoidal Audio Coding Using Compressed Sensing,” IEEE Transactions on Audio, Speech, and Language Processing. Vol.19, No.5, 2010, pp.1382 - 1395. https://doi.org/10.1109/TASL.2010.2090656
[15] Rubem J. V. de Medeiros, Edmar C. Gurjão and Joâo M. de Carvalho, “Lossy Audio Compression Via Compressed Sensing. Proceedings of the Data Compression Conference, 2010. https://doi.org/10.1109/DCC.2010.88
[16] Duarte, M. Davenport, D. Takhar, J. Laska, T. Sun, K. Kelly, and R. Baraniuk, “Single- pixel imaging via compressive sampling,” IEEE Signal Processing Magazine. Vol.25, No.2, 2008, pp.83-91. https://doi.org/10.1109/MSP.2007.914730
[17] Larsen M.H, M. G. Christensen, and S. H. Jensen, “Variable dimension trellis-coded quantization of sinusoidal parameters,” IEEE Signal Processing Letters. Vol.15, 2008, pp.17-20. https://doi.org/10.1109/LSP.2007.910244
[18] Vafin R and W. B. Kleijn, “Entropy-constrained polar quantization and its application to audio coding,” IEEE Transactions on Audio, Speech, and Language Processing, Vol.13, No. 2, 2005, pp.220-232. https://doi.org/10.1109/TSA.2004.840942
[19] Cecchi, S.; Virgulti, M.; Primavera, A.; Piazza, F.; Bettarelli, F.; Li, J, “Investigation on audio algorithms architecture for stereo portable devices,” Journal of Audio Engineering Society, Vol.64, 2016, pp.175-188. https://doi.org/10.17743/jaes.2015.0084
[20] Creusere C, “Understanding perceptual distortion in MPEG scalable audio coding“. IEEE Transactions on Audio, Speech, and Language Processing, Vol.13, No.3, 2005, pp. 422-431. https://doi.org/10.1109/TSA.2005.845817

Uwagi

Opracowanie rekordu ze środków MEiN, umowa nr SONP/SP/546092/2022 w ramach programu "Społeczna odpowiedzialność nauki" - moduł: Popularyzacja nauki i promocja sportu (2022-2023).

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-c9bc6612-c16a-4c1a-ac35-d19a3d516250