In this paper a scheme of utilizing shape independent basis functions for the hierarchical multiresolution image compression is shown. For a given image texture region segmentation method is used. Following polygonal approximation of created segments causes a degradation of their boundaries. Using NURBS and Bezier interpolation and approximation segments' boundaries are created, thus achieving an image mask. As an input of the three-level hierarchical encoder this image mask and image are used. The image mask and image are subsampled by a factor of 2 on each level. The hierarchical encoder encodes them shape independently. Especially for a very low bit rate image coding gives better results for objective criteria (PSNR). For segment approximation the 2D shape independent orthogonal transform (DCT II) is used. Splines encoding and decoding is very efficient, because only their control points need to be stored. The segment is coded with a modified code similar to the JPEG code.
In this paper we present the objective video quality metric based on mutual information and Human Visual System. The calculation of proposed metric consists of two stages. In the first stage of quality evaluation whole original and test sequence are pre-processed by the Human Visual System. In the second stage we calculate mutual information which has been utilized as the quality evaluation criteria. The mutual information was calculated between the frame from original sequence and the corresponding frame from test sequence. For this testing purpose we choose Foreman video at CIF resolution. To prove reliability of our metric were compared it with some commonly used objective methods for measuring the video quality. The results show that presented objective video quality metric based on mutual information and Human Visual System provides relevant results in comparison with results of other objective methods so it is suitable candidate for measuring the video quality.
In this paper we present a novel two stage algorithm for improving video coding efficiency. The proposed method combines video cut detection and adaptive GOP structure. At first, we have proposed a new technique of frames' comparison for the shot cut detection. The majority of existing methods compare pairs of successive frames. We compare actual frame with its motion estimated prediction. We also present adaptive threshold. The efficiency of novel technique for video cut detection was confirmed through experiment and compared to the commonly used ones in the terms of recall and precision. The next step is to situate I frames to the positions of detected cuts during the process of video encoding. Finally the proposed method is verified by simulations and the obtained results are compared with fixed GOP structures of sizes 4, 8, 12, 16, 32, 64, 128 and GOP structure with length of entire video. Proposed method achieved the gain in bit rate from 15,33% to 50,59%, while not degrading PSNR in comparison to simulated fixed GOP structures.
This paper discusses the cued speech recognition methods in videoconference. Cued speech is a specific gesture language that is used for communication between deaf people. We define the criteria for sentence intelligibility according to answers of testing subjects (deaf people). In our tests we use 30 sample videos coded by H.264 codec with various bit-rates and various speed of cued speech. Additionally, we define the criteria for consonant sign recognizability in single-handed finger alphabet (dactyl) analogically to acoustics. We use another 12 sample videos coded by H.264 codec with various bit-rates in four different video formats. To interpret the results we apply the standard scale for subjective video quality evaluation and the percentual evaluation of intelligibility as in acoustics. From the results we construct the minimum coded bit-rate recommendations for every spatial resolution.
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.