對可伸展式視訊編碼之最佳碼率分配

Guan-Ju Peng; 彭冠舉

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/6968

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.advisor	陳少傑(Sao-Jie Chen)
dc.contributor.author	Guan-Ju Peng	en
dc.contributor.author	彭冠舉	zh_TW
dc.date.accessioned	2021-05-17T09:22:35Z	-
dc.date.available	2017-02-16
dc.date.available	2021-05-17T09:22:35Z	-
dc.date.copyright	2012-02-16
dc.date.issued	2012
dc.date.submitted	2012-01-30
dc.identifier.citation	REFERENCE [1] P. Steenkiste, “Adaptation models for network-aware disributed computation,” in Workshop on Communication, Architecture, and Applicatios for Network-based Parallel Computing, January 1999, pp. 16–31. [2] B. Tierney, D. Gunter, J. Lee, M. Stoufer, and J. B. Evans, “Enabling networkaware applications,” in IEEE International Symposium on High Performance Distributed Computing, August 2001, pp. 281–288. [3] K. Ramchandran, A. Ortega, and M. Vetterli, “Bit allocation for dependent quantization with applications to multiresolution and MPEG video coders,” IEEE Transactions on Image Processing, vol. 3, no. 5, pp. 533–545, March 1994. [4] H. Schwarz and T. Wiegand, “R-D optimized multi-layer encoder control for SVC,” in IEEE International Conference on Image Processing, Sepetember 2007, pp. 281–284. [5] A. Das, A. Hazra, and S. Banerjee, “An efficient architecture for 3-d discrete wavelet transform,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 2, pp. 286 –296, Feburary 2010. [6] W. van der Laan, A. Jalba, and J. Roerdink, Accelerating wavelet lifting on graphics hardware using cuda,” IEEE Transactions on Parallel and Distributed Systems, vol. 22, no. 1, pp. 132 –146, Janurary 2011. [7] L. Zhang, D.Wang, and A. Vincent, “Decoupled 3-d zerotree structure for waveletbased video coding,” IEEE Transactions on Broadcasting, vol. 54, no. 3, pp. 430 –436, September 2008. [8] M.-P. Kao and T. Nguyen, “Rate-distortion optimized bitstream extractor for motion scalability in wavelet-based scalable video coding,” IEEE Transactions on Image Processing, vol. 19, no. 5, pp. 1214 –1223, May 2010. [9] M. Cagnazzo, F. Castaldo, T. Andre, M. Antonini, and M. Barlaud, “Optimal motion estimation for wavelet motion compensated video coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 7, pp. 907–911, July 2007. [10] T. Wiegand, G. J. Sullivan, G. Bjontegaaard, and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, no. 7, pp. 560–576, July 2003. [11] H. Schwarz, D. Marpe, and T. Wiegand, “Overview of the scalable video coding extension of the H.264/AVC standard,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 9, pp. 1103–1120, September 2007. [12] S.-T. Hsiang and J. Woods, “Embedded image coding using zeroblocks of subband/wavelet coefficients and context modeling,” in IEEE International Symposium on Circuits and Systems, vol. 3, May 2000, pp. 662–665. [13] P. Chen and J. W. Woods, “Bidirectional mc-ezbc with lifting implementation,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 14, no. 10, pp. 1183–1194, October 2004. [14] J.-R. Ohm, “Three-dimensional subband coding with motion compensation,” IEEE Transactions on Image Processing, vol. 3, no. 5, pp. 559–571, September 1994. [15] C.-C. Cheng, G.-J. Peng, and W.-L. Hwang, “Subband weighting with pixel connectivity for 3-D wavelet coding,” IEEE Transactions on Image Processing, vol. 18, no. 1, pp. 52–62, January 2009. [16] M. Wien, H. Schwarz, and T. Oelbaum, “Performance analysis of SVC,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 12, pp.1771–1771, December 2007. [17] G. J. Sullivan and T. Wiegand, “Rate-distortion optimization for video compression,” IEEE Signal Processing Magazine, vol. 15, no. 6, pp. 74–90, November 1998. [18] M. Koziri and A. Eleftheriadis, “Joint quantizer optimization for scalable coding,” in IEEE International Conference on Image Processing, Sepetember 2010, pp.1281 –1284. [19] J. Reichel, H. Schwarz, and M.Wien, “Joint Scalable Video Model 11 (jsvm 11),” Joint Video Team, Doc, JVT-X202, July 2007. [20] Q. Zhang, Q. Guo, Q. Ni,W. Zhu, and Y.-Q. Zhang, “Sender-adaptive and receiver driven layered multicast for scalable video over the internet,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, no. 4, pp. 482–495, April 2005. [21] B. Girod and S. Han, “Optimum update for motion-compensated lifting,” IEEE Signal Processing Letters, vol. 12, no. 2, pp. 150 – 153, Feburary 2005. [22] J.-R. Ohm, M. V. der Schaar, and J. W. Woods, “Interframe wavelet coding—Motion picture representation for universal scalability,” Signal Processing Image Communication, vol. 19, no. 9, pp. 877–908, October 2004. [23] B. Usevitch, “Optimal bit allocation for biorthogonal wavelet coding,” in Data Compression Conference, March-April 1996, pp. 387–395. [24] D. S. Taubman, “High performance scalable image compression with EBCOT,” IEEE Transactions on Image Processing, vol. 9, no. 7, pp. 1158–1170, Sepetmber 2000. [25] L. Luo, J. Li, S. Li, Z. Zhuang, and Y.-Q. Zhang, “Motion compensated lifting wavelet and its application in video coding,” in IEEE International Conference on Multimedia and Expo, August 2001, pp. 365 – 368. [26] Z. He and S. K. Mitra, “Optimum bit allocation and accurate rate control for video coding via rho domain source modeling,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 12, no. 10, pp. 840–849, October 2002. [27] H. Li, Z. Li, and C. Wen, “Fast mode decision algorithm for inter-frame coding in fully scalable video coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, no. 7, pp. 889 –895, July 2006. [28] L. Cz’uni, G. Cs’asz’ar, and A. Lics’ar, “Estimating the optimal quantization parameter in H.264,” in IEEE International Conference on Pattern Recognition, August 2006, pp. 330–333. [29] T. Wiegrand, H. Schwarz, A. Joch, F.Kossentini, and G. J. Sullivan, “Rate-constrained coder control and comparison of video coding standards,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 13, no. 7, pp. 688–703, July 2003.
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/6968	-
dc.description.abstract	我們在可伸縮視頻編碼中考慮了每個使用者對於不同解析度的偏好。並且根據這些用戶的偏好，我們制定並解決基於小波變換的可伸縮視頻編碼和 H.264/SVC 的位元分配問題。首先我們考慮基於小波變換的視頻編碼器的位元分配方法。我們提出了三種方法來解決位元分配的問題。首先是使用拉格朗日的高效率方法來解決優化問題上限，其二是效率較低的動態規劃法，但其可以得到問題的最佳解。這兩種方法都需要先了解用戶的喜好。對於未知用戶喜好的情況下，我們用最小化最大失真法來解決問題。我們發現，最糟糕的表現往往發生在所有的用戶都訂閱相同的解析度的時候。因此，最小化最大失真法與小波編解碼器傳統的位元分配方法相同。根據我們進行多次的實驗，這些實驗分別考量了各種用戶的偏好，結果表明，了解用戶的喜好顯著提高的可伸縮視頻編解碼器的編碼性能。H.264/SVC 的位元分配問題要複雜得多，我們必須了解並分析H.264/SVC 多層編碼結構造成的失真。在這篇論文中，我們具體地分析了用於實現時間 (temporal)，空間(spatial)和質量(quality)的可伸縮視頻編碼（SVC）的編碼結構，並且根據分析的結果，提出了兩個對於失真-碼率線（R-D Curve）的優化算法：其一是已知用戶偏好的最優化演算法，另一個是最小化最大失真法。與目前最先進的位元分配法相較，當用戶的偏好都是已知的時候，我們的演算法在壓縮效率上有著顯著的改進。這篇論文中，我們對可伸縮視頻編碼提出了用戶偏好的概念，並在兩個最常見的可伸縮性視頻編碼方法中解決相應的位元分配問題。它們分別是，MCTF-EZBC 基於小波編碼器和H.264/SVC。在比較已知使用者偏好方法與未知使用者偏好方法的效能之後，我們亦驗證了可伸縮視頻編碼中用戶偏好的重要性。	zh_TW
dc.description.abstract	The scalable video coding problem is investigated, and based on the preferred resolution, the bit allocation problems of wavelet-based scalable video coding and H.264/SVC are formulated and solved. For the wavelet-based video encoder, three methods are proposed. The first is an efficient Lagrangian-based method that solves the upper bound of the problem optimally, and the second is a less efficient dynamic programming method that solves the problem optimally. Both methods require knowledge of the user preference on resolution. For the case where the user preference is unknown, we solve the problem by a min-max approach. Our objective is to find the bit allocation solution that maximizes the worst possible performance. We show that the worst performance occurs when all users subscribe to the same spatial, temporal, and quality resolutions. Thus, the min-max solution is exactly the same as the traditional bit allocation method for a non-scalable wavelet codec. We conduct several experiments on the 2D+t MCTF-EZBC wavelet codec with respect to various subscriber preferences. The results demonstrate that knowing the user preferences improves the coding performance of the scalable video codec significantly. For the rate allocation problem of H.264/SVC, we present a theoretical analysis of the distortion in multiple layer coding structures. Specifically, we analyze the prediction structure used to achieve temporal, spatial, and quality scalabilities in a scalable video coding (SVC), and show that the average peak-signal-to-noise (PSNR) of SVC is a weighted combination of the bit rates assigned to all the streams. We propose two rate-distortion (R-D) optimization algorithms: one employs the known user preference, and the other is based on the min-max approach which assumes the least favorable prior of the user preference. We compare the performance of our algorithms with that of a state-of-the-art scalable bit allocation algorithm and demonstrate that they outperform the compared approach when the user preference is known to both coders. In this Dissertation, we propose the concept of the user preference in the scalable video coding, and solve the corresponding rate allocation problems for the two most prevalent scalable video coding methods, which are the MCTF-EZBC wavelet based encoder and H.264/SVC. After comparing the coding gains of the methods with complete preference information over those with incomplete preference information, we verify the importance of the user preference in the scalable video coding.	en
dc.description.provenance	Made available in DSpace on 2021-05-17T09:22:35Z (GMT). No. of bitstreams: 1 ntu-101-D95943035-1.pdf: 2833993 bytes, checksum: a0c1567621be24faab5a7915e37144ac (MD5) Previous issue date: 2012	en
dc.description.tableofcontents	TABLE OF CONTENTS ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . i LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . .v LIST OF FIGURES TABLES . . . . . . . . . . . . . . . . . vii 1 INTRODUCTION . . . . . . . . . . . . . . . . . . . . . . 1 2 SVC’S PERFORMANCE AND SUBSCRIBER PREFERENCES . . . . . . 7 2.1 Video Broadcasting System and Performance Metrics of SVC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .7 2.2 Comparison of Wavelet Based Codec and H.264/SVC . . . 10 3 RATE ALLOCATION FOR WAVELET BASED SVC . . . . . . . . . 13 3.1 MCTF-based 2D+t Wavelet Codec . . . . . . . . . . . . 13 3.1.1 Spatial Temporal Subband Weighting . . . . . . . . .16 3.2 Formulation of the Rate-Distortion Function . . . . . 17 3.3 Solving Rate-Allocation with Known Preferences . . . .19 3.3.1 Lagrangian-based Solution . . . . . . . . . . . . . 20 3.3.2 Optimal Solution Based on Dynamic Programming . . . 24 3.4 Min-Max Approach for Unknown Preferences . . . . . . .29 3.5 Experiment Results . . . . . . . . . . . . . . . . . .31 4 RATE ALLOCATION FOR H.264/SVC . . . . . . . . . . . . . 41 4.1 Rate-Allocation Problem for H.264/SVC . . . . . . . . 41 4.1.1 Rate-Distortion Model . . . . . . . . . . . . . . . 41 4.1.2 Layer Dependency and Sequence of Approximations . . 42 4.2 Prediction Residuals and Distortion Propagation . . . 44 4.2.1 Prediction Residuals . . . . . . . . . . . . . . . .44 4.2.2 Distortion Propagation in the Predictions . . . . . 49 4.3 Distortion of a Layer . . . . . . . . . . . . . . . . 54 4.3.1 Prediction Error Propagation in a Temporal Level . .55 4.3.2 Exploring Error Propagation . . . . . . . . . . . . 56 4.4 Average Distortion of SVC . . . . . . . . . . . . . . 57 4.5 Solving the Inter-Layer Rate Allocation Problem . . . 59 4.5.1 Optimal Bit Allocation with Fixed Weights . . . . . 60 4.5.2 Optimal Bit Allocation Algorithm . . . . . . . . . .62 4.5.3 Min-max Approach for Incomplete Preference Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63 4.6 Implementation Issues and Experimental Results . . . .65 4.6.1 Coding Structure and Implementation Details . . . . 65 4.6.2 Variance Approximation . . . . . . . . . . . . . . .66 4.6.3 Performance Comparison . . . . . . . . . . . . . . .69 5 CONCLUSION . . . . . . . . . . . . . . . . . . . . . . .77 APPENDICES . . . . . . . . . . . . . . . . . . . . . . . .79 A.1 Preference Settings in the Third Experiment of the Wavelet-Based Codec . . . . . . . . . . . . . . . . . . . 79 A.2 Variance Approximation . . . . . . . . . . . . . . . .83 A.3 The Two-Stream Relation of Temporal Prediction at a Low Bit Rate . . . . . . . . . . . . . . . . . . . . . . . . .86 A.4 The Two-Stream Relation of Spatial Prediction at a Low Bit Rate . . . . . . . . . . . . . . . . . . . . . . . . .88 A.5 The Two-Stream Relation of Quality Prediction at a Low Bit Rate . . . . . . . . . . . . . . . . . . . . . . . . .90 A.6 Supplementary Results of the Rate Allocation Methods for H.264/SVC . . . . . . . . . . . . . . . . . . . . . . . . 92 REFERENCE . . . . . . . . . . . . . . . . . . . . . . . .102
dc.language.iso	en
dc.title	對可伸展式視訊編碼之最佳碼率分配	zh_TW
dc.title	Optimal Rate Allocation for Scalable Video Coding	en
dc.type	Thesis
dc.date.schoolyear	100-1
dc.description.degree	博士
dc.contributor.coadvisor	黃文良(Wen-Liang Hwang)
dc.contributor.oralexamcommittee	貝蘇章(Soo-Chang Pei),何建明(Jan-Ming Ho),簡韶逸(Shao-Yi Chien),傅楸善(Chiou-Shann Fuh)
dc.subject.keyword	視訊壓縮,可伸縮式視訊編碼,	zh_TW
dc.subject.keyword	Video Coding,Scalable Video Coding,	en
dc.relation.page	105
dc.rights.note	同意授權(全球公開)
dc.date.accepted	2012-01-30
dc.contributor.author-college	電機資訊學院	zh_TW
dc.contributor.author-dept	電子工程學研究所	zh_TW
顯示於系所單位：	電子工程學研究所

文件中的檔案：

檔案	大小	格式
ntu-101-1.pdf	2.77 MB	Adobe PDF	檢視/開啟

顯示文件簡單紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。