應用於視頻上的紋理預測之演算法及硬體架構實作

Kuan-Yu Chen; 陳冠宇

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/24064

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.advisor	陳良基
dc.contributor.author	Kuan-Yu Chen	en
dc.contributor.author	陳冠宇	zh_TW
dc.date.accessioned	2021-06-08T05:15:08Z	-
dc.date.copyright	2011-08-22
dc.date.issued	2011
dc.date.submitted	2011-08-21
dc.identifier.citation	[1] D. M. A. Smolic, “Applications and requirements for 3DAV,” in ISO/IEC JTC1/SC29/WG11, N5877, July 2003. [2] T. K. Robert T. Collins, Omead Amid, “An active camera system for acquiring multi-view video,” in Image Processing, 2002. [3] H. P.Wojciech Matusik, “3D TV: a scalable system for real-time acquisition, transmission, and autostereoscopic display of dynamic scenes,” in ACM SIGGRAPH 2004 Papers, 2004. [4] T. Fujii, K. Mori, K. Takeda, K. Mase, M. Tanimoto, and Y. Suenaga, “Multipoint measuring system for video and sound - 100-camera and microphone system,” in Multimedia and Expo, 2006 IEEE Inter- national Conference on, pp. 437 –440, 2006. [5] M. Tanimoto, Image and Geometry Processing for 3-D Cinematogra- phy. Geometry and Computing, 2010. [6] “Introduction to multi-view video coding,” in ISO/IEC JTC1/SC 29/WG11, N7328, July 2005. [7] S. Shimizu, M. Kitahara, H. Kimata, K. Kamikura, and Y. Yashima, “View scalable multiview video coding using 3-D warping with depth map,” Circuits and Systems for Video Technology, IEEE Transactions on, vol. 17, pp. 1485 –1495, November 2007. 67 68 [8] I. Moccagatta, “Recent developments in video compression standards and their impact on implementation complexity: From scalable to multi-view video coding,” in Embedded Systems for Real Time Multi- media, Proceedings of the 2006 IEEE/ACM/IFIP Workshop on, pp. 4 –4, October 2006. [9] P. Merkle, A. Smolic, K. Muller, and T. Wiegand, “Multi-view video plus depth representation and coding,” in Image Processing, 2007. ICIP 2007. IEEE International Conference on, vol. 1, pp. I –201 –I –204, September 2007. [10] G. Ding, “A multi-view video coding method based on distributed source coding for free viewpoint switching,” in Intelligent Information Hiding and Multimedia Signal Processing, 2008. IIHMSP '08 Interna- tional Conference on, pp. 438 –441, 2008. [11] M. Tanimoto, “Free viewpoint television - FTV,” December 2004. [12] M. Tanimoto, “Overview of free viewpoint television,” vol. 21, pp. 454 –461, July 2006. [13] M. Tanimoto and T. Fujii, “FTV-free viewpoint television,” July 2002. [14] W. Li, J. Zhou, B. Li, and M. Sezan, “Virtual view specification and synthesis for free viewpoint television,” Circuits and Systems for Video Technology, IEEE Transactions on, vol. 19, pp. 533 –546, April 2009. [15] M. Tanimoto, “Overview of FTV (free-viewpoint television),” in Mul- timedia and Expo, 2009. ICME 2009. IEEE International Conference on, pp. 1552 –1553, June 2009. [16] A. Smolic, K. Mueller, P. Merkle, C. Fehn, P. Kauff, P. Eisert, and T. Wiegand, “3D video and free viewpoint video - technologies, appli69 cations and MPEG standards,” in Multimedia and Expo, 2006 IEEE International Conference on, pp. 2161 –2164, 2006. [17] M.-F. Group, “LDV virtual view rendering software,” in ISO/IEC JTC1/SC29/WG11 MPEG2008/M16040, February 2009. [18] “Video codec for audiovisual services at p x 64 Kbit/s,” in ITU-T Recommedation H.261, March 1993. [19] “Draft ITU-T recommendation and final draft international standard of joint video specification,” 2003. [20] T. Wiegand, G. Sullivan, G. Bjontegaard, and A. Luthra, “Overview of the H.264/AVC video coding standard,” vol. 13, pp. 560–576, July 2003. [21] A. Joch, F. Kossentini, H. Schwarz, T. Wiegand, and G. Sullivan, “Performance comparison of video coding standards using Lagrangian coder control,” in Image Processing. 2002, vol. 2, pp. II–501 – II–504, 2002. [22] G. Laroche, I. Jung, and B. Pesquet-Popescu, “Intra coding with prediction mode information inference,” in Circuits and Systems for Video Technology, IEEE Transactions, pp. 1786 – 1796, December 2010. [23] M. M. Oliveira, B. Bowen, R. Mckenna, and Y. sung Chang, “Fast digital image inpainting,” in Visualization, Imaging and Image Processing (VIIP 2001), 2001. [24] A. Criminisi, P. Perez, and K. Toyama, “Region filling and object removal by exemplar-based image inpainting,” in Image Processing, IEEE Transactions, pp. 1200 – 1212, September 2004. [25] I. Drori, D. Cohen-Or, and H. Yeshurun, “Fragment-based image completion,” in ACM SIGGRAPH 2003, 2003. 70 [26] M. Bertalmio and G. Sapiro, “Image inpainting,” in SIGGRAPH '00, 2000. [27] G. Sullivan and T. Wiegand, “Rate-distortion optimization for video compression,” vol. 15, pp. 74 – 90, November 1998. [28] T.-C. Chen, S.-Y. Chien, Y.-W. Huang, C.-H. Tsai, C.-Y. Chen, T.- W. Chen, and L.-G. Chen, “Analysis and architecture design of an HDTV720p 30 frames/s H.264/AVC encoder,” in Circuits and Sys- tems for Video Technology, pp. 673–688, June 2006. [29] Y.-W. Huang, T.-C. Chen, C.-H. Tsai, C.-Y. Chen, T.-W. Chen, C.-S. Chen, C.-F. Shen, S.-Y. Ma, T.-C. Wang, B.-Y. Hsieh, H.-C. Fang, and L.-G. Chen, “A 1.3TOPS H.264/AVC single-chip encoder for HDTV applications,” in Solid-State Circuits Conference. Digest of Technical Papers. ISSCC, February 2005. [30] Z. Liu, Y. Song, M. Shao, S. Li, L. Li, S. Ishiwata, M. Nakagawa, and S. G. T. Ikenaga, “A 1.41W H.264/AVC real-time encoder SOC for HDTV1080P,” in VLSI Circuits, pp. 12–13, June 2007. [31] M.-. V. Group, “Joint multiview video model (JMVM) 1.0,” in Number ISO/IEC JTC1/SC29/WG11 N8244, July 2006. [32] K.-J. Oh, S. Yea, and Y.-S. Ho, “Hole filling method using depth based in-painting for view synthesis in free viewpoint television and 3-d video,” in Picture Coding Symposium. PCS 2009, May 2009. [33] P.-K. Tsung, P.-C. Lin, K.-Y. Chen, T.-D. Chuang, H.-J. Yang, S.- Y. Chien, L.-F. Ding, W.-Y. Chen, C.-C. Cheng, T.-C. Chen, and L.-G. Chen, “A 216fps 40962160p 3DTV set-top box SoC for freeviewpoint 3DTV applications,” in Solid-State Circuits Conference Di- gest of Technical Papers (ISSCC), February 2011. 71 [34] G. J. Sullivana and J. R. Ohm, “Recent developments in standardization of high efficiency video coding (hevc).,” in Proceeding SPIE, August 2010. [35] K. Ugur, K. Andersson, A. Fuldseth, G. Bjontegaard, L. P. Endresen, J. Lainema, A. Hallapuro, J. Ridge, D. Rusanovskyy, C. Zhang, A. Norkin, C. Priddle, T. Rusert, J. Samuelsson, R. Sjoberg, and Z.Wu, “High performance, low complexity video coding and the emerging HEVC standard,” vol. 20, pp. 1688 – 1697, December 2010. [36] S. Ma and C.-C. J. Kuo, “High-definition video coding with supermacroblocks,” in Proceeding SPIE, January 2007. [37] R. Tarek, “Intra prediction in HEVC and AVC,” April 2011. [38] Y. Lin, “JCTVC-E286: Report of improved intra prediction for positive directions in UDI,” in ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, March 2011. [39] X. Cao and Tsinghua, “JCTVC-E278: Report on short distance intra prediction method,” in ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, March 2011. [40] “JCTVC-D282: Adaptive intra smoothing,” in ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, March 2011. [41] Y. W. Huang, B. Y. Hsieh, T. C. Chen, and L. G. Chen, “Analysis, fast algorithm, and VLSI architecture design for H.264/AVC intra frame coder,” vol. 15, pp. 378 – 401, March 2005. [42] C. W. Ku, C. C. Cheng, G. S. Yu, M. C. Tsai, and T. S. Chang, “A high-definition H.264/AVC intra-frame codec IP for digital video and still camera applications,” vol. 16, pp. 917 – 928, August 2006. 72 [43] K. Suh, S. Park, and H. Cho, “An efficient hardware architecture of intra prediction and TQ/IQIT module for H.264 encoder,” vol. 27, pp. 511–524, October 2006. [44] L. F. Ding, “Multiview video coding: Algorithm, VLSI architecture, and syetem design,” in Ding's Doctor's thesis, January 2009. [45] Z. Wei and K. N. Ngan;, “A fast rate-distortion optimization algorithm for H.264/AVC,” in Acoustics, Speech and Signal Processing, 2007. ICASSP 2007, April 2007. [46] L. Liu and X. Zhuang;, “CABAC based bit estimation for fast H.264 RD optimization decision,” in Consumer Communications and Net- working Conference, January 2009.
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/24064	-
dc.description.abstract	Advanced video applications are the epochal impacts to the history of human visual perception system. The evolution of television technology grows toward more colorful and higher resolution with these applications. To pursue higher visual quality and more realistic visual perception, more and more advance video applications are developed such as high definition TV (HDTV), 3D device, Internet video streaming and free-viewpoint 3DTV/Virtual reality. In these advance video application, the main challenge is the massive data capacity and data loss. Therefore, the role of texture prediction becomes more and more significant. Many texture prediction methods have been proposed and is a well-developed technique, such as intra prediction in video coding, interpolation, inpainting, etc. In this thesis, the texture prediction algorithm and its hardware architecture design are applied in many advanced video application to further ameliorate the visual quality. First, the texture prediction is employed as the occlusion recovery method in virtual view synthesis in multi-view system. To fulfill the more realistic experience, multi-view video brings the viewers a three-dimensional and real perceptual vision by transmitting different video sequences simultaneously on the display. However, multi-view video format is not enough to support free viewpoint sequences since its samples of spatial dimension is finite. For this purpose, the virtual view synthesis algorithm is developed for rendering images seen from any virtual viewpoints by the finite source of images seen from some fixed viewpoints only. In virtual view synthesis, the occlusion region which is blocked by the objects in reference view deteriorates the visual quality. Therefore, we propose a single iterative hybrid motion and depth-oriented inpainting algorithm and its corresponding hardware architecture to retrieve the texture in occlusion region. The simulation result outperforms by both perceptual quality and the objective metric measure. Our hardware architecture reduces 93.3% of computation cycles and still maintains the quality by isophote line propagation and depth enhancement. Video technology contributes a lot in modern society and digitization of video further simplifies the processing, transmission and storage of video content. Without unlimited storage capacity and transmission rate, video coding is necessary. In the second part of the thesis, we introduce and analyze the newest video coding standard, high efficiency video coding. HEVC targets to further reduce the bit rate by 50% compared to the H.264/AVC, current state-of-art of video coding. With the bandwidth limitation and targeting higher resolution, texture prediction is required to reaching better coding efficiency. In this thesis, a texture prediction technique, intra plus inpainting mode, is proposed to further decrease the bit rate or lower the computation complexity. Based on the proposed algorithm and architecture, a worldwide first HEVC standard of intra prediction mode with the specification of Quad-HD 4096x2160 sequence with 30 fps is revealed.	en
dc.description.provenance	Made available in DSpace on 2021-06-08T05:15:08Z (GMT). No. of bitstreams: 1 ntu-100-R98921034-1.pdf: 2794590 bytes, checksum: cc314cb5325bb5545fed24294c437dd6 (MD5) Previous issue date: 2011	en
dc.description.tableofcontents	1 Introduction 1 1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 Free Viewpoint 3DTV and Texture Prediction . . . . . . . . 2 1.2.1 Multi-View Video System and Free-Viewpoint TV (FTV) System . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.2.2 Texture Prediction in Free-Viewpoint TV System . . 6 1.3 Video Coding and Texture Prediction . . . . . . . . . . . . . 6 1.4 State of the Art on Texture Prediction . . . . . . . . . . . . 8 1.4.1 Inpainting Algorithm . . . . . . . . . . . . . . . . . . 8 1.4.2 Intra Prediction in H.264/AVC and HEVC . . . . . . 10 1.5 Thesis Organization . . . . . . . . . . . . . . . . . . . . . . . 13 2 Proposed Algorithm of Single Iterative Hybrid Motion and Depth-Oriented Inpainting 15 2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 2.2 Proposed Virtual View Motion Vector Calibration . . . . . . 17 2.2.1 Problem Statement . . . . . . . . . . . . . . . . . . . 17 2.2.2 Proposed Algorithm . . . . . . . . . . . . . . . . . . 18 2.3 Proposed Depth-Oriented Inpainting . . . . . . . . . . . . . 21 2.3.1 Problem Statement . . . . . . . . . . . . . . . . . . . 21 2.3.2 Proposed Algorithm . . . . . . . . . . . . . . . . . . 21 2.4 Proposed Hybrid Motion/Depth-Oriented Inpainting Flow . 23 2.5 Experimental Result . . . . . . . . . . . . . . . . . . . . . . 25 2.6 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 3 Proposed Hardware Architecture Design of Depth-Oriented Inpainting 29 3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 3.2 Line Crack-Based Interpolation . . . . . . . . . . . . . . . . 30 3.2.1 Problem Statement . . . . . . . . . . . . . . . . . . . 30 3.2.2 Proposed algorithm of Line Crack-Based Interpolation 31 3.3 Fast Gradient Production . . . . . . . . . . . . . . . . . . . 33 3.3.1 Problem Statement . . . . . . . . . . . . . . . . . . . 33 3.3.2 Proposed algorithm of Fast Gradient Production . . . 34 3.4 Overall Hardware Architecture Flow of Depth-Aware Inpainting in Virtual View Synthesis . . . . . . . . . . . . . . . . . 35 3.5 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38 4 Analysis and Algorithm Design of Intra Prediction in HEVC 41 4.1 The Texture Prediction in HEVC - Intra Prediction . . . . . 42 4.1.1 Wide-Range Variable Block-Size Prediction . . . . . . 42 4.1.2 Angular Intra Prediction . . . . . . . . . . . . . . . . 44 4.1.3 Transformation in Intra Prediction . . . . . . . . . . 45 4.2 Anisotropic Inpainting in HEVC Intra Prediction . . . . . . 46 4.2.1 Problem Statement . . . . . . . . . . . . . . . . . . . 46 4.2.2 Proposed Algorithm . . . . . . . . . . . . . . . . . . 47 4.3 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 5 Analysis and Architecture Design of HEVC Intra Mode 53 5.1 Design Challenge . . . . . . . . . . . . . . . . . . . . . . . . 55 5.2 Analysis and Proposal of HEVC Intra Prediction of Hardware Architecture Design . . . . . . . . . . . . . . . . . . . . . . . 57 5.2.1 Hardware-Oriented Hybrid Open-Closed Loop Intra Prediction . . . . . . . . . . . . . . . . . . . . . . . . 57 5.2.2 Elimination of Less Dominative Prediction Unit Layer 58 5.2.3 Analysis of High Complexity Mode Decision (HCMD) with Coding Efficiency . . . . . . . . . . . . . . . . . 59 5.2.4 Overall Architecture Scheme . . . . . . . . . . . . . . 61 5.3 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 6 Conclusion 63 Bibliography 65
dc.language.iso	en
dc.subject	紋理預測	zh_TW
dc.subject	Texture Prediction	en
dc.title	應用於視頻上的紋理預測之演算法及硬體架構實作	zh_TW
dc.title	Algorithm and Architecture of Texture Prediction in Video Application	en
dc.type	Thesis
dc.date.schoolyear	99-2
dc.description.degree	碩士
dc.contributor.oralexamcommittee	賴永康,簡韶逸,楊家輝,蔡宗漢
dc.subject.keyword	紋理預測,	zh_TW
dc.subject.keyword	Texture Prediction,	en
dc.relation.page	70
dc.rights.note	未授權
dc.date.accepted	2011-08-21
dc.contributor.author-college	電機資訊學院	zh_TW
dc.contributor.author-dept	電機工程學研究所	zh_TW
顯示於系所單位：	電機工程學系

文件中的檔案：

檔案	大小	格式
ntu-100-1.pdf 未授權公開取用	2.73 MB	Adobe PDF

顯示文件簡單紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。