Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 電機工程學系
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/24064
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor陳良基
dc.contributor.authorKuan-Yu Chenen
dc.contributor.author陳冠宇zh_TW
dc.date.accessioned2021-06-08T05:15:08Z-
dc.date.copyright2011-08-22
dc.date.issued2011
dc.date.submitted2011-08-21
dc.identifier.citation[1] D. M. A. Smolic, “Applications and requirements for 3DAV,” in
ISO/IEC JTC1/SC29/WG11, N5877, July 2003.
[2] T. K. Robert T. Collins, Omead Amid, “An active camera system for
acquiring multi-view video,” in Image Processing, 2002.
[3] H. P.Wojciech Matusik, “3D TV: a scalable system for real-time acquisition,
transmission, and autostereoscopic display of dynamic scenes,”
in ACM SIGGRAPH 2004 Papers, 2004.
[4] T. Fujii, K. Mori, K. Takeda, K. Mase, M. Tanimoto, and Y. Suenaga,
“Multipoint measuring system for video and sound - 100-camera
and microphone system,” in Multimedia and Expo, 2006 IEEE Inter-
national Conference on, pp. 437 –440, 2006.
[5] M. Tanimoto, Image and Geometry Processing for 3-D Cinematogra-
phy. Geometry and Computing, 2010.
[6] “Introduction to multi-view video coding,” in ISO/IEC JTC1/SC
29/WG11, N7328, July 2005.
[7] S. Shimizu, M. Kitahara, H. Kimata, K. Kamikura, and Y. Yashima,
“View scalable multiview video coding using 3-D warping with depth
map,” Circuits and Systems for Video Technology, IEEE Transactions
on, vol. 17, pp. 1485 –1495, November 2007.
67
68
[8] I. Moccagatta, “Recent developments in video compression standards
and their impact on implementation complexity: From scalable to
multi-view video coding,” in Embedded Systems for Real Time Multi-
media, Proceedings of the 2006 IEEE/ACM/IFIP Workshop on, pp. 4
–4, October 2006.
[9] P. Merkle, A. Smolic, K. Muller, and T. Wiegand, “Multi-view video
plus depth representation and coding,” in Image Processing, 2007.
ICIP 2007. IEEE International Conference on, vol. 1, pp. I –201 –I
–204, September 2007.
[10] G. Ding, “A multi-view video coding method based on distributed
source coding for free viewpoint switching,” in Intelligent Information
Hiding and Multimedia Signal Processing, 2008. IIHMSP '08 Interna-
tional Conference on, pp. 438 –441, 2008.
[11] M. Tanimoto, “Free viewpoint television - FTV,” December 2004.
[12] M. Tanimoto, “Overview of free viewpoint television,” vol. 21, pp. 454
–461, July 2006.
[13] M. Tanimoto and T. Fujii, “FTV-free viewpoint television,” July 2002.
[14] W. Li, J. Zhou, B. Li, and M. Sezan, “Virtual view specification and
synthesis for free viewpoint television,” Circuits and Systems for Video
Technology, IEEE Transactions on, vol. 19, pp. 533 –546, April 2009.
[15] M. Tanimoto, “Overview of FTV (free-viewpoint television),” in Mul-
timedia and Expo, 2009. ICME 2009. IEEE International Conference
on, pp. 1552 –1553, June 2009.
[16] A. Smolic, K. Mueller, P. Merkle, C. Fehn, P. Kauff, P. Eisert, and
T. Wiegand, “3D video and free viewpoint video - technologies, appli69
cations and MPEG standards,” in Multimedia and Expo, 2006 IEEE
International Conference on, pp. 2161 –2164, 2006.
[17] M.-F. Group, “LDV virtual view rendering software,” in ISO/IEC
JTC1/SC29/WG11 MPEG2008/M16040, February 2009.
[18] “Video codec for audiovisual services at p x 64 Kbit/s,” in ITU-T
Recommedation H.261, March 1993.
[19] “Draft ITU-T recommendation and final draft international standard
of joint video specification,” 2003.
[20] T. Wiegand, G. Sullivan, G. Bjontegaard, and A. Luthra, “Overview
of the H.264/AVC video coding standard,” vol. 13, pp. 560–576, July
2003.
[21] A. Joch, F. Kossentini, H. Schwarz, T. Wiegand, and G. Sullivan, “Performance
comparison of video coding standards using Lagrangian coder
control,” in Image Processing. 2002, vol. 2, pp. II–501 – II–504, 2002.
[22] G. Laroche, I. Jung, and B. Pesquet-Popescu, “Intra coding with prediction
mode information inference,” in Circuits and Systems for Video
Technology, IEEE Transactions, pp. 1786 – 1796, December 2010.
[23] M. M. Oliveira, B. Bowen, R. Mckenna, and Y. sung Chang, “Fast digital
image inpainting,” in Visualization, Imaging and Image Processing
(VIIP 2001), 2001.
[24] A. Criminisi, P. Perez, and K. Toyama, “Region filling and object
removal by exemplar-based image inpainting,” in Image Processing,
IEEE Transactions, pp. 1200 – 1212, September 2004.
[25] I. Drori, D. Cohen-Or, and H. Yeshurun, “Fragment-based image completion,”
in ACM SIGGRAPH 2003, 2003.
70
[26] M. Bertalmio and G. Sapiro, “Image inpainting,” in SIGGRAPH '00,
2000.
[27] G. Sullivan and T. Wiegand, “Rate-distortion optimization for video
compression,” vol. 15, pp. 74 – 90, November 1998.
[28] T.-C. Chen, S.-Y. Chien, Y.-W. Huang, C.-H. Tsai, C.-Y. Chen, T.-
W. Chen, and L.-G. Chen, “Analysis and architecture design of an
HDTV720p 30 frames/s H.264/AVC encoder,” in Circuits and Sys-
tems for Video Technology, pp. 673–688, June 2006.
[29] Y.-W. Huang, T.-C. Chen, C.-H. Tsai, C.-Y. Chen, T.-W. Chen, C.-S.
Chen, C.-F. Shen, S.-Y. Ma, T.-C. Wang, B.-Y. Hsieh, H.-C. Fang,
and L.-G. Chen, “A 1.3TOPS H.264/AVC single-chip encoder for
HDTV applications,” in Solid-State Circuits Conference. Digest of
Technical Papers. ISSCC, February 2005.
[30] Z. Liu, Y. Song, M. Shao, S. Li, L. Li, S. Ishiwata, M. Nakagawa, and
S. G. T. Ikenaga, “A 1.41W H.264/AVC real-time encoder SOC for
HDTV1080P,” in VLSI Circuits, pp. 12–13, June 2007.
[31] M.-. V. Group, “Joint multiview video model (JMVM) 1.0,” in Number
ISO/IEC JTC1/SC29/WG11 N8244, July 2006.
[32] K.-J. Oh, S. Yea, and Y.-S. Ho, “Hole filling method using depth
based in-painting for view synthesis in free viewpoint television and
3-d video,” in Picture Coding Symposium. PCS 2009, May 2009.
[33] P.-K. Tsung, P.-C. Lin, K.-Y. Chen, T.-D. Chuang, H.-J. Yang, S.-
Y. Chien, L.-F. Ding, W.-Y. Chen, C.-C. Cheng, T.-C. Chen, and
L.-G. Chen, “A 216fps 40962160p 3DTV set-top box SoC for freeviewpoint
3DTV applications,” in Solid-State Circuits Conference Di-
gest of Technical Papers (ISSCC), February 2011.
71
[34] G. J. Sullivana and J. R. Ohm, “Recent developments in standardization
of high efficiency video coding (hevc).,” in Proceeding SPIE,
August 2010.
[35] K. Ugur, K. Andersson, A. Fuldseth, G. Bjontegaard, L. P. Endresen,
J. Lainema, A. Hallapuro, J. Ridge, D. Rusanovskyy, C. Zhang,
A. Norkin, C. Priddle, T. Rusert, J. Samuelsson, R. Sjoberg, and Z.Wu,
“High performance, low complexity video coding and the emerging
HEVC standard,” vol. 20, pp. 1688 – 1697, December 2010.
[36] S. Ma and C.-C. J. Kuo, “High-definition video coding with supermacroblocks,”
in Proceeding SPIE, January 2007.
[37] R. Tarek, “Intra prediction in HEVC and AVC,” April 2011.
[38] Y. Lin, “JCTVC-E286: Report of improved intra prediction for
positive directions in UDI,” in ITU-T SG16 WP3 and ISO/IEC
JTC1/SC29/WG11, March 2011.
[39] X. Cao and Tsinghua, “JCTVC-E278: Report on short distance
intra prediction method,” in ITU-T SG16 WP3 and ISO/IEC
JTC1/SC29/WG11, March 2011.
[40] “JCTVC-D282: Adaptive intra smoothing,” in ITU-T SG16 WP3
and ISO/IEC JTC1/SC29/WG11, March 2011.
[41] Y. W. Huang, B. Y. Hsieh, T. C. Chen, and L. G. Chen, “Analysis,
fast algorithm, and VLSI architecture design for H.264/AVC intra
frame coder,” vol. 15, pp. 378 – 401, March 2005.
[42] C. W. Ku, C. C. Cheng, G. S. Yu, M. C. Tsai, and T. S. Chang, “A
high-definition H.264/AVC intra-frame codec IP for digital video and
still camera applications,” vol. 16, pp. 917 – 928, August 2006.
72
[43] K. Suh, S. Park, and H. Cho, “An efficient hardware architecture of
intra prediction and TQ/IQIT module for H.264 encoder,” vol. 27,
pp. 511–524, October 2006.
[44] L. F. Ding, “Multiview video coding: Algorithm, VLSI architecture,
and syetem design,” in Ding's Doctor's thesis, January 2009.
[45] Z. Wei and K. N. Ngan;, “A fast rate-distortion optimization algorithm
for H.264/AVC,” in Acoustics, Speech and Signal Processing, 2007.
ICASSP 2007, April 2007.
[46] L. Liu and X. Zhuang;, “CABAC based bit estimation for fast H.264
RD optimization decision,” in Consumer Communications and Net-
working Conference, January 2009.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/24064-
dc.description.abstractAdvanced video applications are the epochal impacts to the history of human visual perception system. The evolution of television technology grows toward more colorful and higher resolution with these applications. To pursue higher visual quality and more realistic visual perception, more and more advance video applications are developed such as high definition TV (HDTV), 3D device, Internet video streaming and free-viewpoint 3DTV/Virtual reality. In these advance video application, the main challenge is the massive data capacity and data loss. Therefore, the role of texture prediction becomes more and more significant. Many texture prediction methods have been proposed and is a well-developed technique, such as intra prediction in video coding, interpolation, inpainting, etc. In this thesis, the texture prediction algorithm and its hardware architecture design are applied in many advanced video application to further ameliorate the visual quality.
First, the texture prediction is employed as the occlusion recovery method in virtual view synthesis in multi-view system. To fulfill the more realistic experience, multi-view video brings the viewers a three-dimensional and real perceptual vision by transmitting different video sequences simultaneously on the display. However, multi-view video format is not enough to support free viewpoint sequences since its samples of spatial dimension is finite. For this purpose, the virtual view synthesis algorithm is developed for rendering images seen from any virtual viewpoints by the finite source of images seen from some fixed viewpoints only. In virtual view synthesis, the occlusion region which is blocked by the objects in reference view deteriorates the visual quality. Therefore, we propose a single iterative hybrid motion and depth-oriented inpainting algorithm and its corresponding hardware architecture to retrieve the texture in occlusion region. The simulation result outperforms by both perceptual quality and the objective metric measure. Our hardware architecture reduces 93.3% of computation cycles and still maintains the quality by isophote line propagation and depth enhancement.
Video technology contributes a lot in modern society and digitization of video further simplifies the processing, transmission and storage of video content. Without unlimited storage capacity and transmission rate, video coding is necessary. In the second part of the thesis, we introduce and analyze the newest video coding standard, high efficiency video coding. HEVC targets to further reduce the bit rate by 50% compared to the H.264/AVC, current state-of-art of video coding. With the bandwidth limitation and targeting higher resolution, texture prediction is required to reaching better coding efficiency. In this thesis, a texture prediction technique, intra plus inpainting mode, is proposed to further decrease the bit rate or lower the computation complexity. Based on the proposed algorithm and architecture, a worldwide first HEVC standard of intra prediction mode with the specification of Quad-HD 4096x2160 sequence with 30 fps is revealed.
en
dc.description.provenanceMade available in DSpace on 2021-06-08T05:15:08Z (GMT). No. of bitstreams: 1
ntu-100-R98921034-1.pdf: 2794590 bytes, checksum: cc314cb5325bb5545fed24294c437dd6 (MD5)
Previous issue date: 2011
en
dc.description.tableofcontents1 Introduction 1
1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Free Viewpoint 3DTV and Texture Prediction . . . . . . . . 2
1.2.1 Multi-View Video System and Free-Viewpoint TV (FTV)
System . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2.2 Texture Prediction in Free-Viewpoint TV System . . 6
1.3 Video Coding and Texture Prediction . . . . . . . . . . . . . 6
1.4 State of the Art on Texture Prediction . . . . . . . . . . . . 8
1.4.1 Inpainting Algorithm . . . . . . . . . . . . . . . . . . 8
1.4.2 Intra Prediction in H.264/AVC and HEVC . . . . . . 10
1.5 Thesis Organization . . . . . . . . . . . . . . . . . . . . . . . 13
2 Proposed Algorithm of Single Iterative Hybrid Motion and
Depth-Oriented Inpainting 15
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2.2 Proposed Virtual View Motion Vector Calibration . . . . . . 17
2.2.1 Problem Statement . . . . . . . . . . . . . . . . . . . 17
2.2.2 Proposed Algorithm . . . . . . . . . . . . . . . . . . 18
2.3 Proposed Depth-Oriented Inpainting . . . . . . . . . . . . . 21
2.3.1 Problem Statement . . . . . . . . . . . . . . . . . . . 21
2.3.2 Proposed Algorithm . . . . . . . . . . . . . . . . . . 21
2.4 Proposed Hybrid Motion/Depth-Oriented Inpainting Flow . 23
2.5 Experimental Result . . . . . . . . . . . . . . . . . . . . . . 25
2.6 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
3 Proposed Hardware Architecture Design of Depth-Oriented
Inpainting 29
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
3.2 Line Crack-Based Interpolation . . . . . . . . . . . . . . . . 30
3.2.1 Problem Statement . . . . . . . . . . . . . . . . . . . 30
3.2.2 Proposed algorithm of Line Crack-Based Interpolation 31
3.3 Fast Gradient Production . . . . . . . . . . . . . . . . . . . 33
3.3.1 Problem Statement . . . . . . . . . . . . . . . . . . . 33
3.3.2 Proposed algorithm of Fast Gradient Production . . . 34
3.4 Overall Hardware Architecture Flow of Depth-Aware Inpainting
in Virtual View Synthesis . . . . . . . . . . . . . . . . . 35
3.5 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
4 Analysis and Algorithm Design of Intra Prediction in HEVC 41
4.1 The Texture Prediction in HEVC - Intra Prediction . . . . . 42
4.1.1 Wide-Range Variable Block-Size Prediction . . . . . . 42
4.1.2 Angular Intra Prediction . . . . . . . . . . . . . . . . 44
4.1.3 Transformation in Intra Prediction . . . . . . . . . . 45
4.2 Anisotropic Inpainting in HEVC Intra Prediction . . . . . . 46
4.2.1 Problem Statement . . . . . . . . . . . . . . . . . . . 46
4.2.2 Proposed Algorithm . . . . . . . . . . . . . . . . . . 47
4.3 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
5 Analysis and Architecture Design of HEVC Intra Mode 53
5.1 Design Challenge . . . . . . . . . . . . . . . . . . . . . . . . 55
5.2 Analysis and Proposal of HEVC Intra Prediction of Hardware
Architecture Design . . . . . . . . . . . . . . . . . . . . . . . 57
5.2.1 Hardware-Oriented Hybrid Open-Closed Loop Intra
Prediction . . . . . . . . . . . . . . . . . . . . . . . . 57
5.2.2 Elimination of Less Dominative Prediction Unit Layer 58
5.2.3 Analysis of High Complexity Mode Decision (HCMD)
with Coding Efficiency . . . . . . . . . . . . . . . . . 59
5.2.4 Overall Architecture Scheme . . . . . . . . . . . . . . 61
5.3 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
6 Conclusion 63
Bibliography 65
dc.language.isoen
dc.subject紋理預測zh_TW
dc.subjectTexture Predictionen
dc.title應用於視頻上的紋理預測之演算法及硬體架構實作zh_TW
dc.titleAlgorithm and Architecture of Texture Prediction in Video Applicationen
dc.typeThesis
dc.date.schoolyear99-2
dc.description.degree碩士
dc.contributor.oralexamcommittee賴永康,簡韶逸,楊家輝,蔡宗漢
dc.subject.keyword紋理預測,zh_TW
dc.subject.keywordTexture Prediction,en
dc.relation.page70
dc.rights.note未授權
dc.date.accepted2011-08-21
dc.contributor.author-college電機資訊學院zh_TW
dc.contributor.author-dept電機工程學研究所zh_TW
顯示於系所單位:電機工程學系

文件中的檔案:
檔案 大小格式 
ntu-100-1.pdf
  未授權公開取用
2.73 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved