Content-aware Color plus Depth Map 3D Image Resizing

Wei-Cih Jhou; 周瑋慈

Please use this Handle URI to cite this document: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/48094

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	吳家麟(Ja-Ling Wu)
dc.contributor.author	Wei-Cih Jhou	en
dc.contributor.author	周瑋慈	zh_TW
dc.date.accessioned	2021-06-15T06:46:05Z	-
dc.date.available	2011-07-06
dc.date.copyright	2011-07-06
dc.date.issued	2011
dc.date.submitted	2011-06-22
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/48094	-
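dc.description.abstract	Recently, many studies have addressed the problem of media resizing. With the rapid growth of personal digital displays on which 3D images can be viewed with the naked eye, media content must be adapted not only to regular displays but also to 3D displays. Compared with traditional mono-view images, color plus depth map (or disparity map) 3D images provide more information about stereoscopic perception. Traditional visual attention models for mono-view images are therefore difficult to apply directly to resizing color plus depth map 3D images. In addition, the stereo visual comfort zone of the depth information for viewing 3D content differs with the screen size of the 3D display device; if the comfort zone is not considered, the 3D viewing experience will be stressful and uncomfortable. In this thesis, we first propose a bottom-up saliency model that incorporates depth information to simulate the stereoscopic human visual system. This saliency model is then integrated with an existing resizing technique to improve the quality of the resized 2D image. Finally, to compress 3D screen spaces of different sizes along the Z axis, we apply a nonlinear mapping to the depth information so that the resulting depth distribution falls within the viewer's stereo visual comfort zone. The proposed 3D image resizing technique takes the stereoscopic perception effects of the human visual attention model (saliency and comfort zone) into account. Experimental results show that the quality of the resized 2D images is improved, which in turn improves the human 3D viewing experience.	zh_TW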
dc.description.abstract	Recently, numerous works have focused on content-aware media resizing. Because the number of personal digital displays on which one can watch 3D images with the naked eye is growing rapidly, media need to be adapted not only to regular displays but also to 3D displays. Compared to traditional mono-view photographs, color plus depth map (or disparity map) 3D images provide more information about stereoscopic perception. Therefore, traditional visual attention models for mono-view photographs can hardly be applied directly to resizing color plus depth map 3D images. Moreover, the stereo comfort zone of the depth information for viewing 3D content varies with the size of the 3D display device. If the comfort zone is not taken into consideration, the 3D viewing experience will be stressful to the viewer's stereo vision. In this thesis, we first propose a bottom-up saliency model that incorporates depth information to simulate the stereoscopic vision of the human visual system. Then, the saliency model is integrated into an existing resizing technique to enhance the quality of the resized 2D image. Finally, to squeeze the various 3D screen spaces along the z direction, a nonlinear depth mapping technique is applied so that the resultant depth falls into the comfort zone. Experimental results show that, by taking the stereoscopic perception effects (saliency and comfort zone) of the human visual attention model into account, the quality of the resized 3D images is improved, which in turn enhances the viewing experience for 3D images.	en
dc.description.provenance	Made available in DSpace on 2021-06-15T06:46:05Z (GMT). No. of bitstreams: 1 ntu-100-R98944006-1.pdf: 1539379 bytes, checksum: 8116ee13fc5d36718e435069aa1da15a (MD5) Previous issue date: 2011	en
dc.description.tableofcontents	Oral Examination Committee Certification; Acknowledgements; Chinese Abstract; Abstract; Contents; List of Figures; List of Tables; Chapter 1 Introduction; 1.1 Motivation; 1.2 Contributions; Chapter 2 Related Work; 2.1 2D Image Resizing; 2.1.1 Cropping and Scaling; 2.1.2 Segmentation-Based Pasting; 2.1.3 Seam Carving; 2.1.4 Patch-Based Methods; 2.1.5 Warping-Based Methods; 2.1.6 Multi-Operator Methods; 2.2 3D Image Resizing; Chapter 3 The Color plus Depth Map Resizing; 3.1 The Problem Formulation; 3.2 The Unified Saliency Map; 3.2.1 The Depth Attention Model; 3.2.2 The Gradient-Based Depth Feature Model; 3.2.3 Superpixel-Based Saliency Map; 3.3 The Resizing in the X-Y Direction; 3.4 The Resizing in the Z Direction; Chapter 4 Experimental Results; 4.1 The Dataset; 4.2 The Analysis and Discussion; 4.2.1 User Study 1; 4.2.2 User Study 2; Chapter 5 Conclusion; References
dc.language.iso	en
dc.subject	立體視覺舒適	zh_TW
dc.subject	尺寸調整	zh_TW
dc.subject	視覺顯著性	zh_TW
dc.subject	stereo visual comfort	en
dc.subject	visual saliency	en
dc.subject	resizing	en
dc.title	基於內容之色彩加深度圖三維影像尺寸調整架構	zh_TW
dc.title	Content-aware Color plus Depth Map 3D Image Resizing	en
dc.type	Thesis
dc.date.schoolyear	99-2
dc.description.degree	Master's
dc.contributor.oralexamcommittee	黃俊翔(Chunh-Siang Huang),朱威達(Wei-Ta Chu),鄭文皇(Wen-Huang Cheng)
dc.subject.keyword	尺寸調整,視覺顯著性,立體視覺舒適	zh_TW
dc.subject.keyword	resizing,visual saliency,stereo visual comfort	en
dc.relation.page	35
dc.rights.note	Paid license
dc.date.accepted	2011-06-22
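dc.contributor.author-college	College of Electrical Engineering and Computer Science	zh_TW
dc.contributor.author-dept	Graduate Institute of Networking and Multimedia	zh_TW
Appears in Collections:	Graduate Institute of Networking and Multimedia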

Files in This Item:

File	Size	Format
ntu-100-1.pdf (not authorized for public access)	1.5 MB	Adobe PDF

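Items in this system are protected by copyright, with all rights reserved, unless their copyright terms are specifically indicated otherwise.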