Please use this Handle URI to cite this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/44659
Full metadata record
DC field: value [language]
dc.contributor.advisor: 歐陽明 (Ming Ouhyoung)
dc.contributor.author: Yuan-Chen Ho [en]
dc.contributor.author: 何元臣 [zh_TW]
dc.date.accessioned: 2021-06-15T03:52:27Z
dc.date.available: 2010-07-13
dc.date.copyright: 2010-07-13
dc.date.issued: 2010
dc.date.submitted: 2010-07-08
dc.identifier.citation:
[1] W. H. Kruskal. Ordinal measures of association. Journal of the American Statistical Association, 53(284): 814-861, 1958.
[2] R. O. Duda and P. E. Hart. Use of the Hough transformation to detect lines and curves in pictures. Communications of the ACM 15, 1 (1972), 11–15.
[3] R. Chellappa, “Two-dimensional discrete Gaussian Markov random field models for image processing,” Pattern Recognition, Vol. 2, 1985, pp.79-112.
[4] J. Canny, A computational approach to edge detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 8, 6 (1986), 679–698.
[5] B. S. Manjunath and W. Y. Ma, “Texture Features for Browsing and Retrieval of Image Data,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 18, No. 8, August 1996.
[6] L. Itti, C. Koch, and E. Niebur. A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell., 20(11):1254–1259, 1998.
[7] R. Herbrich, T. Graepel, and K. Obermayer, Support vector learning for ordinal regression. In IEE Conference Publication, volume 1, pages 97–102. Citeseer, 1999.
[8] A. E. Savakis, S. P. Etz, and A. C. Loui. Evaluation of image appeal in consumer photography. In Proceedings of SPIE, Human Vision and Electronic Imaging V (2000), vol. 3959, pp. 111–120.
[9] C. C. Chang and C. J. Lin. LIBSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
[10] Y. Ro, M. Kim, H. Kang, B. Manjunath, and J. Kim. MPEG-7 homogeneous texture descriptor. ETRI journal, 23(2):41-51, 2001.
[11] B. Suh, H. Ling, B. B. Bederson, and D. W. Jacobs. Automatic thumbnails cropping and its effectiveness. In Proceedings of the 2003 ACM Symposium on User Interface Software and Technology (2003), pp. 95-104.
[12] H. Tong, M. Li, H. Zhang, J. He, and C. Zhang. Classification of digital photos taken by photographers or home users. Lecture Notes in Computer Science, pages 198–205, 2004.
[13] P. Felzenszwalb and D. Huttenlocher. Efficient graph-based image segmentation. International Journal of Computer Vision, 59(2):167-181, 2004.
[14] C. G. M. Snoek, M. Worring, and A. W. M. Smeulders. Early versus late fusion in semantic video analysis. In MULTIMEDIA ’05: Proceedings of the 13th annual ACM International conference on Multimedia, pages 399-402, New York, NY, USA, 2005. ACM.
[15] G. Sharma, W. Wu, and E. Dalal. The CIEDE2000 color-difference formula: implementation notes, supplementary test data, and mathematical observations. Color research and application, 30(1):21-30, 2005.
[16] M. S. Lew, N. Sebe, C. Djeraba, and R. Jain. Content-based multimedia information retrieval: State of the art and challenges. ACM TOMCCAP, 2(1):1-19, 2006.
[17] D. Cohen-Or, O. Sorkine, R. Gal, T. Leyvand, and Y.-Q. Xu. Color harmonization. ACM Trans. Graph., 25(3):624-630, 2006.
[18] A. Santella, M. Agrawala, D. DeCarlo, D. Salesin, and M. Cohen. Gaze-based interaction for semi-automatic photo cropping. In ACM CHI 2006 Conference Proceedings (2006), pp. 771-780.
[19] R. Datta, D. Joshi, J. Li, and J. Z. Wang. Studying aesthetics in photographic images using a computational approach. In Proc. ECCV, pages 7-13, 2006.
[20] Y. Ke, X. Tang, and F. Jing. The design of high-level features for photo quality assessment. In Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on, volume 1, pages 419- 426, June 2006.
[21] S. Banerjee and B. Evans. In-camera automation of photographic composition rules. IEEE Transactions on Image Processing 16, 7 (2007), 1807-1820.
[22] J. Harel, C. Koch, and P. Perona. Graph-based visual saliency. In B. Schölkopf, J. Platt, and T. Hoffman, editors, Advances in Neural Information Processing Systems 19, pages 545–552. MIT Press, Cambridge, MA, 2007.
[23] T. Liu, J. Sun, N. Zheng, X. Tang and H. Shum. Learning to Detect A Salient Object. CVPR 2007.
[24] C. J. Hu, A Real-Time Skin-Color-Enhanced Face Detection Algorithm. Master’s thesis, National Taiwan University, 2007.
[25] Z. Cao, T. Qin, T. Liu, M. Tsai, and H. Li. Learning to rank: from pairwise approach to listwise approach. In Proceedings of the 24th international conference on Machine learning, page 136. ACM, 2007.
[26] J. Harel, C. Koch, and P. Perona. Graph-based visual saliency. In Advances in Neural Information Processing Systems 19, B. Schölkopf, J. Platt, and T. Hoffman, Eds. Cambridge, MA: MIT Press, 2007.
[27] M. Freeman. The Photographer’s Eye: Composition and Design for Better Digital Photos. Ilex Press, 2007.
[28] R. Datta, J. Li, and J. Z. Wang. Algorithmic Inferencing of Aesthetics and Emotion in Natural Images: An Exposition. Proc. of IEEE International Conference on Image Processing, Special Session on Image Aesthetics, Mood and Emotion, 105-108, 2008.
[29] Y. Yang and W. Hsu. Video search reranking via online ordinal reranking. In Proceedings of ICME, pages 285–288, 2008.
[30] Y. Luo and X. Tang. Photo and video quality evaluation: Focusing on the subject. In ECCV '08: Proceedings of the 10th European Conference on Computer Vision, pages 386–399, Berlin, Heidelberg, 2008. Springer-Verlag.
[31] J. San Pedro and S. Siersdorfer. Ranking and classifying attractiveness of photos in folksonomies. In WWW '09: Proceedings of the 18th international conference on World wide web, pages 771-780, New York, NY, USA, 2009. ACM.
[32] R. Achanta, S. Hemami, F. Estrada, and S. Süsstrunk. Frequency-tuned salient region detection. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, pages 1597-1604, June 2009.
[33] M. Nishiyama, T. Okabe, Y. Sato, and I. Sato. Sensation-based photo cropping. In Multimedia ’09: Proceeding of 17th annual ACM international conference on Multimedia, pages 669-672, New York, NY, USA, 2009. ACM.
[34] Y. Y. Chang, and H. T. Chen. Finding Good Composition in Panoramic Scenes. ICCV, 2009
[35] X. Sun, H. Yao, R. Ji, and S. Liu. Photo assessment based on computational visual attention model. In MM '09: Proceedings of the seventeenth ACM international conference on Multimedia, pages 541-544, New York, NY, USA, 2009. ACM.
[36] S. Wang, Q. Huang, S. Jiang, L. Qin, and Q. Tian. Visual ContextRank for Web Image Re-ranking. ACM Multimedia 2009: 657-660.
[37] L. Liu, R. Chen, L. Wolf, and D. Cohen-Or. Optimizing photo composition. Computer Graphics Forum (Proceedings of Eurographics), 29(2), 2010.
dc.identifier.uri: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/44659
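dc.description.abstract (translated from the zh_TW original): With the spread of digital cameras, the number of photos each person holds has risen sharply, so browsing photos one by one to pick favorites no longer meets users' needs. In addition, not every user is familiar with the composition rules used to judge photo quality, which has motivated a series of studies on automatically classifying photos by quality. Early studies mostly took an image-processing perspective and relied on low-level features such as sharpness, brightness, and saturation, yet still obtained satisfactory results (Ke 2006, 72% accuracy; Datta 2006, 86% accuracy). More recent work added rules that photographers use when shooting (rule of thirds, light-dark contrast, shallow depth of field) and further improved prediction accuracy (Luo 2008, 95% accuracy). However, previous studies all follow the same workflow: use part of the data to train a predictive model by machine learning, then validate it on the remaining data. A model obtained this way reflects the opinions of most users but may not suit an individual user. Moreover, the accuracy reported by previous work mostly concerns splitting photos into good and bad classes, with little attention to producing a complete ordering of photos by score. With these two goals, incorporating user opinion and examining ranking accuracy, we built a system that lets users combine professional aesthetic analysis with their personal style preferences and obtain re-ranked results in real time. The system achieves (1) 93% good/bad classification accuracy, (2) a Kendall tau rank correlation of 0.4054, and (3) 92% of users judging the re-ranked results better than the original ranking. [zh_TW]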
dc.description.abstract: Due to the growing popularity and availability of digital cameras, the number of photos owned by each user has increased dramatically. Consequently, manually selecting favorite photos has become nearly impractical. Moreover, since most digital camera users may not be familiar with the photography rules used by professional photographers, a number of studies on automatic photo selection have been conducted. Most early studies use low-level features (clarity, brightness, and saturation) and achieve reasonably good results in binary classification (Ke 2006, 72% accuracy; Datta 2006, 86% accuracy). Recent studies have combined high-level features, formulated from photographers’ rules of photo composition, with low-level features to produce better results (Luo 2008, 95% accuracy). However, most previous studies follow the same framework: training a prediction model with some of the data and testing the model with the rest. The problem with this framework is that the prediction model may reflect the preference of a certain group of people but may not agree with each individual user’s taste. Besides, previous works concentrate on the prediction accuracy of binary classification, whereas we also want to examine the accuracy of the ordering of the entire ranked list. Therefore, we create a system that lets users combine a ranked photo list based on photography rules with their personal preferences to create more personalized results. Our system achieves (1) 93% binary classification accuracy, (2) a Kendall tau rank correlation of 0.4054, and (3) a 92% satisfaction rate for the personalized re-ranked list over the original list. [en]
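The Kendall tau rank correlation reported in the abstract measures how well one ordering agrees with another: +1 for identical orderings, -1 for fully reversed ones. The record does not say which variant the thesis uses or how ties are handled, so the following is only a minimal sketch of the plain tau-a definition (concordant pairs minus discordant pairs, divided by the number of pairs). The function name `kendall_tau_a` and the sample scores are hypothetical and are not taken from the thesis.

```python
from itertools import combinations


def kendall_tau_a(reference, predicted):
    """Kendall tau-a between two equal-length score lists.

    Assumption: tied pairs count as neither concordant nor discordant,
    and no tie correction is applied (tau-a, not tau-b).
    """
    if len(reference) != len(predicted):
        raise ValueError("score lists must have the same length")
    n = len(reference)
    if n < 2:
        raise ValueError("need at least two items to compare")

    concordant = 0
    discordant = 0
    for i, j in combinations(range(n), 2):
        # The sign of the product tells whether both lists order the pair the same way.
        sign = (reference[i] - reference[j]) * (predicted[i] - predicted[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1

    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Hypothetical scores for five photos: user ratings vs. system scores.
user_scores = [5, 4, 3, 2, 1]
system_scores = [4, 5, 3, 1, 2]
print(kendall_tau_a(user_scores, system_scores))  # 8 concordant, 2 discordant -> 0.6
```

Under this definition, a value such as 0.4054 corresponds to roughly 70% of photo pairs being ordered consistently with the reference ranking, assuming no ties.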
dc.description.provenance: Made available in DSpace on 2021-06-15T03:52:27Z (GMT). No. of bitstreams: 1. ntu-99-R97944025-1.pdf: 4157786 bytes, checksum: 38bc0e73a92694131c335a85e73e7c9d (MD5). Previous issue date: 2010 [en]
dc.description.tableofcontents:
Acknowledgements i
Abstract (Chinese) ii
Abstract iii
Chapter 1 1
Chapter 2 3
2.1 Binary Photo Classification 3
2.2 Automatic image cropping 4
2.3 Re-ranking 5
Chapter 3 7
3.1 Overview 7
3.2 Saliency Map Detector 9
3.3 Line Pattern Detector 12
3.4 Personalization 13
3.5 User Interface 14
Chapter 4 17
4.1 Photo Composition 18
4.1.1 Rule of Thirds 18
4.1.2 Size of ROI 20
4.1.3 Line patterns 21
4.1.4 Aspect ratio 22
4.1.5 ROI mass center 23
4.1.6 Simplicity 23
4.2 Color and Intensity Distribution 24
4.2.1 Clarity 24
4.2.2 Color harmony 25
4.2.3 Intensity balance 26
4.2.4 Contrast 27
4.3 Texture 28
4.4 Personalized features 31
4.4.1 Intensity and Saturation 31
4.4.2 Average RGB 32
4.4.3 Black-and-white photo detection 33
Chapter 5 36
5.1 Prediction Accuracy 36
5.2 Ranking 39
5.3 Other experiments 40
5.4 User Study 40
5.5 More Results 42
Chapter 6 47
Bibliography 49
Appendix 55
A. Color Difference Equation (CIE 2000) 55
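dc.language.iso: en
dc.subject: 照片構圖 (photo composition) [zh_TW]
dc.subject: 照片排序 (photo ranking) [zh_TW]
dc.subject: 個人化 (personalization) [zh_TW]
dc.subject: 美學規則 (aesthetic rules) [zh_TW]
dc.subject: Photo composition [en]
dc.subject: Personalized ranking [en]
dc.subject: Photo ranking [en]
dc.subject: Aesthetic rules [en]
dc.title: 一套個人化之照片排序系統 [zh_TW]
dc.title: A Novel Personalized Ranking System for Amateur Photos [en]
dc.type: Thesis
dc.date.schoolyear: 98-2
dc.description.degree: Master's (碩士)
dc.contributor.oralexamcommittee: 徐宏民 (Hung-Ming Hsu), 梁容輝 (Rung-Huei Liang)
dc.subject.keyword: 照片排序 (photo ranking), 個人化 (personalization), 照片構圖 (photo composition), 美學規則 (aesthetic rules) [zh_TW]
dc.subject.keyword: Photo ranking, Personalized ranking, Photo composition, Aesthetic rules [en]
dc.relation.page: 56
dc.rights.note: Paid license (有償授權)
dc.date.accepted: 2010-07-08
dc.contributor.author-college: 電機資訊學院 (College of Electrical Engineering and Computer Science) [zh_TW]
dc.contributor.author-dept: 資訊網路與多媒體研究所 (Graduate Institute of Networking and Multimedia) [zh_TW]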
Appears in collections: 資訊網路與多媒體研究所 (Graduate Institute of Networking and Multimedia)

Files in this item:
File: ntu-99-1.pdf (not authorized for public access); Size: 4.06 MB; Format: Adobe PDF


Unless their copyright terms are specifically indicated, all items in this system are protected by copyright, with all rights reserved.
