Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 工學院
  3. 機械工程學系
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/42645
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor黃漢邦
dc.contributor.authorChih-Fan Chenen
dc.contributor.author陳志帆zh_TW
dc.date.accessioned2021-06-15T01:18:38Z-
dc.date.available2011-07-29
dc.date.copyright2009-07-29
dc.date.issued2009
dc.date.submitted2009-07-27
dc.identifier.citation[1] X. Bai, X. Yang, and L. J. Latecki, “Detection and recognition of contour parts based on shape similarity,” Pattern Recognition, Vol. 41, pp. 2189-2199, 2008.
[2] M. Beauchemin, K. P. B. Thomson, and G. Edwards, “On the Hausdorff distance used for the evaluation of segmentation results,” Canadian Journal of Remote Sensing, Vol. 24, pp. 3-8, 1998.
[3] E. Belogay, C. Cabrelli, U. Molter, and R. Shonkwiler, “Calculating the Hausdorff distance between curves,” Information Processing Letters, Vol. 64, pp. 17-22, 1997.
[4] S. Belongie, J. Malik, and J. Puzicha, “Shape matching and object recognition using shape contexts,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 24, pp. 509-522, 2002.
[5] K. Bennett, “Support vector machines and other kernel methods,” in IEEE International Conference on Fuzzy Systems, Reno, NV, United states, p. 2, 2005.
[6] G. Borgefors, “Hierarchical Chamfer matching: A parametric edge matching algorithm,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 10, pp. 849-865, 1988.
[7] Y. Boykov and G. Funka-Lea, “Graph cuts and efficient N-D image segmentation,” International Journal of Computer Vision, Vol. 70, pp. 109-131, 2006.
[8] J. C. Burges, “A Tutorial on Support Vector Machines for Pattern Recognition,” Vol. 2, 1998.
[9] J. Canny, “COMPUTATIONAL APPROACH TO EDGE DETECTION,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. PAMI-8, pp. 679-698, 1986.
[10] S. Carpin, A. Birk, and V. Jucikas, “On map merging,” Robotics and Autonomous Systems, Vol. 53, pp. 1-14, 2005.
[11] T.-W. Chen, Y.-L. Chen, and S.-Y. Chien, “Fast image segmentation based on K-means clustering with histograms in HSV color space,” in Proceedings of the 2008 IEEE 10th Workshop on Multimedia Signal Processing, MMSP 2008, Cairns, QLD, Australia, pp. 322-325, 2008.
[12] C. Corts and V. N. Vapnik, “Support Vector Networks,” Machine Learning, Vol. Vol. 20, pp. pp. 273-297, 1995.
[13] A. Del Bimbo and P. Pala, “Visual image retrieval by elastic matching of user sketches,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 19, pp. 121-132, 1997.
[14] A. Del Bimbo, P. Pala, and S. Santini, “Image retrieval by elastic matching of shapes and image patterns,” in International Conference on Multimedia Computing and Systems -Proceedings, Los Alamitos, CA, USA, pp. 215-218, 1996.
[15] A. Del Bimbo, P. Pala, and S. Santini, “Visual image retrieval by elastic deformation of object sketches,” in IEEE Symposium on Visual Languages, Proceedings, Los Alamitos, CA, USA, pp. 216-223, 1994.
[16] A. El Oirrak, “Affine invariant descriptors using principal components analysis,” Pattern Recognition and Image Analysis, Vol. 18, pp. 14-22, 2008.
[17] A. Ess, B. Leibe, K. Schindler, and L. Van Gool, “A mobile vision system for robust multi-person tracking,” in 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR, Anchorage, AK, United states, 2008.
[18] R. Fabbri, L. Da F. Costa, J. C. Torelli, and O. M. Bruno, “2D Euclidean distance transform algorithms: A comparative survey,” ACM Computing Surveys, Vol. 40, 2008.
[19] P. Felzenszwalb, D. McAllester, and D. Ramanan, “A discriminatively trained, multiscale, deformable part model,” in 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR, Anchorage, AK, United states, 2008.
[20] P. F. Felzenszwalb and D. P. Huttenlocher, “Distance Transforms of Sampled Functions,” Cornell Computing and Information Science Technical Report TR2004-1963, 2004.
[21] P. F. Felzenszwalb and D. P. Huttenlocher, “Efficient graph-based image segmentation,” International Journal of Computer Vision, Vol. 59, pp. 167-181, 2004.
[22] V. Ferrari, L. Fevrier, F. Jurie, and C. Schmid, “Groups of adjacent contour segments for object detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 30, pp. 36-51, 2008.
[23] V. Ferrari, T. Tuytelaars, and L. Van Gool, “Object detection by contour segment networks,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Heidelberg, D-69121, Germany, Vol. 3953 NCS, pp. 14-28, 2006.
[24] S. Garrido, L. Moreno, D. Blanco, and F. Martin, “Log of the inverse of the Distance Transform and Fast Marching applied to Path Planning,” in IEEE International Conference on Intelligent Robots and Systems, Beijing, China, pp. 2882-2887, 2006.
[25] D. M. Gavrila, “A Bayesian, exemplar-based approach to hierarchical shape matching,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 29, pp. 1408-1421, 2007.
[26] D. B. Gu and J. X. Sun, “EM image segmentation algorithm based on an inhomogeneous hidden MRF model,” IEE Proceedings: Vision, Image and Signal Processing, Vol. 152, pp. 184-190, 2005.
[27] G. Guo, S. Z. Li, and K. L. Chan, “Support vector machines for face recognition,” Image and Vision Computing, Vol. 19, pp. 631-638, 2001.
[28] M. Heath, S. Sarkar, T. Sanocki, and K. Bowyer, “Comparison of edge detectors: a methodology and initial study,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, pp. 143-148, 1996.
[29] H.-P. Huang and C.-T. Lin, “Multi-CAMSHIFT for multi-view faces tracking and recognition,” in 2006 IEEE International Conference on Robotics and Biomimetics, ROBIO 2006, Kunming, China, pp. 1334-1339, 2006.
[30] Z.-K. Huang and D.-H. Liu, “Segmentation of color image using EM algorithm in HSV color space,” in Proceedings of the 2007 International Conference on Information Acquisition, ICIA, Jeju City, Korea, Republic of, pp. 316-319, 2007.
[31] H. Ikeda, N. Kato, H. Kashimura, and M. Shimizu, “Scale, rotation and translation invarient fast face detection system,” in Proceedings of the IASTED International Conference on Signal and Image Processing, Honolulu, HI, United states, Vol. 5, pp. 146-151, 2003.
[32] A. E. Jason Weston, Gokhan BakIr , Fabian Sinz “Spider: General Purpose Machine Learning Toolbox in Matlab ”: http://www.kyb.tuebingen.mpg.de/bs/people/spider/
[33] B. Kurt, A. Capar, and M. Gokmen, “Affine invariant shape descriptors Ilgin degisimsiz sekil betimleyiciler,” in 2007 IEEE 15th Signal Processing and Communications Applications, SIU, Eskisehir, Turkey, 2007.
[34] B. Leibe, E. Seemann, and B. Schiele, “Pedestrian detection in crowded scenes,” in Proceedings - 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, San Diego, CA, United states, Vol. I, pp. 878-885, 2005.
[35] Y. Li, Z. Ou, and G. Wang, “Face recognition using Gabor features and Support Vector Machines,” in Lecture Notes in Computer Science, Changsha, China, Vol. 3611, pp. 119-122, 2005.
[36] C. J. Lin and C. C. Chang, “LIBSVM: A Library for Support Vector Machines,” http://www.csie.ntu.edu.tw/~cjlin/libsvm
[37] T. Lindeberg, “Edge detection and ridge detection with automatic scale selection,” International Journal of Computer Vision, Vol. 30, pp. 117-154, 1998.
[38] D. G. Lowe, “Distinctive image features from scale-invariant keypoints,” International Journal of Computer Vision, Vol. 60, pp. 91-110, 2004.
[39] K. Mikolajczyk and C. Schmid, “Scale affine invariant interest point detectors,” International Journal of Computer Vision, Vol. 60, pp. 63-86, 2004.
[40] H. P. Ng, S. H. Ong, K. W. C. Foong, P. S. Goh, and W. L. Nowinski, “Medical image segmentation using k-means clustering and improved watershed algorithm,” in Proceedings of the IEEE Southwest Symposium on Image Analysis and Interpretation, Denver, CO, United states, pp. 61-65, 2006.
[41] A. Opelt, A. Pinz, and A. Zisserman, “Incremental learning of object detectors using a visual shape alphabet,” in Proceedings - 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006, New York, NY, United states, Vol. 1, pp. 3-10, 2006.
[42] A. Opelt, A. Pinz, and A. Zisserman, “Learning an alphabet of shape and appearance for multi-class object detection,” International Journal of Computer Vision, Vol. 80, pp. 16-44, 2008.
[43] E. Rosten and T. Drummond, “Fusing points and lines for high performance tracking,” in Proceedings of the IEEE International Conference on Computer Vision, Beijing, China, Vol. II, pp. 1508-1515, 2005.
[44] P. Shih and C. Liu, “Extracting efficient color features for face recognition using evolutionary computation,” in Proceedings - Sixth International Conference on Computational Intelligence and Multimedia Applications, ICCIMA 2005, Las Vegas, NV, United states, pp. 285-290, 2005.
[45] J. Shotton, A. Blake, and R. Cipolla, “Contour-based learning for object detection,” in Proceedings of the IEEE International Conference on Computer Vision, Beijing, China, Vol. I, pp. 503-510, 2005.
[46] J. Shotton, A. Blake, and R. Cipolla, “Multiscale categorical object recognition using contour fragments,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 30, pp. 1270-1281, 2008.
[47] B. Stenger, A. Thayananthan, P. H. S. Torr, and R. Cipolla, “Model-based hand tracking using a hierarchical bayesian filter,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 28, pp. 1372-1384, 2006.
[48] C. H. a. M. Stephens, “A combined corner and edge detector,” Proceedings of the 4th Alvey Vision Conference, pp. 147--151., 1988.
[49] A. Thayananthan, B. Stenger, P. H. S. Torr, and R. Cipolla, “Shape context and chamfer matching in cluttered scenes,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 1, pp. 127-133, 2003.
[50] T. Vajda, E. Szatmari, and S. Nedevschi, “Human body detection and tracking in video sequences using chamfer matching,” in ICCP 2007 Proceedings IEEE 3rd International Conference on Intelligent Computer Communication and Processing, Cluj-Napoca, Romania, pp. 141-146, 2007.
[51] O. Veksler, “Star shape prior for graph-cut image segmentation,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Marseille, France, Vol. 5304 LNCS, pp. 454-467, 2008.
[52] Z. Wang, X. Yang, Y. Xu, and S. Yu, “CamShift guided particle filter for visual tracking,” Pattern Recognition Letters, Vol. 30, pp. 407-413, 2009.
[53] L. Zhang, J. Chen, Y. Lu, and P. Wang, “Face recognition using scale invariant feature transform and support vector machine,” in Proceedings of the 9th International Conference for Young Computer Scientists, ICYCS 2008, Zhang Jia Jie, Hunan, China, pp. 1766-1770, 2008.
[54] M. Zhang and R. Alhajj, “Improving the graph-based image segmentation method,” in Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI, Arlington, VA, United states, pp. 617-624, 2006.
[55] H. Zhou, Y. Yuan, and C. Shi, “Object tracking using SIFT features and mean shift,” Computer Vision and Image Understanding, Vol. 113, pp. 345-352, 2009.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/42645-
dc.description.abstract物體辨識這個議題近年來在電腦視覺(computer vision)上引起了不少的討論。因為這種資訊可以用在非常多方面像是:機器人學、影像搜索、人機互動等等方面。因為這還是一個未解決的問題,所以如何辨識物體的方法有很多,在本文之中,我們用輪廓資訊來當我們偵測物體的模型。
物體辨識是一個非常不容易的問題,因為我們最後的結果必須是知道物體在整張圖片的位置與大小。但是因為這些資訊不會事先就知道,因此.如何從整張影像中找到物體是一個非常不容易的事情。這也就是說在整個圖片搜尋的過程中,辨識的方法必須從大部分錯誤的圖片之中找到極少正確的圖片。
雖然其他資訊也可能很重要,但本論文的研究著重於只用輪廓的資訊偵測物體。整體而言可以分成兩大部分:第一部分是如何從使用者給予的圖片中擷取出輪廓。這個問題著重在怎麼開始,怎麼形成一條可用的輪廓。我們用了一些找興趣點(interesting point)的方法,像是Fast corner detector與 Difference of Gaussian 等方法。而找出線斷的方法,我們使用的是現在被廣泛使用的Canny edge Detector。有了這兩種方法後,我們可以找出輪廓的基礎模型。而第二部份則是決定於給予的圖片。若是給一張乾淨的影像,我們即可用輪廓來偵測物體。但是也可以給一些正確的圖片,我們也可以運用一些機器學習(Machine Learning)的方法,讓電腦自動找出重要的輪廓線,形成輪廓模型。
最後在與影像比對上,我們用了一種快速比對的方法:Chamfer Matching。在比對每一張圖片的相似度後,我們會有一個物體機率分布圖,我們找到分部最高的點,來當作我們的偵測結果。
zh_TW
dc.description.abstractObject detection has become an interesting topic in the filed of computer vision. Object information is very important and can be used in robotics, image retrieval and other technical areas. Object detection is a very challenging problem in computer vision, because the position and size of objects in an image must be known by computer itself. When the position and size of objects are not known, the detector should be able to differentiate between the query object and the negative objects. Although many methods and research papers have been presented about how to detect objects, we have chosen a native way to detect objects: contour-based object detection.
The purpose of this thesis is to present one kind of object detector using only contour information. The creation of such a detector could be separated into two parts. Firstly, those contours of the object must be formed and found. We use the result of some interesting point detectors as a starting point, and combine them with a well-known edge detector – “Canny edge detector”. Thus, we can identify the object contours in the image. Secondly, our object detector is formed by inputting one image or inputting a series of positive images. We must deal with them in different ways. If a pure picture is inputted with a specify object, we can only matching it without any training methods. But if several pieces of data are given, we try to use some learning methods such as SVM (support vector machine) to learn the model and classify the object in the image.
Finally, we use the method of “sliding windows” to detect objects in testing stage. We match the model with the query sub-image. By using “Chamfer matching” we can get a probability map of the object. The highest point is the object center detected by our detector. The benefit of the Chamfer matching is that it can efficiently match edges, which suits our purposes. We compare and show the results in different methods and objects.
en
dc.description.provenanceMade available in DSpace on 2021-06-15T01:18:38Z (GMT). No. of bitstreams: 1
ntu-98-R96522809-1.pdf: 4142742 bytes, checksum: cc5cf70469b38aac2c6805f982a916b4 (MD5)
Previous issue date: 2009
en
dc.description.tableofcontentsContents
摘要 I
Abstract III
Contents V
List of Tables VII
List of Figures VIII
Nomenclature X
Chapter 1 Introduction 1
1.1 Motivation 1
1.2 Related Works 3
1.3 Objective and Contributions 4
1.4 Thesis Organization 6
Chapter 2 Background Knowledge 8
2.1 Local Feature Detector 8
2.1.1 Corner Detector 8
2.1.2 Fast Corner Detection (FCD) 10
2.1.3 Difference of Gaussian (DoG) 11
2.2 Image Segmentation 13
2.3 Edge Detector 14
2.3.1 Detecting Operator 14
2.3.2 Canny Edge Detector 15
2.4 Similarity of Edge 17
2.4.1 Edge Matching 17
2.4.2 Distance Transform 18
2.4.3 Chamfer Matching 20
2.5 Classification Method 23
2.5.1 Machine Learning 23
2.5.2 Support Vector Machine (SVM) 24
Chapter 3 Definition and Concept of Object Detection 28
3.1 The Definition of an Object Class 28
3.2 Bounding Box of an Object 29
3.3 Database 30
3.4 Object Classification and Detection 31
3.4.1 Traditional Classification 31
3.4.2 Classification and Detection in Image 31
3.5 Evaluating the Performance 32
Chapter 4 Contour Information 36
4.1 Introduction 36
4.2 Contour Model 37
4.2.1 Image Preprocessing 37
4.2.2 Information Given by an Edge Point 39
4.2.3 Boundary Fragment 42
4.3 A Single Image Detector 46
4.3.1 The Assumption of Single Image 46
4.3.2 Hough Voting and Detect 48
4.3.3 Set Weight for Each Boundary 51
Chapter 5 Learning Object Model 52
5.1 Introduction of Learning 52
5.2 Boundary Fragment from Training Image 53
5.3 Fragment Clustering 54
5.3.1 Similarity Matrix 55
5.3.2 Hierarchical Clustering 56
5.4 Training Model 60
5.4.1 The Slight Change of Matching 60
5.4.2 Training Image into Feature Point 63
5.4.3 Applied Training Methods to Image 65
Chapter 6 Experiment Results 74
6.1 Detection Methods 74
6.2 System Flow Chart 76
6.2.1 Training Flow Chart 76
6.2.2 Testing Flow Chart 78
6.3 Experiments and Results 80
6.3.1 For single image 80
6.3.2 The Training Result 82
6.4 Comparison 87
Chapter 7 Conclusions 91
7.1 Conclusions 91
7.2 Future Works 92
References 94
dc.language.isoen
dc.subjectChamfer Matchingzh_TW
dc.subject物體偵測zh_TW
dc.subject機器學習zh_TW
dc.subject形狀比對zh_TW
dc.subject輪廓資訊zh_TW
dc.subjectObject detectionen
dc.subjectmachine learningen
dc.subjectcontour featureen
dc.subjectChamfer matchingen
dc.subjectshape matchingen
dc.title利用輪廓資訊辨識物體之方法發展zh_TW
dc.titleContour-Based Object Detectionen
dc.typeThesis
dc.date.schoolyear97-2
dc.description.degree碩士
dc.contributor.oralexamcommittee傅立成,徐宏民
dc.subject.keyword物體偵測,形狀比對,Chamfer Matching,輪廓資訊,機器學習,zh_TW
dc.subject.keywordObject detection,shape matching,Chamfer matching,contour feature,machine learning,en
dc.relation.page98
dc.rights.note有償授權
dc.date.accepted2009-07-27
dc.contributor.author-college工學院zh_TW
dc.contributor.author-dept機械工程學研究所zh_TW
顯示於系所單位:機械工程學系

文件中的檔案:
檔案 大小格式 
ntu-98-1.pdf
  未授權公開取用
4.05 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved