Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊網路與多媒體研究所
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/62091
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor莊永裕(Yung-Yu Chuang)
dc.contributor.authorKuang-Jui Hsuen
dc.contributor.author許洸睿zh_TW
dc.date.accessioned2021-06-16T13:27:15Z-
dc.date.available2018-07-30
dc.date.copyright2013-07-30
dc.date.issued2013
dc.date.submitted2013-07-23
dc.identifier.citation[1] C. Rother, V. Kolmogorov, and A. Blake. ”GrabCut”: Interactive foreground extraction using iterated graph cuts. ACM Trans. on Graphics, 23(3):309–314, 2004.
[2] V. S. Lempitsky, P. Kohli, C. Rother, and T. Sharp. Image segmentation with a bounding box prior. In Proc. Int’l Conf. Computer Vision, 2009.
[3] I. Endres and D. Hoiem. Category independent object proposals. In Proc. Euro. Conf. Computer Vision, 2010.
[4] Y. Chen, A. B. Chan, and G. Wang. Adaptive figure-ground classification. In Proc. Conf. Computer Vision and Pattern Recognition, 2012.
[5] B. Fulkerson, A. Vedaldi, and S. Soatto. Class segmentation and object localization with superpixel neighborhoods. In Proc. Int’l Conf. Computer Vision, 2009.
[6] J. Tighe and S. Lazebnik. Superparsing - scalable nonparametric image parsing with superpixels. Int. J. Compututer Vision, 101(2):329–349, 2013.
[7] X. Boix, J. M. Gonfaus, J. van de Weijer, A. D. Bagdanov, J. S. Gual, and J. Gonz` alez. Harmony potentials - fusing global and local scale for semantic image segmentation. Int. J. Compututer Vision, 96(1):83–102, 2012.
[8] C. Cheng, A Koschan, C.-H Chen, D L. Page, and M. A. Abidi. Outdoor scene image segmentation based on background recognition and perceptual organization. IEEE Trans. on Image Processing, 21(3):1007–1019, 2012.
[9] J. Carreira, F. Li, and C. Sminchisescu. Object recognition by sequential figure-ground ranking. Int. J. Compututer Vision, 98(3):243–262, 2012.
[10] J. Shotton, J.Winn, A. Criminisi, and T. Darrell. TextonBoost for image understanding: Multi-class object recognition and segmentation by jointly modeling texture, layout, and context. Int. J. Compututer Vision, 81(1):2–23, 2009.
[11] P. Kohli, L. Ladick’ y, and P. H. S. Torr. Robust higher order potentials for enforcing label consistency. Int. J. Compututer Vision, 82(3):302–324, 2009.
[12] N. Payet and S. Todorovic. Hough forest random field for object recognition and segmentation. IEEE Trans. on Pattern Analysis and Machine Intelligence, 35(5):1066–1079, 2013.
[13] Q. Zhou, J. Zhu, and W. Liu. Learning dynamic hybrid Markov random field for image labeling. IEEE Trans. on Image Processing, 22(6):2219–2232, 2013.
[14] P. Arbelaez, B. Hariharan, C. Gu, S. Gupta, L. D. Bourdev, and J. Malik. Semantic segmentation using regions and parts. In Proc. Conf. Computer Vision and Pattern Recognition, 2012.
[15] J. Carreira, R. Caseiro, J. Batista, and C. Sminchisescu. Semantic segmentation with second-order pooling. In Proc. Euro. Conf. Computer Vision, 2012.
[16] L. Ladick’ y, C. Russell, P. Kohli, and P. Torr. Associative hierarchical CRFs for object class image segmentation. In Proc. Int’l Conf. Computer Vision, 2009.
[17] L. Ladick’ y, C. Russell, P. Kohli, and P. H. S. Torr. Graph cut based inference with co-occurrence statistics. In Proc. Euro. Conf. Computer Vision, 2010.
[18] C. Pantofaru, C. Schmid, and M. Hebert. Object recognition by integrating multiple image segmentations. In Proc. Euro. Conf. Computer Vision, 2008.
[19] J. Shotton, M. Johnson, and R. Cipolla. Semantic texton forests for image categorization and segmentation. In Proc. Conf. Computer Vision and Pattern Recognition, 2008.
[20] A. Vezhnevets, V. Ferrari, and J. M. Buhmann. Weakly supervised structured output learning for semantic segmentation. In Proc. Conf. Computer Vision and Pattern Recognition, 2012.
[21] J. Yao, S. Fidler, and R. Urtasun. Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation. In Proc. Conf. Computer Vision and Pattern Recognition, 2012.
[22] D. Comaniciu and P. Meer. Mean shift: A robust approach toward feature space analysis. IEEE Trans. on Pattern Analysis and Machine Intelligence, 24(5):603–619, 2002.
[23] S. Chen, L. Cao, Y. Wang, J. Liu, and X. Tang. Image segmentation by MAP-ML estimations. IEEE Trans. on Image Processing, 19(9):2254–2264, 2010.
[24] M. Mignotte. A label field fusion Bayesian model and its penalized maximum rand estimator for image segmentation. IEEE Trans. on Image Processing, 19(6):1610–1624, 2010.
[25] A. K. Qin and D. A. Clausi. Multivariate image segmentation using semantic region growing with adaptive edge penalty. IEEE Trans. on Image Processing, 19(8):2157– 2170, 2010.
[26] M. B. Salah, A. Mitiche, and I. B. Ayed. Multiregion image segmentation by parametric kernel graph cuts. IEEE Trans. on Image Processing, 20(2):545–557, 2011.
[27] C. Panagiotakis, I. Grinias, and G. Tziritas. Natural image segmentation based on tree equipartition, bayesian flooding and region merging. IEEE Trans. on Image Processing, 20(8):2276–2287, 2011.
[28] S. Wang and J. M. Siskind. Image segmentation with ratio cut. IEEE Trans. on Pattern Analysis and Machine Intelligence, 25(6):675–690, 2003.
[29] S. Xiang, C. Pan, F. Nie, and C. Zhang. Turbopixel segmentation using eigen-images. IEEE Trans. on Image Processing, 19(11):3024–3034, 2010.
[30] J. D. Lafferty, A. McCallum, and F. C. N. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. Int’l Conf. Machine Learning, 2001.
[31] C. Sutton and A. McCallum. An Introduction to Conditional Random Fields for Relational Learning. MIT Press, 2006.
[32] M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL Visual Object Classes Challenge 2007 (VOC2007) Results.
[33] C. Galleguillos, B. Babenko, A. Rabinovich, and S. Belongie. Weakly supervised object localization with stable segmentations. In Proc. Euro. Conf. Computer Vision, 2008.
[34] J. Carreira and C.Sminchisescu. CPMC: Automatic object segmentation using constrained parametric min-cuts. IEEE Trans. on Pattern Analysis and Machine Intelligence, 34(7):1312–1328, 2012.
[35] T. G. Dietterich, R. H. Lathrop, and T. Lozano-P’ erez. Solving the multiple instance problem with axis-parallel rectangles. Artificial Intelligence, 89(1-2):31–71, 1997.
[36] G. Tak’ acs. Smooth maximum function based algorithms for classification, regression, and collaborative filtering. Acta Technica Jaurinensis, Series Computatorica Intelligentica, 3(1):27–63, 2010.
[37] Y. Boykov, O. Veksler, and R. Zabih. Fast approximate energy minimization via graph cuts. IEEE Trans. on Pattern Analysis and Machine Intelligence, 23(11):1222–1239, 2001.
[38] T. Cour and J. Shi. Recognizing objects by piecing together the segmentation puzzle. In Proc. Conf. Computer Vision and Pattern Recognition, 2007.
[39] B. Alexe, T. Deselaers, and V. Ferrari. ClassCut for unsupervised class segmentation. In Proc. Euro. Conf. Computer Vision, 2010.
[40] R. Unnikrishnan, C. Pantofaru, and M. Hebert. Toward objective evaluation of image segmentation algorithms. IEEE Trans. on Pattern Analysis and Machine Intelligence, 9(6):929–944, 2007.
[41] J.-Z. Cheng, F.-J. Chang, K.-J. Hsu, and Y.-Y. Lin. Knowledge leverage from contours to bounding boxes: A concise approach to annotation. In Proc. Asian Conf. on Computer Vision, 2012.
[42] M. P. Kumar, H. Turki, D. Preston, and D. Koller. Learning specific-class segmentation from diverse data. In Proc. Int’l Conf. Computer Vision, 2011.
[43] Daniel Kuettel and V. Ferrari. Figure-ground segmentation by transferring window masks. In Proc. Conf. Computer Vision and Pattern Recognition, 2012.
[44] Z. Kuang, D. Schnieders, H. Zhou, K.-Y. K. Wong, Y. Yu, and B. Peng. Learning image specific parameters for interactive segmentation. In Proc. Conf. Computer Vision and Pattern Recognition, 2012.
[45] P. A. Viola, J. C. Platt, and C. Zhang. Multiple instance boosting for object detection. In Advances in Neural Information Processing Systems, 2005.
[46] Q. Zhang, S. Goldman, W. Yu, and J. E. Fritts. Content-based image retrieval using multiple-instance learning. In Proc. Int’l Conf. Machine Learning, 2002.
[47] S. Ray and D. Page. Multiple instance regression. In Proc. Int’l Conf. Machine Learning, 2001.
[48] P.-M. Cheung and J. T. Kwok. A regularization framework for multiple-instance learning. In Proc. Int’l Conf. Machine Learning, 2006.
[49] David G. Lowe. Distinctive image features from scale-invariant keypoints. Int. J. Compututer Vision, 6O(2):91–110, 2004.
[50] M. Varma and A. Zisserman. A statistical approach to texture classification from single images. Int. J. Compututer Vision, 62(1-2):61–81, 2005.
[51] M. Gupta and K. Ramnath. Interactive segmentation tool-box.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/62091-
dc.description.abstractIn this work, we address the high annotation cost of acquiring training data for semantic segmentation. Most modern approaches utilize graphical models, such as the conditional random fields, to carry out semantic segmentation, and hence rely on sufficient training data in form of object contours. To reduce the manual effort on annotating contours, we consider the training dataset for semantic segmentation is a mixture of a few object contours and an abundant set of bounding boxes of objects. Our idea is to borrow the knowledge derived from the object contours to infer the unknown object contours in the bounding boxes. The inferred contours then can serve as training data. To this end, we generate multiple contour hypotheses for each bounding box with the constraint that at least one hypothesis is close to the ground truth. Corresponding to multiple instance learning (MIL), a bounding box can be treated as a bag with its contour hypotheses as instances. We proposed an approach, called augmented multiple instance regression (AMIR), that formulates the task of hypothesis selection as the problem of MIR, and augments information derived from the object contours to guide and regularize the training process of MIR. The proposed approach is evaluated in the Pascal VOC segmentation task. The experimental results demonstrate that AMIR can precisely infer the object contours in the bounding boxes, and hence provides effective alternates of manually labeled contours for semantic segmentation.en
dc.description.provenanceMade available in DSpace on 2021-06-16T13:27:15Z (GMT). No. of bitstreams: 1
ntu-102-R99944051-1.pdf: 13668353 bytes, checksum: 7b622fd0847987cc130550d6853743f9 (MD5)
Previous issue date: 2013
en
dc.description.tableofcontents誌謝 i
中文摘要 ii
Abstract iv
1 Introduction 1
2 Related Work 5
2.1 Semantic Segmentation . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.2 Figure-Ground Segmentation . . . . . . . . . . . . . . . . . . . . . . . . 6
2.3 Multiple Image Segmentations . . . . . . . . . . . . . . . . . . . . . . . 6
2.4 Semantic Segmentation with Low Cost . . . . . . . . . . . . . . . . . . . 7
2.5 Multiple Instance Learning (MIL) . . . . . . . . . . . . . . . . . . . . . 8
3 Inferring Multiple Tight Segments in a Bounding Box 9
3.1 Tight Segment via Bounding Box Prior . . . . . . . . . . . . . . . . . . 9
3.2 Multiple Tight Segments . . . . . . . . . . . . . . . . . . . . . . . . . . 11
4 Inferring Best Tight Segmentation 14
4.1 Problem Statement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
4.1.1 Labeled Contours (Instance Level) . . . . . . . . . . . . . . . . . 15
4.1.2 Positive Bounding Boxes (Bag Level) . . . . . . . . . . . . . . . 15
4.1.3 Negative Bounding Boxes (Bag Level) . . . . . . . . . . . . . . . 16
4.2 Linear Regression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
4.3 Benefits From The Bounding Boxes . . . . . . . . . . . . . . . . . . . . 17
4.4 AMIR Formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
4.5 AMIR Optimization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
4.5.1 Approximation of ”max” . . . . . . . . . . . . . . . . . . . . . . 20
4.5.2 Differentiation of AMIR . . . . . . . . . . . . . . . . . . . . . . 20
4.6 Discussion of Differentiation . . . . . . . . . . . . . . . . . . . . . . . . 22
5 Feature Extraction 23
5.1 Segment-Level Feature . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
5.2 Pixel-Level Feature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
6 Experiment Results 25
6.1 Dataset: Pascal VOC 2007 . . . . . . . . . . . . . . . . . . . . . . . . . 25
6.2 Baselines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
6.3 Experiment I: Multiple Tight Segments . . . . . . . . . . . . . . . . . . 28
6.4 Experiment II: Segment Selection for Object Contour Estimation . . . . . 29
6.5 Experiment III: Semantic Segmentation . . . . . . . . . . . . . . . . . . 31
7 Conclusion 37
Bibliography 39
dc.language.isoen
dc.subject弱監督式學習zh_TW
dc.subject語意式影像切割zh_TW
dc.subject多重實例迴歸zh_TW
dc.subject切割選擇zh_TW
dc.subjectsegment selectionen
dc.subjectSemantic segmentationen
dc.subjectweakly supervised learningen
dc.subjectmultiple in- stance regression (MIR)en
dc.title多重實例迴歸於定界框內物件輪廓之估測zh_TW
dc.titleAugmented Multiple Instance Regression For Inferring Object Contours Within Bounding Boxesen
dc.typeThesis
dc.date.schoolyear101-2
dc.description.degree碩士
dc.contributor.coadvisor林彥宇(Yen-Yu Lin)
dc.contributor.oralexamcommittee陳祝嵩(Chu-song Chen),賴尚宏(Shang-Hong Lai),陳煥宗(Hwann-Tzong Chen)
dc.subject.keyword語意式影像切割,弱監督式學習,多重實例迴歸,切割選擇,zh_TW
dc.subject.keywordSemantic segmentation,weakly supervised learning,multiple in- stance regression (MIR),segment selection,en
dc.relation.page44
dc.rights.note有償授權
dc.date.accepted2013-07-23
dc.contributor.author-college電機資訊學院zh_TW
dc.contributor.author-dept資訊網路與多媒體研究所zh_TW
顯示於系所單位:資訊網路與多媒體研究所

文件中的檔案:
檔案 大小格式 
ntu-102-1.pdf
  未授權公開取用
13.35 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved