Augmented Multiple Instance Regression For Inferring Object Contours Within Bounding Boxes (多重實例迴歸於定界框內物件輪廓之估測)

Kuang-Jui Hsu; 許洸睿

Please use this Handle URI to cite this document: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/62091

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	莊永裕(Yung-Yu Chuang)
dc.contributor.author	Kuang-Jui Hsu	en
dc.contributor.author	許洸睿	zh_TW
dc.date.accessioned	2021-06-16T13:27:15Z	-
dc.date.available	2018-07-30
dc.date.copyright	2013-07-30
dc.date.issued	2013
dc.date.submitted	2013-07-23
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/62091	-
dc.description.abstract	In this work, we address the high annotation cost of acquiring training data for semantic segmentation. Most modern approaches use graphical models, such as conditional random fields, to carry out semantic segmentation, and hence rely on sufficient training data in the form of object contours. To reduce the manual effort of annotating contours, we consider a training dataset for semantic segmentation that is a mixture of a few object contours and an abundant set of object bounding boxes. Our idea is to borrow the knowledge derived from the object contours to infer the unknown object contours in the bounding boxes. The inferred contours can then serve as training data. To this end, we generate multiple contour hypotheses for each bounding box, with the constraint that at least one hypothesis is close to the ground truth. In the terms of multiple instance learning (MIL), a bounding box can be treated as a bag with its contour hypotheses as instances. We propose an approach, called augmented multiple instance regression (AMIR), that formulates the task of hypothesis selection as a multiple instance regression (MIR) problem, and augments it with information derived from the object contours to guide and regularize the training process of MIR. The proposed approach is evaluated on the Pascal VOC segmentation task. The experimental results demonstrate that AMIR can precisely infer the object contours in the bounding boxes, and hence provides effective alternatives to manually labeled contours for semantic segmentation.	en
dc.description.provenance	Made available in DSpace on 2021-06-16T13:27:15Z (GMT). No. of bitstreams: 1 ntu-102-R99944051-1.pdf: 13668353 bytes, checksum: 7b622fd0847987cc130550d6853743f9 (MD5) Previous issue date: 2013	en
dc.description.tableofcontents	Acknowledgements; Chinese Abstract; Abstract; 1 Introduction; 2 Related Work; 2.1 Semantic Segmentation; 2.2 Figure-Ground Segmentation; 2.3 Multiple Image Segmentations; 2.4 Semantic Segmentation with Low Cost; 2.5 Multiple Instance Learning (MIL); 3 Inferring Multiple Tight Segments in a Bounding Box; 3.1 Tight Segment via Bounding Box Prior; 3.2 Multiple Tight Segments; 4 Inferring Best Tight Segmentation; 4.1 Problem Statement; 4.1.1 Labeled Contours (Instance Level); 4.1.2 Positive Bounding Boxes (Bag Level); 4.1.3 Negative Bounding Boxes (Bag Level); 4.2 Linear Regression; 4.3 Benefits From The Bounding Boxes; 4.4 AMIR Formulation; 4.5 AMIR Optimization; 4.5.1 Approximation of "max"; 4.5.2 Differentiation of AMIR; 4.6 Discussion of Differentiation; 5 Feature Extraction; 5.1 Segment-Level Feature; 5.2 Pixel-Level Feature; 6 Experiment Results; 6.1 Dataset: Pascal VOC 2007; 6.2 Baselines; 6.3 Experiment I: Multiple Tight Segments; 6.4 Experiment II: Segment Selection for Object Contour Estimation; 6.5 Experiment III: Semantic Segmentation; 7 Conclusion; Bibliography
dc.language.iso	en
dc.subject	弱監督式學習	zh_TW
dc.subject	語意式影像切割	zh_TW
dc.subject	多重實例迴歸	zh_TW
dc.subject	切割選擇	zh_TW
dc.subject	segment selection	en
dc.subject	Semantic segmentation	en
dc.subject	weakly supervised learning	en
dc.subject	multiple instance regression (MIR)	en
dc.title	多重實例迴歸於定界框內物件輪廓之估測	zh_TW
dc.title	Augmented Multiple Instance Regression For Inferring Object Contours Within Bounding Boxes	en
dc.type	Thesis
dc.date.schoolyear	101-2
dc.description.degree	Master
dc.contributor.coadvisor	林彥宇(Yen-Yu Lin)
dc.contributor.oralexamcommittee	陳祝嵩(Chu-song Chen),賴尚宏(Shang-Hong Lai),陳煥宗(Hwann-Tzong Chen)
dc.subject.keyword	語意式影像切割,弱監督式學習,多重實例迴歸,切割選擇,	zh_TW
dc.subject.keyword	Semantic segmentation,weakly supervised learning,multiple instance regression (MIR),segment selection	en
dc.relation.page	44
dc.rights.note	Paid license
dc.date.accepted	2013-07-23
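dc.contributor.author-college	College of Electrical Engineering and Computer Science (電機資訊學院)	zh_TW
dc.contributor.author-dept	Graduate Institute of Networking and Multimedia (資訊網路與多媒體研究所)	zh_TW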
Appears in Collections:	Graduate Institute of Networking and Multimedia
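
The abstract treats each bounding box as a bag whose instances are contour hypotheses, and the table of contents lists a linear regression model and an approximation of "max". The Python sketch below illustrates that setup under stated assumptions; it is not the thesis's implementation. The log-sum-exp smoothing, the `alpha` sharpness parameter, the feature vectors, and the function names `smooth_max`, `bag_score`, and `select_hypothesis` are all hypothetical choices made for illustration.

```python
import numpy as np


def smooth_max(scores, alpha=10.0):
    """Log-sum-exp approximation of max(scores).

    Assumption: log-sum-exp is used here as one common differentiable
    stand-in for max; the thesis may use a different approximation.
    Larger alpha gives a tighter approximation.
    """
    scores = np.asarray(scores, dtype=float)
    m = scores.max()  # subtract the max for numerical stability
    return m + np.log(np.exp(alpha * (scores - m)).sum()) / alpha


def bag_score(w, instances, alpha=10.0):
    """Score a bag (bounding box) by the smooth max of its instance scores.

    Each row of `instances` is a hypothetical feature vector for one
    contour hypothesis; `w` is a linear regressor.
    """
    return smooth_max(instances @ w, alpha)


def select_hypothesis(w, instances):
    """Return the index of the highest-scoring contour hypothesis."""
    return int(np.argmax(instances @ w))


# Toy bag with three contour hypotheses and two features each.
w = np.array([1.0, 0.0])
bag = np.array([[0.2, 1.0], [0.9, 0.0], [0.5, 3.0]])
best = select_hypothesis(w, bag)
score = bag_score(w, bag)
```

Because the smooth maximum is differentiable in `w`, a bag-level loss built on `bag_score` can be minimized with gradient-based methods, which a hard `max` would not allow directly.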

Files in This Item:

File	Size	Format
ntu-102-1.pdf (not authorized for public access)	13.35 MB	Adobe PDF

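Items in this system are protected by copyright, with all rights reserved, unless their copyright terms are specifically indicated otherwise.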