請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/62091完整後設資料紀錄
| DC 欄位 | 值 | 語言 |
|---|---|---|
| dc.contributor.advisor | 莊永裕(Yung-Yu Chuang) | |
| dc.contributor.author | Kuang-Jui Hsu | en |
| dc.contributor.author | 許洸睿 | zh_TW |
| dc.date.accessioned | 2021-06-16T13:27:15Z | - |
| dc.date.available | 2018-07-30 | |
| dc.date.copyright | 2013-07-30 | |
| dc.date.issued | 2013 | |
| dc.date.submitted | 2013-07-23 | |
| dc.identifier.citation | [1] C. Rother, V. Kolmogorov, and A. Blake. ”GrabCut”: Interactive foreground extraction using iterated graph cuts. ACM Trans. on Graphics, 23(3):309–314, 2004.
[2] V. S. Lempitsky, P. Kohli, C. Rother, and T. Sharp. Image segmentation with a bounding box prior. In Proc. Int’l Conf. Computer Vision, 2009. [3] I. Endres and D. Hoiem. Category independent object proposals. In Proc. Euro. Conf. Computer Vision, 2010. [4] Y. Chen, A. B. Chan, and G. Wang. Adaptive figure-ground classification. In Proc. Conf. Computer Vision and Pattern Recognition, 2012. [5] B. Fulkerson, A. Vedaldi, and S. Soatto. Class segmentation and object localization with superpixel neighborhoods. In Proc. Int’l Conf. Computer Vision, 2009. [6] J. Tighe and S. Lazebnik. Superparsing - scalable nonparametric image parsing with superpixels. Int. J. Compututer Vision, 101(2):329–349, 2013. [7] X. Boix, J. M. Gonfaus, J. van de Weijer, A. D. Bagdanov, J. S. Gual, and J. Gonz` alez. Harmony potentials - fusing global and local scale for semantic image segmentation. Int. J. Compututer Vision, 96(1):83–102, 2012. [8] C. Cheng, A Koschan, C.-H Chen, D L. Page, and M. A. Abidi. Outdoor scene image segmentation based on background recognition and perceptual organization. IEEE Trans. on Image Processing, 21(3):1007–1019, 2012. [9] J. Carreira, F. Li, and C. Sminchisescu. Object recognition by sequential figure-ground ranking. Int. J. Compututer Vision, 98(3):243–262, 2012. [10] J. Shotton, J.Winn, A. Criminisi, and T. Darrell. TextonBoost for image understanding: Multi-class object recognition and segmentation by jointly modeling texture, layout, and context. Int. J. Compututer Vision, 81(1):2–23, 2009. [11] P. Kohli, L. Ladick’ y, and P. H. S. Torr. Robust higher order potentials for enforcing label consistency. Int. J. Compututer Vision, 82(3):302–324, 2009. [12] N. Payet and S. Todorovic. Hough forest random field for object recognition and segmentation. IEEE Trans. on Pattern Analysis and Machine Intelligence, 35(5):1066–1079, 2013. [13] Q. Zhou, J. Zhu, and W. Liu. Learning dynamic hybrid Markov random field for image labeling. IEEE Trans. on Image Processing, 22(6):2219–2232, 2013. [14] P. Arbelaez, B. Hariharan, C. Gu, S. Gupta, L. D. Bourdev, and J. Malik. Semantic segmentation using regions and parts. In Proc. Conf. Computer Vision and Pattern Recognition, 2012. [15] J. Carreira, R. Caseiro, J. Batista, and C. Sminchisescu. Semantic segmentation with second-order pooling. In Proc. Euro. Conf. Computer Vision, 2012. [16] L. Ladick’ y, C. Russell, P. Kohli, and P. Torr. Associative hierarchical CRFs for object class image segmentation. In Proc. Int’l Conf. Computer Vision, 2009. [17] L. Ladick’ y, C. Russell, P. Kohli, and P. H. S. Torr. Graph cut based inference with co-occurrence statistics. In Proc. Euro. Conf. Computer Vision, 2010. [18] C. Pantofaru, C. Schmid, and M. Hebert. Object recognition by integrating multiple image segmentations. In Proc. Euro. Conf. Computer Vision, 2008. [19] J. Shotton, M. Johnson, and R. Cipolla. Semantic texton forests for image categorization and segmentation. In Proc. Conf. Computer Vision and Pattern Recognition, 2008. [20] A. Vezhnevets, V. Ferrari, and J. M. Buhmann. Weakly supervised structured output learning for semantic segmentation. In Proc. Conf. Computer Vision and Pattern Recognition, 2012. [21] J. Yao, S. Fidler, and R. Urtasun. Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation. In Proc. Conf. Computer Vision and Pattern Recognition, 2012. [22] D. Comaniciu and P. Meer. Mean shift: A robust approach toward feature space analysis. IEEE Trans. on Pattern Analysis and Machine Intelligence, 24(5):603–619, 2002. [23] S. Chen, L. Cao, Y. Wang, J. Liu, and X. Tang. Image segmentation by MAP-ML estimations. IEEE Trans. on Image Processing, 19(9):2254–2264, 2010. [24] M. Mignotte. A label field fusion Bayesian model and its penalized maximum rand estimator for image segmentation. IEEE Trans. on Image Processing, 19(6):1610–1624, 2010. [25] A. K. Qin and D. A. Clausi. Multivariate image segmentation using semantic region growing with adaptive edge penalty. IEEE Trans. on Image Processing, 19(8):2157– 2170, 2010. [26] M. B. Salah, A. Mitiche, and I. B. Ayed. Multiregion image segmentation by parametric kernel graph cuts. IEEE Trans. on Image Processing, 20(2):545–557, 2011. [27] C. Panagiotakis, I. Grinias, and G. Tziritas. Natural image segmentation based on tree equipartition, bayesian flooding and region merging. IEEE Trans. on Image Processing, 20(8):2276–2287, 2011. [28] S. Wang and J. M. Siskind. Image segmentation with ratio cut. IEEE Trans. on Pattern Analysis and Machine Intelligence, 25(6):675–690, 2003. [29] S. Xiang, C. Pan, F. Nie, and C. Zhang. Turbopixel segmentation using eigen-images. IEEE Trans. on Image Processing, 19(11):3024–3034, 2010. [30] J. D. Lafferty, A. McCallum, and F. C. N. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. Int’l Conf. Machine Learning, 2001. [31] C. Sutton and A. McCallum. An Introduction to Conditional Random Fields for Relational Learning. MIT Press, 2006. [32] M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL Visual Object Classes Challenge 2007 (VOC2007) Results. [33] C. Galleguillos, B. Babenko, A. Rabinovich, and S. Belongie. Weakly supervised object localization with stable segmentations. In Proc. Euro. Conf. Computer Vision, 2008. [34] J. Carreira and C.Sminchisescu. CPMC: Automatic object segmentation using constrained parametric min-cuts. IEEE Trans. on Pattern Analysis and Machine Intelligence, 34(7):1312–1328, 2012. [35] T. G. Dietterich, R. H. Lathrop, and T. Lozano-P’ erez. Solving the multiple instance problem with axis-parallel rectangles. Artificial Intelligence, 89(1-2):31–71, 1997. [36] G. Tak’ acs. Smooth maximum function based algorithms for classification, regression, and collaborative filtering. Acta Technica Jaurinensis, Series Computatorica Intelligentica, 3(1):27–63, 2010. [37] Y. Boykov, O. Veksler, and R. Zabih. Fast approximate energy minimization via graph cuts. IEEE Trans. on Pattern Analysis and Machine Intelligence, 23(11):1222–1239, 2001. [38] T. Cour and J. Shi. Recognizing objects by piecing together the segmentation puzzle. In Proc. Conf. Computer Vision and Pattern Recognition, 2007. [39] B. Alexe, T. Deselaers, and V. Ferrari. ClassCut for unsupervised class segmentation. In Proc. Euro. Conf. Computer Vision, 2010. [40] R. Unnikrishnan, C. Pantofaru, and M. Hebert. Toward objective evaluation of image segmentation algorithms. IEEE Trans. on Pattern Analysis and Machine Intelligence, 9(6):929–944, 2007. [41] J.-Z. Cheng, F.-J. Chang, K.-J. Hsu, and Y.-Y. Lin. Knowledge leverage from contours to bounding boxes: A concise approach to annotation. In Proc. Asian Conf. on Computer Vision, 2012. [42] M. P. Kumar, H. Turki, D. Preston, and D. Koller. Learning specific-class segmentation from diverse data. In Proc. Int’l Conf. Computer Vision, 2011. [43] Daniel Kuettel and V. Ferrari. Figure-ground segmentation by transferring window masks. In Proc. Conf. Computer Vision and Pattern Recognition, 2012. [44] Z. Kuang, D. Schnieders, H. Zhou, K.-Y. K. Wong, Y. Yu, and B. Peng. Learning image specific parameters for interactive segmentation. In Proc. Conf. Computer Vision and Pattern Recognition, 2012. [45] P. A. Viola, J. C. Platt, and C. Zhang. Multiple instance boosting for object detection. In Advances in Neural Information Processing Systems, 2005. [46] Q. Zhang, S. Goldman, W. Yu, and J. E. Fritts. Content-based image retrieval using multiple-instance learning. In Proc. Int’l Conf. Machine Learning, 2002. [47] S. Ray and D. Page. Multiple instance regression. In Proc. Int’l Conf. Machine Learning, 2001. [48] P.-M. Cheung and J. T. Kwok. A regularization framework for multiple-instance learning. In Proc. Int’l Conf. Machine Learning, 2006. [49] David G. Lowe. Distinctive image features from scale-invariant keypoints. Int. J. Compututer Vision, 6O(2):91–110, 2004. [50] M. Varma and A. Zisserman. A statistical approach to texture classification from single images. Int. J. Compututer Vision, 62(1-2):61–81, 2005. [51] M. Gupta and K. Ramnath. Interactive segmentation tool-box. | |
| dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/62091 | - |
| dc.description.abstract | In this work, we address the high annotation cost of acquiring training data for semantic segmentation. Most modern approaches utilize graphical models, such as the conditional random fields, to carry out semantic segmentation, and hence rely on sufficient training data in form of object contours. To reduce the manual effort on annotating contours, we consider the training dataset for semantic segmentation is a mixture of a few object contours and an abundant set of bounding boxes of objects. Our idea is to borrow the knowledge derived from the object contours to infer the unknown object contours in the bounding boxes. The inferred contours then can serve as training data. To this end, we generate multiple contour hypotheses for each bounding box with the constraint that at least one hypothesis is close to the ground truth. Corresponding to multiple instance learning (MIL), a bounding box can be treated as a bag with its contour hypotheses as instances. We proposed an approach, called augmented multiple instance regression (AMIR), that formulates the task of hypothesis selection as the problem of MIR, and augments information derived from the object contours to guide and regularize the training process of MIR. The proposed approach is evaluated in the Pascal VOC segmentation task. The experimental results demonstrate that AMIR can precisely infer the object contours in the bounding boxes, and hence provides effective alternates of manually labeled contours for semantic segmentation. | en |
| dc.description.provenance | Made available in DSpace on 2021-06-16T13:27:15Z (GMT). No. of bitstreams: 1 ntu-102-R99944051-1.pdf: 13668353 bytes, checksum: 7b622fd0847987cc130550d6853743f9 (MD5) Previous issue date: 2013 | en |
| dc.description.tableofcontents | 誌謝 i
中文摘要 ii Abstract iv 1 Introduction 1 2 Related Work 5 2.1 Semantic Segmentation . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.2 Figure-Ground Segmentation . . . . . . . . . . . . . . . . . . . . . . . . 6 2.3 Multiple Image Segmentations . . . . . . . . . . . . . . . . . . . . . . . 6 2.4 Semantic Segmentation with Low Cost . . . . . . . . . . . . . . . . . . . 7 2.5 Multiple Instance Learning (MIL) . . . . . . . . . . . . . . . . . . . . . 8 3 Inferring Multiple Tight Segments in a Bounding Box 9 3.1 Tight Segment via Bounding Box Prior . . . . . . . . . . . . . . . . . . 9 3.2 Multiple Tight Segments . . . . . . . . . . . . . . . . . . . . . . . . . . 11 4 Inferring Best Tight Segmentation 14 4.1 Problem Statement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 4.1.1 Labeled Contours (Instance Level) . . . . . . . . . . . . . . . . . 15 4.1.2 Positive Bounding Boxes (Bag Level) . . . . . . . . . . . . . . . 15 4.1.3 Negative Bounding Boxes (Bag Level) . . . . . . . . . . . . . . . 16 4.2 Linear Regression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 4.3 Benefits From The Bounding Boxes . . . . . . . . . . . . . . . . . . . . 17 4.4 AMIR Formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 4.5 AMIR Optimization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20 4.5.1 Approximation of ”max” . . . . . . . . . . . . . . . . . . . . . . 20 4.5.2 Differentiation of AMIR . . . . . . . . . . . . . . . . . . . . . . 20 4.6 Discussion of Differentiation . . . . . . . . . . . . . . . . . . . . . . . . 22 5 Feature Extraction 23 5.1 Segment-Level Feature . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 5.2 Pixel-Level Feature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 6 Experiment Results 25 6.1 Dataset: Pascal VOC 2007 . . . . . . . . . . . . . . . . . . . . . . . . . 25 6.2 Baselines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 6.3 Experiment I: Multiple Tight Segments . . . . . . . . . . . . . . . . . . 28 6.4 Experiment II: Segment Selection for Object Contour Estimation . . . . . 29 6.5 Experiment III: Semantic Segmentation . . . . . . . . . . . . . . . . . . 31 7 Conclusion 37 Bibliography 39 | |
| dc.language.iso | en | |
| dc.subject | 弱監督式學習 | zh_TW |
| dc.subject | 語意式影像切割 | zh_TW |
| dc.subject | 多重實例迴歸 | zh_TW |
| dc.subject | 切割選擇 | zh_TW |
| dc.subject | segment selection | en |
| dc.subject | Semantic segmentation | en |
| dc.subject | weakly supervised learning | en |
| dc.subject | multiple in- stance regression (MIR) | en |
| dc.title | 多重實例迴歸於定界框內物件輪廓之估測 | zh_TW |
| dc.title | Augmented Multiple Instance Regression For Inferring Object Contours Within Bounding Boxes | en |
| dc.type | Thesis | |
| dc.date.schoolyear | 101-2 | |
| dc.description.degree | 碩士 | |
| dc.contributor.coadvisor | 林彥宇(Yen-Yu Lin) | |
| dc.contributor.oralexamcommittee | 陳祝嵩(Chu-song Chen),賴尚宏(Shang-Hong Lai),陳煥宗(Hwann-Tzong Chen) | |
| dc.subject.keyword | 語意式影像切割,弱監督式學習,多重實例迴歸,切割選擇, | zh_TW |
| dc.subject.keyword | Semantic segmentation,weakly supervised learning,multiple in- stance regression (MIR),segment selection, | en |
| dc.relation.page | 44 | |
| dc.rights.note | 有償授權 | |
| dc.date.accepted | 2013-07-23 | |
| dc.contributor.author-college | 電機資訊學院 | zh_TW |
| dc.contributor.author-dept | 資訊網路與多媒體研究所 | zh_TW |
| 顯示於系所單位: | 資訊網路與多媒體研究所 | |
文件中的檔案:
| 檔案 | 大小 | 格式 | |
|---|---|---|---|
| ntu-102-1.pdf 未授權公開取用 | 13.35 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。
