Please use this Handle URI to cite this item:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/54628
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 簡韶逸(Shao-Yi Chien) | |
dc.contributor.author | Wei-Chih Tu | en |
dc.contributor.author | 塗偉志 | zh_TW |
dc.date.accessioned | 2021-06-16T03:35:59Z | - |
dc.date.available | 2021-02-20 | |
dc.date.copyright | 2021-02-20 | |
dc.date.issued | 2021 | |
dc.date.submitted | 2021-02-05 | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/54628 | - |
dc.description.abstract | 像素相似度是用來衡量影像中相鄰兩像素之間距離或是連接性的變量,衡量像素相似度是許多影像處理算法中基本但關鍵的步驟,例如影像濾波器中用來計算濾波器核心函數,或是計算全域最佳化問題時用來計算點和點之間的邊權重。 在諸多衡量像素相似度的應用之中,有許多都是和影像分割高度相關的,例如像素相似度影響了影像分割時物體邊界最佳的位置或是群聚方法中用來決定是否要合併兩塊相鄰的分割以形成更大的分割,本文以衡量像素相似度為角度探討一系列和影像分割有關的問題,我們展示了像素相似度帶來的方便計算性、多功能性甚至讓我們能夠藉由學習像素相似度來利用深度學習輔助超像素分割問題。 具體來說,我們在本文的第一個部分探討有效率的距離轉換問題,藉由距離轉換以及適當的像素相似度估測函數,我們便能夠衡量影像中任意兩點在該函數下的距離,我們提出的距離轉換法具有線性時間複雜度,搭配最小屏障距離函數,我們展示了實時顯著物體偵測,同樣的距離轉換法也可以應用在互動式影像分割及眼睛影像瞳孔偵測等。第二部分,我們討論了像素相似度對於遞歸濾波器的影響,我們提出一種新型遞歸濾波器可以直接作用在二維影像平面,我們用這個濾波器展示了包含保邊影像平滑、材質去除、語意影像分割優化等應用,這些應用都採用相同的濾波器計算方式只是替換了像素相似度函數便可以達成不同效果。第三部分,我們利用學習像素相似度進而使深度學習能夠用來解超像素分割的問題,由於超像素分割問題本身沒有正確答案,且超像素表示法的編號絕對值沒有意義,這個問題不容易用深度學習搭配監督式學習法來解決,我們提出一個新的損失函數,可以利用一般影像分割的資料集來引導超像素分割的學習,在學習的過程中我們同時利用超像素切割的結果計算回饋信號來輔助深度學習更好的修正模型。 我們在一系列問題中提出的解決辦法都圍繞在衡量像素相似度上,通過大量實驗的驗證,我們的方法同時具有準確性及計算效率,這是衡量像素相似度所帶來的方便性及多元性的好處。 | zh_TW |
dc.description.abstract | Pixel affinities describe the distance or connectivity between neighboring pixels in an image. Measuring pixel affinities is a fundamental yet essential step in many image processing algorithms, such as computing the filter kernel for image filtering or determining edge weights in a global optimization framework. Among these applications, pixel affinities are especially relevant to image segmentation, where they play an essential role in determining where to place segmentation boundaries and whether to merge two disjoint segments. In this thesis, we address several segmentation-related tasks by explicitly measuring pixel affinities. In the first part, we show how to measure pixel affinities for distance transforms in order to locate salient objects in an image. In the second part, we study the impact of pixel affinities on recursive image filtering: we propose a novel recursive filtering method and demonstrate its applications in edge-aware smoothing, texture removal, and semantic segmentation, all achieved within a single filtering framework by merely changing the way pixel affinities are computed. In the third part, we present the first deep-learning-based superpixel segmentation algorithm, which produces semantically more meaningful segments; by explicitly learning pixel affinities, we make the learning of superpixels possible. The proposed algorithms are evaluated quantitatively and qualitatively on various applications, including semantic segmentation, superpixel generation, and salient object detection. Experimental results show that our algorithms are both effective and efficient in these applications. | en |
dc.description.provenance | Made available in DSpace on 2021-06-16T03:35:59Z (GMT). No. of bitstreams: 1 U0001-0502202100073700.pdf: 102121360 bytes, checksum: a72c1da7dc198cdaa9dc89779b3477ec (MD5) Previous issue date: 2021 | en |
dc.description.tableofcontents | Abstract (P.i) List of Figures (P.vii) List of Tables (P.xiii) 1 Introduction (P.1) 2 Distance Transform for Segmentation (P.5) 2.1 Introduction (P.5) 2.2 Background (P.6) 2.3 Fast Distance Transform with Minimum Spanning Tree (P.10) 2.3.1 Image as a minimum spanning tree (P.10) 2.3.2 MST-based distance transform (P.11) 2.3.3 Complexity analysis (P.14) 2.4 Application to Salient Object Detection (P.16) 2.4.1 Measuring boundary connectivity (P.18) 2.4.2 Post-processing (P.18) 2.5 Experimental Results (P.21) 2.5.1 Experimental settings (P.21) 2.5.2 Computational efficiency (P.21) 2.5.3 Quantitative and qualitative evaluation (P.23) 2.5.4 Limitations (P.28) 2.6 Extension to Other Applications (P.28) 2.7 Summary (P.30) 3 Spatial Propagation via Recursive Filtering (P.33) 3.1 Introduction (P.33) 3.2 Related Work (P.37) 3.2.1 Recursive filtering (P.37) 3.2.2 Learning pixel affinities (P.39) 3.3 Two-Way Recursive Filtering (P.40) 3.4 Two-Way Recursive Image Smoothing (P.43) 3.4.1 Gradient-based pixel affinity for edge-aware filtering (P.43) 3.4.2 Iterative filtering and convergence analysis (P.44) 3.4.3 Extension to joint filtering (P.49) 3.5 Learning Pixel Affinities for TWRF (P.51) 3.5.1 TWRF layer (P.51) 3.5.2 TWRF for segmentation refinement (P.53) 3.6 Parallel Implementation (P.55) 3.7 Experiments for Edge-Aware Filtering (P.57) 3.7.1 Edge-aware smoothing quality (P.57) 3.7.2 Effect of iterations (P.59) 3.7.3 Application to image detail enhancement (P.60) 3.8 Experiments for Segmentation Refinement (P.62) 3.8.1 Implementation details (P.62) 3.8.2 Comparison of the SPN models (P.63) 3.8.3 Comparison of the end-to-end models (P.67) 3.9 Computational Efficiency (P.68) 3.10 Summary (P.71) 4 Learning Pixel Affinities for Superpixel Segmentation (P.73) 4.1 Introduction (P.73) 4.2 Related Work (P.75) 4.2.1 Graph-based algorithms (P.75) 4.2.2 Clustering-based algorithms (P.76) 4.2.3 Other approaches (P.77) 4.3 Superpixels Meet Deep Learning 
(P.77) 4.4 Learning Segmentation-Aware Affinities (P.79) 4.4.1 Segmentation-aware loss (P.80) 4.4.2 Pixel Affinity Net (P.82) 4.5 Experiments (P.83) 4.5.1 Performance metrics (P.83) 4.5.2 Implementation details (P.85) 4.5.3 Comparisons with baselines (P.87) 4.5.4 Comparisons with the state-of-the-arts (P.90) 4.5.5 Ablation study of PAN (P.96) 4.5.6 Generalization to other graph-based algorithms (P.97) 4.6 Applications (P.97) 4.6.1 Semantic segmentation (P.98) 4.6.2 Salient object detection (P.99) 4.7 Summary (P.100) 5 Conclusion (P.101) Reference (P.103) | |
dc.language.iso | en | |
dc.title | 用於影像分割之像素相似度估測 | zh_TW |
dc.title | Towards Learning Pixel Affinities for Image Segmentation and Beyond | en |
dc.type | Thesis | |
dc.date.schoolyear | 109-1 | |
dc.description.degree | Doctoral | |
dc.contributor.oralexamcommittee | 盧奕璋 (Yi-Chang Lu), 陳宏銘 (Homer H. Chen), 王鈺強 (Yu-Chiang Frank Wang), 賴尚宏 (Shang-Hong Lai), 林彥宇 (Yen-Yu Lin) | |
dc.subject.keyword | 像素相似度,影像分割,遞歸濾波器,距離轉換,超像素 | zh_TW |
dc.subject.keyword | Pixel Affinity, Image Segmentation, Recursive Filter, Distance Transform, Superpixel | en |
dc.relation.page | 118 | |
dc.identifier.doi | 10.6342/NTU202100552 | |
dc.rights.note | Paid authorization | |
dc.date.accepted | 2021-02-06 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 電子工程學研究所 | zh_TW |
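The first part of the abstract pairs a linear-time distance transform with the minimum barrier distance (MBD) for real-time salient object detection. As a rough illustration of what an MBD transform computes, here is a raster-scan approximation in the spirit of Zhang et al.'s FastMBD (ICCV 2015), not the thesis's MST-based linear-time algorithm; the function name `mbd_transform`, its parameters, and the seed convention are our own assumptions:

```python
import numpy as np

def mbd_transform(img, seed_mask, n_iter=4):
    """Approximate minimum barrier distance (MBD) transform.

    The barrier of a path is max(I) - min(I) along it; each pixel keeps
    the smallest barrier over paths from the seed set. Illustrative
    raster-scan sketch only, not the thesis's exact algorithm.
    """
    h, w = img.shape
    img = img.astype(float)
    dist = np.full((h, w), np.inf)  # best barrier cost found so far
    hi = img.copy()                 # path maximum along the current best path
    lo = img.copy()                 # path minimum along the current best path
    dist[seed_mask] = 0.0
    for it in range(n_iter):
        if it % 2 == 0:             # forward pass: relax from left/up neighbors
            ys, xs, offs = range(h), range(w), ((0, -1), (-1, 0))
        else:                       # backward pass: relax from right/down neighbors
            ys, xs, offs = range(h - 1, -1, -1), range(w - 1, -1, -1), ((0, 1), (1, 0))
        for y in ys:
            for x in xs:
                for dy, dx in offs:
                    ny, nx = y + dy, x + dx
                    # Only extend paths that already reach the seed set.
                    if 0 <= ny < h and 0 <= nx < w and np.isfinite(dist[ny, nx]):
                        cand_hi = max(hi[ny, nx], img[y, x])
                        cand_lo = min(lo[ny, nx], img[y, x])
                        if cand_hi - cand_lo < dist[y, x]:
                            dist[y, x] = cand_hi - cand_lo
                            hi[y, x], lo[y, x] = cand_hi, cand_lo
    return dist
```

With seeds placed on the image boundary (a common background prior), pixels with a large barrier distance are candidate salient-object pixels.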
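The second part of the abstract describes recursive filtering driven by pixel affinities: the same recursion yields different behaviors purely by swapping the affinity function. A minimal 1-D sketch of that idea, assuming a simple exponential affinity (this is not the thesis's exact two-way recursive filter, and `alpha`/`sigma_r` are hypothetical parameters):

```python
import numpy as np

def affinity(x_cur, x_prev, alpha=0.8, sigma_r=0.1):
    # Affinity is high between similar neighbors and drops across strong edges,
    # so propagation stops at edges (edge-aware smoothing).
    return alpha * np.exp(-abs(x_cur - x_prev) / sigma_r)

def recursive_filter_1d(x, alpha=0.8, sigma_r=0.1):
    # Causal (left-to-right) pass: y[i] blends the input with the running result.
    y = np.array(x, dtype=float)
    for i in range(1, len(y)):
        a = affinity(x[i], x[i - 1], alpha, sigma_r)
        y[i] = (1.0 - a) * x[i] + a * y[i - 1]
    # Anti-causal (right-to-left) pass on the intermediate result.
    z = y.copy()
    for i in range(len(z) - 2, -1, -1):
        a = affinity(x[i], x[i + 1], alpha, sigma_r)
        z[i] = (1.0 - a) * y[i] + a * z[i + 1]
    return z
```

Because the blend weights sum to one, a constant signal passes through unchanged, while a step edge survives almost exactly: the affinity across the discontinuity is near zero, so no smoothing leaks across it.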
Appears in Collections: | Graduate Institute of Electronics Engineering
Files in This Item:
File | Size | Format |
---|---|---|---|
U0001-0502202100073700.pdf (currently not authorized for public access) | 99.73 MB | Adobe PDF |
All items in this repository are protected by copyright, with all rights reserved, unless otherwise indicated.