Please use this Handle URI to cite this item:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/71416
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 陳炳宇(Bing-Yu Chen) | |
dc.contributor.author | I-Chao Shen | en |
dc.contributor.author | 沈奕超 | zh_TW |
dc.date.accessioned | 2021-06-17T06:00:21Z | - |
dc.date.available | 2022-12-01 | |
dc.date.copyright | 2020-12-09 | |
dc.date.issued | 2020 | |
dc.date.submitted | 2020-12-03 | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/71416 | - |
dc.description.abstract | 絕大部分的人類都是影像的狂熱消費者。儘管如此,大部分的人只能使用並觀看視覺資料,只有少部分的人有足夠的專業和天份,能夠有效率地利用影像資料來表達自己。即使是最普遍的二維視覺資料,如影像和影片,大部分的人都沒辦法有效率地從頭產生它們,或是修改這些資料來增加它們的美感。比如說,專業的美工人員可以有效率地利用向量圖軟體來產生二維標誌圖片;相對的,一般使用者經常需要花費很長的時間,但還是沒辦法產生有美感的圖片設計。
在這篇論文研究中,我們調查並探索了幾種資料驅動的方法來弭平這個不對等的分佈,主要透過結合人類的先備知識以及嶄新的最佳化演算法來達到這個目的。首先,我們探索了如何讓使用者直接探索生成影像模型 (generative image modeling) 的潛在空間,以找到他想要的圖片。我們的方法提供多個滑桿 (slider),讓使用者更有效率地瀏覽可能生成的圖片,並允許使用者透過影像編輯工具來指定想要的影像特徵。接著,我們探索了如何產生符合人類視覺期望的半結構化 (semi-structured) 美工圖片向量化 (vectorization) 演算法;這些半結構化的美工圖片往往具備區塊顏色區別性強、邊界部分連續的特性。我們利用以前關於人類視覺對形狀的反應的研究,來產生符合人類視覺系統預期的結果。同時,我們也探索了如何利用單一物品類別的標籤來自動產生這些二維向量美工圖。最後,我們提出了一個演算法和系統,來幫助使用者設計多視角的向量美工圖案。在這些研究的過程中,我們透過線上群眾外包平台,以人類感知的比較作為衡量標準。從結果中可以看到,我們提出的方法都能夠準確地捕捉人類的先備知識和喜好;也因此,我們的方法產生的設計結果能夠獲得較多使用者的喜愛。未來,我們預想我們提出的這些方法和經驗,可以為之後嘗試設計計算輔助系統的研究提供一個重要的基礎。 | zh_TW |
dc.description.abstract | Humans have avidly consumed visual content for a very long time, and the magnitude of this consumption has grown rapidly in the past few years with the spread of online social networks and content-sharing services such as Facebook, Instagram, and YouTube. However, there is a huge asymmetry: while everybody avidly consumes visual data, only a few people are talented enough to express themselves visually in an effective way. Even for the most common visual content, such as 2D images and videos, most of us cannot efficiently design it from scratch or manipulate it to enhance its aesthetics. For example, a professional artist can generate a 2D icon quite efficiently using a vector graphics authoring tool, whereas a novice user often spends long hours and still fails to produce an aesthetic design.
In this dissertation, we investigate several data-driven approaches for eliminating this asymmetry by combining human priors (including preferences and knowledge) with novel optimization methods. First, we investigate how to let users control the image generation process of a deep generative model. Second, we investigate methods for generating 2D clipart from existing low-resolution raster icon images and from single category labels. Third, we investigate how to generate 2D clipart from unseen viewpoints given only a single viewpoint. Specifically, we propose the following three human-guided optimization methods to facilitate efficient 2D visual content design. 1. We present a human-in-the-optimization method that allows users to directly explore and search the latent space of a generative image model. Our system presents multiple candidates, and the user selects the best blending result using multiple sliders and image editing tools. 2. We propose approaches (i) to convert artist-drawn raster images to vector form and (ii) to generate 2D vector clipart directly from a single category label. We leverage previous studies on human perception of shape to generate vector images consistent with viewer expectations, and we design a generative model that synthesizes clipart directly from a category label, trained on ClipNet, a new clipart dataset of man-made objects. 3. We design an assistive system for clipart design that provides visual scaffolds from unseen viewpoints, combining user-provided structural information and automatically predicted 3D structure in a novel curve-extrusion optimization method. We evaluated these methods using perceptual comparisons through online crowdsourcing.
The results showed that our proposed methods accurately capture various aspects of human priors and provide meaningful support for various design activities; thus, users of our methods obtain better visual content than they do with alternative methods. We envision that these methods, and the experience gained in this study, will provide a solid foundation for future research on computational assistive design systems for more complicated visual content. | en |
dc.description.provenance | Made available in DSpace on 2021-06-17T06:00:21Z (GMT). No. of bitstreams: 1 U0001-2611202009262100.pdf: 67567711 bytes, checksum: f67ddd39db7bc61d4147eedd14610cfe (MD5) Previous issue date: 2020 | en |
dc.description.tableofcontents | Acknowledgments; 摘要; Abstract; 1 Introduction; 1.1 Human-in-the-loop Optimization for Steering Generative Image Modeling; 1.2 Perception-Driven Clipart Vectorization and Synthesis; 1.3 Structural-guidance for Multi-view Clipart Design; 2 Related Work; 2.1 Image vectorization and clipart synthesis; 2.2 Curve fitting; 2.3 Corner detection; 2.4 Generative model; 2.5 Image and shape dataset; 2.6 Interactive Generative Image Modeling; 2.7 Bayesian Optimization with Gaussian Process; 3 Human-in-the-loop Optimization for Steering Generative Image Modeling; 3.1 Overview; 3.2 User interface; 3.2.1 Multi-way slider; 3.2.2 Image editing tools; 3.3 Method; 3.3.1 Sequential Subspace Search; 3.3.2 Preference learning by Bayesian optimization; 3.3.3 Content-aware sampling strategy; 3.4 Applications and Evaluation; 3.4.1 Applications; 3.4.2 Comparison to iGAN; 3.4.3 Ablation study; 3.5 Discussion, limitations and future work; 3.6 Chapter conclusion; 4 Perception-Driven Semi-Structured Boundary Vectorization; 4.1 Algorithm Overview; 4.2 Initial Data-driven Corner Prediction; 4.2.1 Learning Corner Likelihood; 4.2.2 Training Data; 4.3 Perception-Driven Corner Removal; 4.3.1 Piecewise Smooth Vectorization; 4.3.2 Corner Removal Iterations; 4.3.3 Computing Global Context Cues; 4.4 Boundary Regularization; 4.5 Multi-Color Inputs; 4.6 Results and Validation; 4.7 Chapter conclusion; 5 ClipGen: A Deep Generative Model for Clipart Vectorization and Synthesis; 5.1 ClipNet: Man-made Object Clipart Collection; 5.1.1 Data Characteristics; 5.1.2 Data Preprocessing; 5.2 Problem Formulation; 5.3 Synthesis Model; 5.3.1 Visual representation of canvas; 5.3.2 First step: continue to add layer?; 5.3.3 Second step: what path to add next?; 5.3.4 Shape Regularization; 5.4 Results and Evaluations; 5.4.1 Data analysis; 5.4.2 Implementation detail; 5.4.3 Ablation study; 5.4.4 Application; 5.5 Chapter conclusion; 5.5.1 Limitations; 6 Structural-guidance for Multi-view Clipart Design; 6.1 Visual Scaffold Synthesis; 6.1.1 Single-view guiding shape synthesis; 6.1.2 User-assisted curve extrusion; 6.2 User Interface; 6.3 Results and Evaluation; 6.3.1 User study; 6.4 Chapter conclusion; 7 Conclusion and Future Vision; Bibliography | |
dc.language.iso | en | |
dc.title | 利用使用者引導之最佳化的二維內容設計 | zh_TW |
dc.title | 2D Visual Content Design Driven by Human-Guided Optimization | en |
dc.type | Thesis | |
dc.date.schoolyear | 109-1 | |
dc.description.degree | 博士 (Doctoral) | |
dc.contributor.author-orcid | 0000-0003-4201-3793 | |
dc.contributor.oralexamcommittee | 莊永裕(Yung-Yu Chuang),歐陽明(Ming Ouhyoung),陳維超(Wei-Chao Chen),王鈺強(Yu-Chiang Wang),林文杰(Wen-Chieh Lin) | |
dc.subject.keyword | 電腦圖學, 向量圖, 數值最佳化, 機器學習 | zh_TW |
dc.subject.keyword | computer graphics, vector graphics, machine learning, numerical optimization, human-in-the-loop | en |
dc.relation.page | 141 | |
dc.identifier.doi | 10.6342/NTU202004361 | |
dc.rights.note | 有償授權 (Authorized with fee) | |
dc.date.accepted | 2020-12-03 | |
dc.contributor.author-college | 電機資訊學院 (College of Electrical Engineering and Computer Science) | zh_TW |
dc.contributor.author-dept | 資訊網路與多媒體研究所 (Graduate Institute of Networking and Multimedia) | zh_TW |
Appears in Collections: | Graduate Institute of Networking and Multimedia (資訊網路與多媒體研究所)
Files in This Item:
File | Size | Format | |
---|---|---|---|
U0001-2611202009262100.pdf (currently not authorized for public access) | 65.98 MB | Adobe PDF |
All items in this repository are protected by copyright, with all rights reserved, unless otherwise indicated.