Please use this Handle URI to cite this item:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/74184
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 廖世偉(Shih-wei Liao) | |
dc.contributor.author | Po-Wei Wu | en |
dc.contributor.author | 吳柏威 | zh_TW |
dc.date.accessioned | 2021-06-17T08:23:22Z | - |
dc.date.available | 2019-08-19 | |
dc.date.copyright | 2019-08-19 | |
dc.date.issued | 2019 | |
dc.date.submitted | 2019-08-13 | |
dc.identifier.citation | [1] D. Berthelot, C. Raffel, A. Roy, and I. Goodfellow. Understanding and improving interpolation in autoencoders via an adversarial regularizer. arXiv preprint arXiv:1807.07543, 2018.
[2] A. Brock, J. Donahue, and K. Simonyan. Large scale GAN training for high fidelity natural image synthesis. arXiv preprint arXiv:1809.11096, 2018.
[3] Y. Choi, M. Choi, M. Kim, J.-W. Ha, S. Kim, and J. Choo. StarGAN: Unified generative adversarial networks for multi-domain image-to-image translation. In CVPR, 2018.
[4] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio. Generative adversarial nets. In NIPS, 2014.
[5] K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. In CVPR, 2016.
[6] Z. He, W. Zuo, M. Kan, S. Shan, and X. Chen. Arbitrary facial attribute editing: Only change what you want. arXiv preprint arXiv:1711.10678, 2017.
[7] M. Heusel, H. Ramsauer, T. Unterthiner, B. Nessler, and S. Hochreiter. GANs trained by a two time-scale update rule converge to a local Nash equilibrium. In NIPS, pages 6626–6637, 2017.
[8] X. Huang, M.-Y. Liu, S. Belongie, and J. Kautz. Multimodal unsupervised image-to-image translation. In ECCV, 2018.
[9] P. Isola, J.-Y. Zhu, T. Zhou, and A. A. Efros. Image-to-image translation with conditional adversarial networks. In CVPR, 2017.
[10] A. Jolicoeur-Martineau. The relativistic discriminator: a key element missing from standard GAN. arXiv preprint arXiv:1807.00734, 2018.
[11] T. Karras, T. Aila, S. Laine, and J. Lehtinen. Progressive growing of GANs for improved quality, stability, and variation. In ICLR, 2018.
[12] T. Karras, S. Laine, and T. Aila. A style-based generator architecture for generative adversarial networks. In CVPR, 2019.
[13] D. P. Kingma and J. Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
[14] G. Lample, N. Zeghidour, N. Usunier, A. Bordes, L. Denoyer, et al. Fader networks: Manipulating images by sliding attributes. In NIPS, 2017.
[15] M.-Y. Liu, T. Breuel, and J. Kautz. Unsupervised image-to-image translation networks. In NIPS, 2017.
[16] Z. Liu, P. Luo, X. Wang, and X. Tang. Deep learning face attributes in the wild. In ICCV, 2015.
[17] P. Luo, J. Ren, and Z. Peng. Differentiable learning-to-normalize via switchable normalization. arXiv preprint arXiv:1806.10779, 2018.
[18] X. Mao, Q. Li, H. Xie, R. Y. K. Lau, Z. Wang, and S. P. Smolley. On the effectiveness of least squares generative adversarial networks. PAMI, 2018.
[19] M. Mirza and S. Osindero. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784, 2014.
[20] T. Miyato and M. Koyama. cGANs with projection discriminator. arXiv preprint arXiv:1802.05637, 2018.
[21] A. Odena, C. Olah, and J. Shlens. Conditional image synthesis with auxiliary classifier GANs. In Proceedings of the 34th International Conference on Machine Learning, Volume 70, pages 2642–2651. JMLR.org, 2017.
[22] G. Perarnau, J. van de Weijer, B. Raducanu, and J. M. Álvarez. Invertible Conditional GANs for image editing. In NIPS Workshop on Adversarial Training, 2016.
[23] A. Pumarola, A. Agudo, A. M. Martinez, A. Sanfeliu, and F. Moreno-Noguer. GANimation: Anatomically-aware facial animation from a single image. In ECCV, 2018.
[24] S. Reed, Z. Akata, X. Yan, L. Logeswaran, B. Schiele, and H. Lee. Generative adversarial text to image synthesis. In ICML, 2016.
[25] R. Sun, C. Huang, J. Shi, and L. Ma. Mask-aware photorealistic face attribute manipulation. arXiv preprint arXiv:1804.08882, 2018.
[26] Z. Yi, H. R. Zhang, P. Tan, and M. Gong. DualGAN: Unsupervised dual learning for image-to-image translation. In ICCV, 2017.
[27] G. Zhang, M. Kan, S. Shan, and X. Chen. Generative adversarial network with spatial attention for face attribute editing. In ECCV, 2018.
[28] H. Zhang, T. Xu, H. Li, S. Zhang, X. Wang, X. Huang, and D. Metaxas. StackGAN++: Realistic image synthesis with stacked generative adversarial networks. arXiv preprint arXiv:1710.10916, 2017.
[29] B. Zhao, B. Chang, Z. Jie, and L. Sigal. Modular generative adversarial networks. In ECCV, 2018.
[30] J.-Y. Zhu, T. Park, P. Isola, and A. A. Efros. Unpaired image-to-image translation using cycle-consistent adversarial networks. In ICCV, 2017. | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/74184 | - |
dc.description.abstract | 多域圖像到圖像轉換近來受到越來越多的關注。以往的方法以圖像和一組目標屬性作為輸入,並生成具有所需屬性的輸出圖像。然而,這類方法有一個局限:即使大多數屬性不會被更改,仍需指定完整的屬性集合。為了解決此局限,我們提出 RA-GAN,一種新穎且實用的多域圖像到圖像轉換方法。其關鍵想法是使用相對屬性,用以描述所選屬性上期望的變化。為此,我們提出一個對抗式框架,學習單一生成器,使轉換後的圖像不僅符合相對屬性,也具有更佳的品質。此外,我們的生成器能夠以連續的方式改變感興趣的特定屬性來修改圖像,同時保留其他屬性。實驗結果在面部屬性轉移與插值任務上,定性與定量地證明了我們方法的有效性。 | zh_TW |
dc.description.abstract | Multi-domain image-to-image translation has gained increasing attention recently. Previous methods take an image and some target attributes as inputs and generate an output image that has the desired attributes. However, this has one limitation: they require specifying the entire set of attributes, even though most of them will not be changed. To address this limitation, we propose RA-GAN, a novel and practical formulation for multi-domain image-to-image translation. The key idea is the use of relative attributes, which describe the desired changes on selected attributes. To this end, we propose an adversarial framework that learns a single generator to translate images so that they not only match the relative attributes but also exhibit better quality. Moreover, our generator is capable of modifying images by changing particular attributes of interest in a continuous manner while preserving the other ones. Experimental results demonstrate the effectiveness of our approach, both qualitatively and quantitatively, on the tasks of facial attribute transfer and interpolation. | en |
dc.description.provenance | Made available in DSpace on 2021-06-17T08:23:22Z (GMT). No. of bitstreams: 1 ntu-108-R06922074-1.pdf: 12030496 bytes, checksum: 681bb93e07730dc804aec7d99adc8c5c (MD5) Previous issue date: 2019 | en |
dc.description.tableofcontents | 口試委員會審定書
誌謝
Acknowledgements
摘要
Abstract
1 Introduction
2 Related Work
3 Method
3.1 Relative Attributes
3.2 Adversarial Loss
3.3 Conditional Adversarial Loss
3.4 Reconstruction loss
3.5 Interpolation loss
3.6 Full loss
3.7 Differences with Previous Methods
4 Implementation
5 Experiments
5.1 Dataset
5.2 Facial attribute transfer
5.3 Facial attribute interpolation
5.4 User Study
6 Conclusion
Bibliography
Appendix
.1 Usage Scenario
.2 Training Process and Network Architecture
.3 Ablation Study
.4 Additional Qualitative Results | |
dc.language.iso | en | |
dc.title | RA-GAN: 多重領域圖像轉換使用相對屬性值 | zh_TW |
dc.title | RA-GAN: Multi-domain Image-to-Image Translation via Relative Attributes | en |
dc.type | Thesis | |
dc.date.schoolyear | 107-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 賴尚宏(Shang-Hong Lai),傅楸善(Chiou-Shann Fuh) | |
dc.subject.keyword | 深度學習, 生成對抗網路, 相對屬性, 多領域圖像轉換 | zh_TW |
dc.subject.keyword | deep learning, generative adversarial network, relative attributes, multi-domain image-to-image translation | en |
dc.relation.page | 50 | |
dc.identifier.doi | 10.6342/NTU201902059 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2019-08-13 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 資訊工程學研究所 | zh_TW |
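The central idea in the abstract, relative attributes, lends itself to a short illustration. The sketch below is a minimal toy example under assumptions: the `ToyGenerator`, the attribute names, and the concatenation-based conditioning are hypothetical stand-ins chosen for brevity, not the thesis's actual RA-GAN architecture. It shows how a relative-attribute vector encodes only the desired change (unchanged attributes are simply zero, so they need not be specified) and how scaling that vector gives continuous attribute interpolation.

```python
# Minimal sketch of the relative-attribute idea (hypothetical names and
# shapes; NOT the thesis's actual RA-GAN code).
import torch
import torch.nn as nn

class ToyGenerator(nn.Module):
    """Toy conditional generator: concatenates a relative-attribute
    vector with the image channels, a common conditioning scheme."""
    def __init__(self, num_attrs: int):
        super().__init__()
        self.conv = nn.Conv2d(3 + num_attrs, 3, kernel_size=3, padding=1)

    def forward(self, img: torch.Tensor, rel_attr: torch.Tensor) -> torch.Tensor:
        b, _, h, w = img.shape
        # Broadcast the attribute vector over the spatial dimensions.
        attr_map = rel_attr.view(b, -1, 1, 1).expand(b, rel_attr.size(1), h, w)
        return torch.tanh(self.conv(torch.cat([img, attr_map], dim=1)))

# Relative attributes = target - source: only the selected attributes are
# nonzero, so the full attribute set never has to be specified.
src = torch.tensor([[0., 1., 0.]])  # e.g., [blond, male, smiling]
tgt = torch.tensor([[1., 1., 0.]])  # only "blond" is changed
rel = tgt - src                     # [[1., 0., 0.]]

G = ToyGenerator(num_attrs=3)
x = torch.randn(1, 3, 128, 128)     # a dummy input image
y_full = G(x, rel)                  # full attribute transfer
y_half = G(x, 0.5 * rel)           # continuous interpolation by scaling
```

Scaling the relative-attribute vector by a factor in [0, 1] is one simple way to realize the continuous modification the abstract describes; per the table of contents, the thesis itself adds dedicated reconstruction and interpolation losses (Sections 3.4 and 3.5) to keep such intermediate outputs well-behaved.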
Appears in Collections: | 資訊工程學系
Files in This Item:
File | Size | Format | |
---|---|---|---|
ntu-108-1.pdf (currently not authorized for public access) | 11.75 MB | Adobe PDF |
All items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.