基於拉普拉斯金字塔的深度人臉影像光影重建

Yu Tseng; 曾瑀

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/70884

標題:	基於拉普拉斯金字塔的深度人臉影像光影重建 Deep Face Relighting using Laplacian Pyramid
作者:	Yu Tseng 曾瑀
指導教授:	莊永裕(Yung-Yu Chuang)
關鍵字:	人臉光影重建,打光模式,拉普拉斯金字塔,深度影像,人臉剖析, face relighting,lighting pattern,Laplacian Pyramid,Depth map,Face parsing,
出版年 :	2018
學位:	碩士
摘要:	這篇論文提出了一個人臉的光影重建系統，將參考的人臉影像之打光模式重建到目標人臉影像上。我們提出了一個深度學習的架構，並使用有兩個輸入分支的U-Net模組，使我們可以同時對參考人臉影像以及目標人臉影像進行學習。我們的學習目標不定在色彩空間上，而是讓模組學習人臉的拉普拉斯金字塔。假設金字塔的高頻段保留了更多材質資訊，我們將學習目標定在金字塔的低頻部分，並保留原始的高頻部分。實驗結果顯示這樣的設計可以更好的適應不同色光並保留更多細節。我們亦將人臉的深度影像以及人臉的面部剖析分割圖與人臉影像串聯做為輸入，提供給模組更多的人臉幾何資訊。我們進行了讀者研究來評估我們的系統。在研究中，我們要求使用者指出重建光影後的影像其光線來向。此研究中我們達到了82.4%的正確辨識率。 This thesis presents a human face relighting system, which relights the target human face image in a specific lighting pattern according to the reference face image. We propose a 2-Input-Stream U-Net structure to learn from the target face and reference face simultaneously. Instead of learning the face on color space, we operate on frequency space by learning the Laplacian Pyramid of the face. Under the assumption that the high-frequency part preserves more information of textures, we only learn the low-frequency part and the high-frequency part remains the same. The experiment result shows that the model is more adaptive to lighting color and preserves more details. We also concatenate the depth map and face parsing segmentation map to the input stack to provide facial geometry information. To evaluate our model, we conduct a user study by asking users to identify the lighting direction of the relit image. We obtained 82.4% accuracy on this survey.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/70884
DOI:	10.6342/NTU201802521
全文授權:	有償授權
顯示於系所單位：	資訊網路與多媒體研究所

文件中的檔案：

檔案	大小	格式
ntu-107-1.pdf 未授權公開取用	4.94 MB	Adobe PDF

顯示文件完整紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料（如：文字、圖片、PDF）並使其易於取用。