基於深度學習方法之單張文件影像陰影去除

Yun-Hsuan Lin; 林昀宣

Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/64961

Title:	基於深度學習方法之單張文件影像陰影去除 Deep Learning-based Approach for Single Document Image Shadow Removal
Authors:	Yun-Hsuan Lin 林昀宣
Advisor:	陳文進(Wen-Chin Chen)
Co-Advisor:	莊永裕(Yung-Yu Chuang)
Keyword:	陰影去除,文件影像處理,深度學習,條件生成對抗式網路, Shadow Removal,Document Image Processing,Deep Learning,Conditional Generative Adversarial Network,
Publication Year :	2019
Degree:	碩士
Abstract:	在本篇論文之中，我們提出一個深度學習模型BEDSR-Net，專門設計為對一般文件影像進行陰影去除。文件通常具有一個共通的全局背景顏色的資訊，因此我們利用深度學習方式使模型學到如何預測整張文件的全局背景顏色資訊。在模型訓練的過程，模型亦同時掌握了文件影像中陰影和非陰影的位置資訊，透過將模型的中間產物特徵圖視覺化以熱度圖方式呈現，此熱度圖可被定位為表達了文件影像中陰影分布的陰影遮罩。透過全局背景顏色以及陰影位置資訊的協助，我們提出的深度學習架構BEDSR-Net將有效對原圖進行陰影去除，且在大部分的評比之中，我們的效果在各方數據均表現優異，整體來說更優於前人的方法。除此之外，BEDSR-Net僅在合成資料集上進行訓練，應用在實際評比用資料集時表現依舊亮眼，這也反映出我們的模型架構對於表現的穩定度上是有明顯的幫助。在本論文中，對於文件影像陰影去除這個任務，我們收集了兩個資料集，分別為合成影像資料集SDSRD以及實際影像資料集DSRD，前者提供了深度學習在這個領域中足夠的訓練資料，並在文件種類和光線複雜度的這兩個面向中達到了足夠的豐富度；後者更涵蓋了大量複雜文件，可作為一個比較模型表現優劣上更泛用的資料集。 In this paper, we propose a novel deep neural network architecture, named BEDSR-Net, which is designed to remove shadow from document images. With our observation that documents usually have single global background color, we utilize deep learning technique to detect the color from a document image. While training process, our model is able to understand the shadow distribution in an image, including intensity and location. We further visualize the knowledge about shadow distribution of our model in the form of heatmap. The heatmap is capable of precisely denoting the shadow location. With the assistance of global background color and the heatmap, our model, BEDSR-Net, achieves state-of-the-art in most evaluation comparison with previous works in the field of document images shadow removal. Also, our model, only trained with a synthetic dataset, still outperforms others in real benchmark datasets, which indeed shows our proposed model's stability and robustness. Besides, we collect two datasets in this task, including a synthetic dataset (SDSRD) and a real dataset (DSRD). The former one enables the training process of deep learning approach in this task while the latter one can be served as a much more general benchmark dataset. Both SDSRD and DSRD are aimed at capturing more diverse scenario.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/64961
DOI:	10.6342/NTU201902033
Fulltext Rights:	有償授權
Appears in Collections:	資訊網路與多媒體研究所

Files in This Item:

File	Size	Format
ntu-108-1.pdf Restricted Access	26.48 MB	Adobe PDF

Show full item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets