應用字型生成資料開發環境中文字辨識系統

Yu-An Chen; 陳昱安

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/68716

標題:	應用字型生成資料開發環境中文字辨識系統 Chinese character recognition system in life developed by applying font generation data
作者:	Yu-An Chen 陳昱安
指導教授:	黃乾綱(Chien-Kang Huang)
關鍵字:	中文字元影像辨識,影像處理,資料增強,深度學習,卷積神經網絡, CCR,Image Processing,Data Augmentation,Deep Learning,Convolutional Neural Networks,
出版年 :	2020
學位:	碩士
摘要:	近年來，隨著人工智慧的快速發展，深度學習（Deep Learning）的技術也隨之蓬勃發展，並廣泛應用在各個領域，包括中文字元影像辨識（Chinese Character Recognition）。　　本研究的目的在改善中文漢字之辨識模型建立問題，利用現有電腦系統內建的字型資源來產生文字影像，再經由一系列的影像處理來模擬真實環境中的影像型態，並調整影像內文字本體部分數值，使得在使用機器學習中的卷積神經網路（Convolutional Neural Networks）之技術時能更有效學習到文字架構特徵而非邊界像素點分布之特徵。　　經由實驗結果顯示，使用本研究方法在現代報紙與民初晶報等印刷文件之辨識準確率分別為97.66%與78.21%，在CASIA 公開中文手寫測試集內達到63.15%之辨識準確率，以及在針對ICDAR-2019年ReCTS (Robust Reading Challenge on Reading Chinese Text on Signboard)競賽內之測試資料集，在使用官方提供之訓練資料額外加入本研究方法所產生之文字影像一同訓練，達到91.26%的辨識準確率，上述所提及之辨識表現優於現有OCR系統及方法。　　In recent years, with the rapid development of artificial intelligence, deep learning technology has also been widely applied to various fields, including Chinese Character Recognition. 　　The main purpose of this paper is to solve the problem of Chinese character recognition model building. By using the existing Chinese font resources in computer system to generate text images, and then use a series of image processing to simulate the image in the real environment and adjust the pixel value of text in image. That makes it more effective to learn the features of the text structure rather than the characteristics of the boundary pixel distribution when using the technology of Convolutional Neural Networks in machine learning 　　We conduct our experiments with newspaper and the Jing Newspaper, the CASIA handwritten Chinese character public test set and the Chinese character of ICDAR-2019 ReCTS race testing dataset. The results show that the model and method we proposed in this paper can reach the accuracy of 97.66% on newspaper, 78.21% on Jing Newspaper, 63.15% on handwritten, and 91.26% on ICDAR ReCTS. Compared with the existing common OCR recognition software, our method can improve the accuracy.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/68716
DOI:	10.6342/NTU202003685
全文授權:	有償授權
顯示於系所單位：	工程科學及海洋工程學系

文件中的檔案：

檔案	大小	格式
U0001-1708202009474100.pdf 目前未授權公開取用	6.53 MB	Adobe PDF

顯示文件完整紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料（如：文字、圖片、PDF）並使其易於取用。