利用電腦字型建立卷積神經網絡之中文漢字模型進行手寫與印刷字體辨識

Yu-An Li; 李育安

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/72156

標題:	利用電腦字型建立卷積神經網絡之中文漢字模型進行手寫與印刷字體辨識 Handwritten and Printed Chinese Character Recognition By Using Computer Font Type Chinese Characters into Convolutional Neural Network
作者:	Yu-An Li 李育安
指導教授:	黃乾綱
關鍵字:	中文漢字手寫字元辨識,中文漢字印刷體字元辨識,影像處理,機器學習,卷積神經網絡, HCCR,PCCR,Image Processing,Machine Learning,Convolutional Neural Networks,
出版年 :	2018
學位:	碩士
摘要:	本研究的目的在改善中文漢字的手寫與印刷字體之辨識。利用現有網路上與電腦內建的現存的不同風格的字型資源，取常用的5000及10000字，並搭配影像處理技術，對這些字體做數種變形與前處理來產生所需要的訓練資料。運用機器學習中的卷積神經網路(Convolutional Neural Networks)之技術，訓練出一個同時具有辨識手寫與印刷體漢字的模型。調整與優化模型參數，反覆驗證，並用其他具有代表性之不同測試資料集做實驗評估。如何利用影像處理技術生成有效之訓練資料、以提升辨識模型的正確率，對不同代表性測試集皆可辨識正確，是本研究的核心目標。本研究的研究成果主要包含: (1) 如何只以現存的電腦字體來訓練可以同時對手寫字體與印刷字體進行辨識的模型。 (2) 針對古典文獻中的印刷字體辨識最優化，改善古典文獻影像上字體模糊與罕見字等辨識問題。以實際民初京報、磧砂藏佛典和2013CASIA手寫漢字公開測試集等資料進行實驗，結果顯示，本研究所提出的模型與方法可達正確率京報69.9%、佛典89.29%、手寫字集58.24%。與現有之常用OCR辨識軟體做比較，可提升2~3%的正確率。 The main purpose of this paper is to improve Handwritten Chinese Character Recognition and traditional, non-modern Printed Chinese Character Recognition problem. By using the existing different style of Chinese font resources in computer system and online sources, we take most commonly used 5000 and 10000 words, then do several data deformation and preprocessing by image processing skills to produce training data. Combined with the technology of Convolutional Neural Networks in machine learning, we trained a distinguished model which can be used to recognize handwritten and printed Chinese character both. The main goal of this paper is to find the valid training features, optimize parameters and fine tune our model to get a better performance. The results of this paper mainly include: (1) How to train a model which can recognize both the handwritten font and the printed font simultaneously on by existing computer word font. (2) For the printed Chinese character font, we mainly focus on early traditional printed fonts, and improves the recognition problems, such as rare Chinese characters recognition and characters easily damaged or blur in the original text. (3) We conduct our experiments with the Beijing Civil News, the Biansha Tibetan Buddhist Dharma and the 2013 CASIA handwritten Chinese character public test set. The results show that the model and method we proposed in this paper can reach the accuracy of 69.9% on News, 89.29% on Buddhist Dharma, and 58.27% on handwriting testing set. Compared with the existing common OCR recognition software, our model can improve the accuracy about 2~3%. Key Word : HCCR、PCCR、Image Processing、Machine Learning、Convolutional Neural Networks
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/72156
DOI:	10.6342/NTU201803839
全文授權:	有償授權
顯示於系所單位：	工程科學及海洋工程學系

文件中的檔案：

檔案	大小	格式
ntu-107-1.pdf 目前未授權公開取用	5.41 MB	Adobe PDF

顯示文件完整紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料（如：文字、圖片、PDF）並使其易於取用。