利用電腦字型建立卷積神經網絡之中文漢字模型進行手寫與印刷字體辨識

Yu-An Li; 李育安

Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/72156

Title:	利用電腦字型建立卷積神經網絡之中文漢字模型進行手寫與印刷字體辨識 Handwritten and Printed Chinese Character Recognition By Using Computer Font Type Chinese Characters into Convolutional Neural Network
Authors:	Yu-An Li 李育安
Advisor:	黃乾綱
Keyword:	中文漢字手寫字元辨識, 中文漢字印刷體字元辨識, 影像處理, 機器學習, 卷積神經網絡, HCCR, PCCR, Image Processing, Machine Learning, Convolutional Neural Networks
Publication Year :	2018
Degree:	Master's (碩士)
Abstract:	本研究的目的在改善中文漢字的手寫與印刷字體之辨識。利用現有網路上與電腦內建的現存的不同風格的字型資源，取常用的5000及10000字，並搭配影像處理技術，對這些字體做數種變形與前處理來產生所需要的訓練資料。運用機器學習中的卷積神經網路(Convolutional Neural Networks)之技術，訓練出一個同時具有辨識手寫與印刷體漢字的模型。調整與優化模型參數，反覆驗證，並用其他具有代表性之不同測試資料集做實驗評估。如何利用影像處理技術生成有效之訓練資料、以提升辨識模型的正確率，對不同代表性測試集皆可辨識正確，是本研究的核心目標。本研究的研究成果主要包含: (1) 如何只以現存的電腦字體來訓練可以同時對手寫字體與印刷字體進行辨識的模型。 (2) 針對古典文獻中的印刷字體辨識最優化，改善古典文獻影像上字體模糊與罕見字等辨識問題。以實際民初京報、磧砂藏佛典和2013CASIA手寫漢字公開測試集等資料進行實驗，結果顯示，本研究所提出的模型與方法可達正確率京報69.9%、佛典89.29%、手寫字集58.24%。與現有之常用OCR辨識軟體做比較，可提升2~3%的正確率。
The main purpose of this thesis is to improve handwritten Chinese character recognition (HCCR) and printed Chinese character recognition (PCCR), with particular attention to early, non-modern printed text. Using Chinese fonts of different styles that are already available online and bundled with computer systems, we take the 5,000 and 10,000 most commonly used characters and apply several deformations and preprocessing steps, based on image processing techniques, to generate the training data. With convolutional neural networks (CNNs), we train a single model that recognizes both handwritten and printed Chinese characters. We tune and optimize the model parameters, validate repeatedly, and evaluate on several representative test sets. The core goal is to generate effective training data through image processing so that recognition accuracy improves across these different test sets. The main results are: (1) a way to train a model that recognizes handwritten and printed characters simultaneously using only existing computer fonts; (2) for printed characters, a focus on early traditional printed fonts, improving the recognition of rare characters and of characters that are damaged or blurred in the original documents. We conducted experiments on early Republican-era Jing Bao (京報) newspapers, Buddhist scriptures from the Qisha Canon (磧砂藏), and the 2013 CASIA handwritten Chinese character public test set. The results show that the proposed model and method reach an accuracy of 69.9% on the newspaper set, 89.29% on the Buddhist scriptures, and 58.27% on the handwriting test set. Compared with commonly used existing OCR software, our model improves accuracy by about 2 to 3%.
Keywords: HCCR, PCCR, Image Processing, Machine Learning, Convolutional Neural Networks
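The abstract describes generating training data by deforming font glyphs but does not say which deformations were used. The following is a minimal sketch of one plausible step, assuming simple affine deformations (rotation, horizontal shear, scaling) applied to a binary glyph bitmap. It uses only the Python standard library; the function names, the hand-written demo bitmap (standing in for a glyph rendered from a real font file), and the parameter ranges are all hypothetical and are not taken from the thesis.

```python
import math
import random


def make_demo_glyph():
    """Return an 8x8 binary bitmap of an asymmetric 'L' shape.

    Hypothetical stand-in for a character rendered from a font file.
    """
    rows = [
        "........",
        ".#......",
        ".#......",
        ".#......",
        ".#......",
        ".#......",
        ".#####..",
        "........",
    ]
    return [[1 if ch == "#" else 0 for ch in row] for row in rows]


def affine_deform(glyph, angle_deg=0.0, shear=0.0, scale=1.0):
    """Return a deformed copy of a binary glyph (list of rows of 0/1).

    The forward transform is scale * Rotation(angle) * ShearX(shear),
    applied about the bitmap centre. Each output pixel is filled by
    inverse mapping with nearest-neighbour sampling; pixels that map
    outside the source bitmap become background (0).
    """
    if scale == 0:
        raise ValueError("scale must be non-zero")
    h, w = len(glyph), len(glyph[0])
    cx, cy = (w - 1) / 2.0, (h - 1) / 2.0
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    # Entries of the 2x2 forward matrix [[a, b], [c, d]].
    a = scale * cos_t
    b = scale * (cos_t * shear - sin_t)
    c = scale * sin_t
    d = scale * (sin_t * shear + cos_t)
    det = a * d - b * c  # equals scale**2, so non-zero here
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            dx, dy = x - cx, y - cy
            # Apply the inverse matrix to find the source coordinates.
            sx = int(math.floor((d * dx - b * dy) / det + cx + 0.5))
            sy = int(math.floor((-c * dx + a * dy) / det + cy + 0.5))
            if 0 <= sx < w and 0 <= sy < h:
                out[y][x] = glyph[sy][sx]
    return out


def random_variants(glyph, n, seed=None):
    """Generate n randomly deformed copies of a glyph.

    The parameter ranges below are illustrative assumptions only.
    """
    rng = random.Random(seed)
    variants = []
    for _ in range(n):
        variants.append(
            affine_deform(
                glyph,
                angle_deg=rng.uniform(-10.0, 10.0),
                shear=rng.uniform(-0.2, 0.2),
                scale=rng.uniform(0.9, 1.1),
            )
        )
    return variants
```

Calling `random_variants(make_demo_glyph(), 5, seed=0)` returns five deformed bitmaps of the same size as the input. In a real pipeline each variant would be paired with its character label and fed to the CNN, and the glyphs would come from rendering each character in each available font rather than from a hand-written bitmap.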
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/72156
DOI:	10.6342/NTU201803839
Fulltext Rights:	Paid license (有償授權)
Appears in Collections:	Department of Engineering Science and Ocean Engineering (工程科學及海洋工程學系)

Files in This Item:

File	Size	Format
ntu-107-1.pdf Restricted Access	5.41 MB	Adobe PDF
