Skip navigation

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More
DSpace logo
English
中文
  • Browse
    • Communities
      & Collections
    • Publication Year
    • Author
    • Title
    • Subject
    • Advisor
  • Search TDR
  • Rights Q&A
    • My Page
    • Receive email
      updates
    • Edit Profile
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 光電工程學研究所
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/84518
Title: 基於神經網路中光學像差在OCR系統的影響
Influence of Optical Aberrations on Optical Character Recognition System based on Neural Network
Authors: Hao-Sheng Zhang
張浩陞
Advisor: 蘇國棟(Guo-Dung Su)
Keyword: 光學字元辨識,光學像差,光束追蹤,神經網路,文字框字,廣角透鏡,
Optical Character Recognition,Optical Aberration,Ray Tracing,Neural Network,Text Box,Wide-angle Lens,
Publication Year : 2022
Degree: 碩士
Abstract: 近年來,由於光學字元辨識系統的技術逐漸發展成熟,對於人類圖像文字的辨識有更全面的影響。光學字元辨識(OCR)是指對文字資料的圖像檔案進行分析辨識處理,取得文字及版面資訊的過程。在光學中,光學像差是實際成像與理論成像結果的偏差,其中包括球面像差、彗星像差、散光、場曲和畸變。在本篇論文中,我們提出一種光學模擬方式進行光束追蹤,其主要設計方法為利用Zemax光學模擬軟體設計五種光學像差結構,而針對OCR系統內的英文字母進行辨識,在文字圖形結構產生像差影響之後,藉由神經網路的訓練下使用Pytesseract影像辨識程式將圖片文字框字後重新訓練圖片,進而提升文字圖形的辨識率。接著,我們展現所有英文字母辨識率之模擬結果在OCR系統中。利用廣角透鏡的參考設計來驗證OCR系統對於字母辨識的影響。我們也分析與討論藉由改變參數設計對於OCR系統的文字辨識情形。基於神經網路的OCR系統可以明顯降低廣角透鏡設計的複雜性。透鏡的數量從10個減少到6個,而孔徑從0.62毫米增加到0.71毫米。因此從辨識結果中發現,藉由文字框字後而多次訓練下的圖片,能夠提高所有英文字母的辨識準確率,最後我們驗證了所提出的光學模擬方法,也探討基於神經網路中五種光學像差結構對於OCR系統的影響。
In recent years, as the technology of the optical character recognition system has gradually developed and matured, it has had a more comprehensive impact on the recognition of human image characters. Optical Character Recognition (OCR) is the process of analyzing and identifying image files of text data to obtain text and layout information. Optical aberration is the deviation of actual imaging from theoretical imaging results, including spherical aberration, coma, astigmatism, field curvature, and distortion. In this paper, we propose an optical simulation method for ray tracing. The primary design method uses Zemax simulation software to design five optical aberration structures. In recognition, after the structure of text and graphics produces aberration effects, the Pytesseract® image recognition program based on a neural network is used to frame the text and re-train the picture under the neural network training, thereby improving the recognition rate of text and graphics. Next, we show the recognition results of all English letter recognition rates in the OCR system. A reference design of a wide-angle lens is used to verify the impact of the OCR system on letter recognition. We also analyze and discuss the text recognition situation for the OCR system by changing the parameter design. The neural network-based OCR system can significantly reduce the complexity of a wide-angle lens design. The number of lenses can be reduced from ten to six. The aperture can be increased from 0.62 mm to 0.71 mm. Therefore, the recognition results show that the recognition accuracy of all English letters can be enhanced by using the pictures after the text box and repeated training. Finally, we verified the proposed optical simulation method—the influence of optical aberration structure on the OCR system.
URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/84518
DOI: 10.6342/NTU202203769
Fulltext Rights: 同意授權(限校園內公開)
metadata.dc.date.embargo-lift: 2022-09-30
Appears in Collections:光電工程學研究所

Files in This Item:
File SizeFormat 
U0001-2109202221254300.pdf
Access limited in NTU ip range
2.51 MBAdobe PDF
Show full item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved