以深層卷積神經網路對中文語調進行分類

Shih-Che Chen; 陳釋澈

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/70814

標題:	以深層卷積神經網路對中文語調進行分類 Mandarin Tone Classification Using CNN/DNN
作者:	Shih-Che Chen 陳釋澈
指導教授:	張智星(Jyh-Shing Jang)
關鍵字:	聲調分類,頻譜,影像識別,卷積神經網路,華語, tone classification,spectrogram,image recognition,convolutional neural network,Mandarin Chinese,
出版年 :	2018
學位:	碩士
摘要:	在華語系統中，聲調扮演十分重要之角色，同樣的一個音節，只要聲調的不同，即會產生完全不同的意義。母語是否為中文，常常可藉由講出來字詞之聲調辨認。為此，本論文提出一個對語音聲調進行分類的方法：先將聲音訊號轉為頻譜，將頻譜視為圖片，輸入至現有之影像識別卷積神經網路架構中，訓練出聲調分類模型，比較現成之影像辨識模型對處理聲調分類的效果如何。最後以此建立出不需對音訊進行過多處理步驟，即可達到一定程度之聲調分類架構。此聲調分類架構可套用至華語教學系統之中，為語言教學之方式提供新的選擇。 In Mandarin Chinese system, the tone plays an important role. Different tone patterns of the same syllable may result in different meanings. People whose native language aren’t Mandarin can be distinguished by their tone patterns. Therefore, we propose a method for tone classification. First, we convert the audio signal into the spectrogram. We treat the spectrogram as images, apply them as the image inputs for image recognition convolutional neural networks, and create tone classification models. We compare different image recognition models for tone classification. This approach can achieve good accuracy without too many processes on the audio signal. The tone classification architecture can be applied to Chinese teaching methods which will lead to educational success.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/70814
DOI:	10.6342/NTU201802657
全文授權:	有償授權
顯示於系所單位：	資訊工程學系

文件中的檔案：

檔案	大小	格式
ntu-107-1.pdf 未授權公開取用	6.88 MB	Adobe PDF

顯示文件完整紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料（如：文字、圖片、PDF）並使其易於取用。