增補資源匱乏漢語方言之漢字發音

Chu-Cheng Lin; 林居正

Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/47124

Title:	增補資源匱乏漢語方言之漢字發音 Augmentation of Character Pronunciations for Resource-poor Chinese Dialects
Authors:	Chu-Cheng Lin 林居正
Advisor:	許永真(Jane Yung-jen Hsu)
Co-Advisor:	蔡宗翰(Richard Tzong-han Tsai)
Keyword:	資料增補,生成模型,漢語方言,發音資料庫, data augmentation,generative model,Chinese dialects,pronunciation database,
Publication Year :	2010
Degree:	碩士
Abstract:	大多數漢語方言缺乏完整的數位發音資料庫,而這卻是語音處理不可或缺的。若有相關方言的完整發音資料庫便能憑某漢字之韻書特徵,及其於相關方言之發音,使用監督式學習方法預測該漢字於目標方言之發音。遺憾的是漢語方言發音資料庫資源仍不完備。我們提出一新式生成模型,同時利用方言發音資料以及中古韻書以發掘在多方言間存在之音韻規律。我們提出之模型能利用現存不完整之方言發音資料庫以及韻書所載資料增補得出一完整之方言發音資料庫。該方言發音資料庫之後即可利用傳統監督式學習方法預測某方言之漢字發音。我們藉整體發音特徵準確率 (OPFA) 項目評估。第一個實驗結果可看出若加入方言發音特徵相較於僅有韻書特徵,能大幅度改進支持向量機分類器 (SVM classifier) 的效能。第二個實驗中我們比較利用親屬關係相近之方言與親屬關係相距遙遠之方言之音韻特徵對支持向量機效能影響。實驗結果顯露利用相近方言可得較高準確率。第三個實驗中可看出利用我們提出之增補模型可以提高 SVM 模型之 OPFA 準確率高達 4.9%。 Most spoken Chinese dialects lack comprehensive digital pronunciation databases, which are crucial for speech processing tasks. Given complete pronunciation databases for related dialects, one can use supervised learning techniques to predict a Chinese character’s pronunciation in a target dialect based on the character’s features and its pronunciation in other related dialects. Unfortunately, Chinese dialect pronunciation databases are far from complete. We propose a novel generative model that makes use of both existing dialect pronunciation data plus medieval rime books to discover patterns that exist in multiple dialects. The proposed model can augment missing dialectal pronunciations based on existing dialect pronunciation tables (even if in-complete) and the pronunciation data in rime books. The augmented pronunciation database can then be used in supervised learning settings. We evaluate the prediction accuracy in terms of phonological features, such as tone, initial phoneme, final phoneme, etc. For each character, features are evaluated on the whole, overall pronunciation feature accuracy (OPFA). Our first experimental results show that adding features from dialectal pronunciation data to our baseline rime-book model dramatically improves OPFA using the support vector machine (SVM) model. In the second experiment, we compare the performance of the SVM model using phonological features from closely related dialects with that of the model using phonological features from non-closely related dialects. The experimental results show that using features from closely-related dialects results in higher accuracy. In the third experiment, we show that using our proposed data augmentation model to fill in missing data can increase the SVM model’s OPFA by up to 4.9%.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/47124
Fulltext Rights:	有償授權
Appears in Collections:	資訊工程學系

Files in This Item:

File	Size	Format
ntu-99-1.pdf Restricted Access	1.27 MB	Adobe PDF

Show full item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets