請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/73048
標題: | 現代漢語平行語料庫建構及其應用 Construction and Applications of a Modern Chinese Parallel Corpus |
作者: | Da-Chen Lian 連大成 |
指導教授: | 謝舒凱(Shu-Kai Hsieh) |
關鍵字: | 台灣國語,大陸國語,語言變異,字幕, Taiwan Mandarin,Mainland Mandarin,language variation,subtitles, |
出版年 : | 2019 |
學位: | 碩士 |
摘要: | 隨著中文使用者的人數增加,語言上的變異也會隨之產生,這些變異可能來自外來或本身擁有的因素。雖然已存在研究中文變異的語料庫,但是這些資源不適用於研究篇口語的語域,一個能夠反應非正式的語言是影片裡的字幕,此論文的目的是以電影字幕和TED Talks字幕為基礎建構一個平行語料庫,方便學者研究台灣國語和大陸國語之間的變異 As the number of Mandarin Chinese speakers continues to increase, variations will inevitably begin to emerge as all speakers do not reside in one place. This variation can stem from internal factors or external ones, such as culture or location. While there exist corpora that can be used to study Mandarin Chinese variation, the existing resources do not offer insight into more colloquial registers. A good source of material that can more reliably reflect everyday speech is subtitles for TV shows, movies, and videos in general. Because the subtitles are meant to reflect dialogue heard on screen, it can better reflect colloquial speech. The goal of this thesis is to create a parallel corpus based on movie subtitles and TED Talks that can allow researchers to study language variation between Taiwan Mandarin and Mainland Mandarin. |
URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/73048 |
DOI: | 10.6342/NTU201901469 |
全文授權: | 有償授權 |
顯示於系所單位: | 語言學研究所 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-108-1.pdf 目前未授權公開取用 | 1.86 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。