Please use this identifier to cite or link to this item:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/73048
Title: | 現代漢語平行語料庫建構及其應用 Construction and Applications of a Modern Chinese Parallel Corpus |
Authors: | Da-Chen Lian 連大成 |
Advisor: | 謝舒凱(Shu-Kai Hsieh) |
Keyword: | 台灣國語,大陸國語,語言變異,字幕, Taiwan Mandarin,Mainland Mandarin,language variation,subtitles, |
Publication Year : | 2019 |
Degree: | 碩士 |
Abstract: | 隨著中文使用者的人數增加,語言上的變異也會隨之產生,這些變異可能來自外來或本身擁有的因素。雖然已存在研究中文變異的語料庫,但是這些資源不適用於研究篇口語的語域,一個能夠反應非正式的語言是影片裡的字幕,此論文的目的是以電影字幕和TED Talks字幕為基礎建構一個平行語料庫,方便學者研究台灣國語和大陸國語之間的變異 As the number of Mandarin Chinese speakers continues to increase, variations will inevitably begin to emerge as all speakers do not reside in one place. This variation can stem from internal factors or external ones, such as culture or location. While there exist corpora that can be used to study Mandarin Chinese variation, the existing resources do not offer insight into more colloquial registers. A good source of material that can more reliably reflect everyday speech is subtitles for TV shows, movies, and videos in general. Because the subtitles are meant to reflect dialogue heard on screen, it can better reflect colloquial speech. The goal of this thesis is to create a parallel corpus based on movie subtitles and TED Talks that can allow researchers to study language variation between Taiwan Mandarin and Mainland Mandarin. |
URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/73048 |
DOI: | 10.6342/NTU201901469 |
Fulltext Rights: | 有償授權 |
Appears in Collections: | 語言學研究所 |
Files in This Item:
File | Size | Format | |
---|---|---|---|
ntu-108-1.pdf Restricted Access | 1.86 MB | Adobe PDF |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.