請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/49106
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 高照明(Zhao-Ming Gao) | |
dc.contributor.author | Hoi-Lam Lee | en |
dc.contributor.author | 李海琳 | zh_TW |
dc.date.accessioned | 2021-06-15T11:16:09Z | - |
dc.date.available | 2016-08-25 | |
dc.date.copyright | 2016-08-25 | |
dc.date.issued | 2016 | |
dc.date.submitted | 2016-08-20 | |
dc.identifier.citation | Aarts, B. (2011). Oxford modern English grammar. Oxford: Oxford University Press.
China Committee of National Legal English Certificate Test. (2009). The translation of lgal terminology. A course in legal English translation (pp. 41-47). Beijing: China Legal Publishing House. Manning, C. D., Raghavan, P., & Schütze, H. (2008a). Term frequency and weighting. In Introduction to Information Retrieval. Retrieved November 1, 2015, from http://nlp.stanford.edu/IR-book/html/htmledition/tf-idf-weighting-1.html. Manning, C. D., Raghavan, P., & Schütze, H. (2008b). Tf-idf weighting. Introduction to Information Retrieval. Retrieved November 1, 2015, from http://nlp.stanford.edu/IR-book/html/ htmledition/tf-idf-weighting-1.html. Manning, C. D., Raghavan, P., & Schütze, H. (2008c). Reference Corpora. Introduction to Information Retrieval. Retrieved November 1, 2015, from http://www.ilc.cnr.it/EAGLES/corpustyp/node18.html. Dagan, I., & Church, K. (1994, October). Termight: Identifying and translating technical terminology. In Proceedings of the fourth conference on Applied natural language processing (pp. 34-40). Association for Computational Linguistics. Flowerdew, L. (2012). Corpora and language education. Houndmills, Basingstoke, Hampshire: Palgrave Macmillan. Feng, Z. (1997). Modern terminology. Taipei, Taiwan: Agricultural Science Information Center. Fung, P. (2000). A statistical view on bilingual lexicon extraction: From parallel corpora to non-parallel corpora. Text, Speech and Language Technology 13 (pp. 219-236). Springer Link. Gao, Z.M. (2013). Class notes of an introduction to corpus linguistics. Héja, E. (2010, May). The Role of Parallel Corpora in Bilingual Lexicography. In The International Conference on Language Resources and Evaluation. Retrieved November 1, 2015, from http://www.lrec-conf.org/proceedings/lrec2010/pdf/ 559_Paper.pdf. Hermans, T. (2013). Norms of Translation. Retrieved November 1, 2015, from http://www.researchschool.org/documents/Hermans_Norms%20of%20Trl.pdf. ICTPOS3.0詞性標記集 (Jieba’s Chinese parts-of-speech tags). Retrieved November, 1, 2015, from https://gist.github.com/luw2007/6016931. Jacquemin, C. (2001). Spotting and discovering terms through natural language processing. Cambridge, Mass.: MIT Press. Kwong, O. Y., & Tsou, B. K. (2011). Automatic corpus-based extraction of Chinese lgal terms. Proceedings of the 6th Natural Language Processing Pacific Rim Symposium. Lardilleux, A., & Lepage, Y. (2009, September). Sampling-based multilingual alignment. International Conference on Recent Advances in Natural Language Processing (RANLP 2009). Borovets, Bulgaria. Li, C., & Thompson, S. (1981). Mandarin Chinese: A functional reference grammar. Berkeley: University of California Press. London: Routledge. Law Dictionary, The (Black’s Law Dictionary). (n.d.). Retrieved October 15, 2015, from http://thelawdictionary.org/. Lin, Y.H., Gao, Z.M., & Kao, C.Y. (2008). Chinese NP chunking: Experiments with supervised, and semi-supervised learning. Retrieved May 7, 2015, from http://aclclp.org.tw/rocling/2008/OS4_3.pdf. Liu, Y. P. (2014). Skopos theory and legal translation: A case study of examples from the criminal law of the P.R.C. International Journal for the Semiotics of Law 28, 125-133. Springer Link. Lu, B. & Tsou, B. K. (2009). Towards bilingual term extraction in comparable patents. 23rd Pacific Asia Conference on Language, Information and Computation (pp. 755-762). Retrieved August 1, 2016, from http://www.aclweb.org/anthology/ Y09-2038. McEnery, T., & Xiao, R. (2006). Corpus based language studies: An advanced Resource. Nation, P. (1995). Range. Victoria, University of Wellington. Retrieved November 1, 2015, from http://www.victoria.ac.nz/lals/about/staff/paul-nation. Neubig, G., Watanabe, T., Sumita, E., Mori, S., & Kawahara, T. (2011). An unsupervised model for joint phrase alignment and extraction. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics – ACL ’11. Oxford Dictionaries - Dictionary, Thesaurus, & Grammar. (n.d.). Retrieved November 1, 2015, from http://www.oxforddictionaries.com/. Penn Treebank P.O.S. Tags. (n.d.). May 7, 2015, from https://www.ling.upenn.edu/ courses/Fall_2003/ling001/penn_treebank_pos.html. Sager, J. (1990). A Practical Course in Terminology Processing. Amsterdam: John Benjamins Pub. Wang, L. (2008, April). Cultural transfer in legal translation: A case study of the translation of the common law into Chinese in Hong Kong (Unpublished doctoral dissertation). Department of Chinese, Translation, and Linguistics. City University of Hong Kong, Hong Kong. Wu, D. (1995, August). Stochastic inversion transduction grammars, with application to segmentation, bracketing, and alignment of parallel corpora. 14th International Joint Conference on Artificial Intelligence. Montreal. Wu, T.J. (2012). Corpus-based approaches in relation to wordlists and formulaic sequences in the redesigned TOEIC. (Unpublished Master’s thesis). Department of Applied Foreign Languages, National Taiwan University of Science and Technology. Available from National Digital Library of Theses and Dissertations, Taiwan. 王道庚 (2013). 法律翻譯理論與實踐. Hong Kong: City University of Hong Kong Press. 朱國斌,林來梵,徐孟洲 (2007). 新編中國法. Hong Kong: City University of Hong Kong Press. 陳武天 (2010, July). 現代漢語「的」字結構的語法分析與教學應用 (Unpublished Master’s thesis). Department of Chinese as a Second Language, National Taiwan Normal University. Available from National Digital Library of Theses and Dissertations, Taiwan. 黃國平,廖柏森 (2007). 術語翻譯之探討:以關務詞彙為例. Studies of Translation and Interpretation 10, 1-29. Taipei, Taiwan. 蘇銘翔 (2011). 圖解刑法. Taipei, Taiwan: Wunan Press. Texts of Legal Corpora Criminal Law of the People’s Republic of China (Chinese version) (1997). The National People’s Congress of the People’s Republic of China. Retrieved May 27, 2015, from http://www.npc.gov.cn/wxzl/wxzl/2000-12/17/content_4680.htm. Criminal Law of the People’s Republic of China (English translation) (1997). Retrieved May 27, 2015, from http://www.npc.gov.cn/englishnpc/Law/2007-12/13/ content_ 1384075.htm. Criminal Code of the Republic of China (Chinese Version) (2014). Laws & Regulations Database of The Republic of China. Retrieved May 27, 2015, from http://law.moj.gov.tw/Law/LawSearchResult.aspx?p=A&t=A1A2E1F1&k1=%E5%88%91%E6%B3%95. Criminal Code of the Republic of China (English Translation) (2014). Laws & Regulations Database of The Republic of China. Retrieved May 27, 2015, from http://law.moj.gov.tw/Eng/LawClass/LawAll.aspx?PCode=C0000001. Code of Criminal Procedure (Chinese Version) (2015). Laws & Regulations Database of The Republic of China. Retrieved May 27, 2015, from http://law.moj.gov.tw/ Law/LawSearchResult.aspx?p=A&t=A1A2E1F1&k1=%E5%88%91%E4%BA%8B%E8%A8%B4%E8%A8%9F%E6%B3%95. Code of Criminal Procedure (English Translation) (2015). Laws & Regulations Database of The Republic of China. Retrieved May 27, 2015, from http://law.moj.gov.tw/Eng/LawClass/LawAll.aspx?PCode=C0010001. Criminal Procedure Law of the People’s Republic of China (Chinese version) (2012). The Central People’s Government of the People’s Republic of China. Retrieved May 27, 2015, from http://www.gov.cn/flfg/2012-03/17/content_2094354.htm. Criminal Procedure Law of the People’s Republic of China (English translation) (2012). Retrieved May 27, 2015, from http://www.cecc.gov/resources/legal-provisions/ criminal-procedure-law-of-the-peoples-republic-of-china. Software Anthony, L. (2014a). AntConc (Version 3.4.3) [Computer Software]. Tokyo, Japan: Waseda University. Retrieved May 7, 2015, from http://www.laurenceanthony.net/. Anthony, L. (2014b). AntPConc (Version 1.1.0) [Computer Software]. Tokyo, Japan: Waseda University. Retrieved November 1, 2015, from http://www. laurenceanthony.net/. Anthony, L. (2014c). AntWordProfiler (Version 1.4.0) [Computer Software]. Tokyo, Japan: Waseda University. Retrieved November 1, 2015, from http://www.laurenceanthony.net/. Cygwin v.2.0.0 (2015). The Cygwin DLL and utilities. Retrieved May 7, 2015, from https://www.cygwin.com/. Fuzzy Lookup Add-In for Excel (2016). Available from https://www.microsoft. com/en-us/download/details.aspx?id=15011. Lardilleux, A. (2009). Anymalign (Version 2.5). Retrieved May 7, 2015, from http://anymalign.limsi.fr/. Neubig, G. (2012). Pialign - Phrasal ITG Aligner (Version 0.2.4). Retrieved May 7, 2015, from http://www.phontron.com/pialign/. Sun, J. (2009). Jieba (Version 0.38). Retrieved May 7, 2015, from https://github.com/fxsjy/jieba. Stanford Log-linear Part-Of-Speech Tagger (2015). The Stanford Natural Language Processing Group. Retrieved May 7, 2015, from http://nlp.stanford.edu/ software/tagger.shtml. | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/49106 | - |
dc.description.abstract | 建立雙語詞彙表能有助譯者掌握專門領域翻譯及維持一致性。本文希望提供有效的半自動方法擷取雙語詞組供譯者使用,並有系統的建立編纂雙語專業詞彙機制。首先,本文透過Anymalign及Pialign兩套自然語言處理軟體,從兩岸刑法平行語料庫中取得詞組對應機率較高的漢英雙語候選詞及詞組。鑑軟體的品質不盡完善,故本文著手改善擷取結果,首先處理英文部分,即利用詞性標記軟體自動標記英文語料找出名詞或名詞組。後透過語言規律,將較不符合中英文名詞組組成規律的候選詞組過濾。所得的有效臺灣法律漢英語詞組為1,852個,中國大陸3,782個,其中包含更長單位的漢英名詞組。基於前述結果篩選出漢英詞組單位對照正確的詞組:臺灣有694組,中國大陸852組。另外,本文採用美國的參考語料庫,設下關鍵性字比值LLR ≥ 3.84的門檻,從一般名詞組中篩選出術語。臺灣487個英文名詞�名詞組中,有394個適合為術語;中國大陸的517個名詞�名詞組中則只有418個合適。擷取所得的有效名詞�名詞組經進一步的處理後製作成可比語料,顯示兩岸在法律中文用語或英譯上的異同。
就擷取效果而言,英文的名詞組擷取遠較中文容易,因中文的名詞性僅能在句子中才能顯出。反觀英文單字的後綴明顯,擷取名詞組時較有效率。中文在包含或不包含「之」(臺灣),或含有「的」或「之」(中國大陸)的二大類名詞組中,由於漢語詞組結構及其英語翻譯並未見一定規律,使得成功雙語完整擷取的難度大幅提高。就篩選術語而言,隨着名詞組長度單位增加,包括術語與一般字的組合亦同時增加,造成術語篩選上的困難。另兩岸的刑法內涵不盡相同。擷取單位的不完整性容易產生偽異同詞組。 本文研究方法及結果,冀能啟迪其他語言的雙語名詞組擷取或套用於其他專門領域,有助推行編纂雙語術語字彙表及促進翻譯在地化。 | zh_TW |
dc.description.abstract | Bilingual glossaries enable translators to maintain accurate domain-specific translation and consistency. This study aims to extract bilingual pairs semi-automatically and provides a systematic specialized term compilation for translators to follow. First, two natural language processing tools, Anymalign and Pialign, were adopted to extract Chinese-English candidate pairs that had higher translation probabilities from Taiwan’s and Mainland China’s criminal parallel corpora.
As the extraction tools are not perfect, this study focuses on improving the quality of the preliminary extracted results: qualified English noun (phrases) were identified first. English words/phrases were assigned parts-of-speech labels by Stanford POS Tagger automatically. Linguistic information was helpful in identifying and removing non-noun (phrase) patterns in order to locate qualified English noun phrases (NP). 1,852 English NPs with their Chinese pairs were found to be qualified (Taiwan), while 3,782 NPs were found to be valid (Mainland China). These results include noun phrases with longer word units. Based on these candidate results, 694 bilingual pairs were identified to be correctly aligned (Taiwan), while 852 correctly aligned bilingual pairs were identified (Mainland China). An American reference corpus was utilized to mark terms out by the indication of Keyness scores. The threshold was set at the critical value ≥ 3.84 calculated by the log-likelihood ratio. 394 terms were found among 487 qualified noun (phrases) from Taiwan, while 418 terms were identified among 517 Mainland China’s result. Qualified noun (phrases) and terms were adopted to produce a comparable list of noun (phrases), showing the similarities and differences in Chinese legal expressions and English translation across the Straits. Improving the extraction of English NPs was confirmed to be much more effective than that of Chinese NPs because Chinese parts-of-speech can be made clearer only in sentences. In contrast, as suffixes of English words are more distinct, it is shown that English noun (phrase) extraction was more effective by having identified their suffixes according to the parts-of-speech. Based on the result, extraction of bilingual pairs was proven to be difficult because, first, no regular patterns could be identified in the two main Chinese NP groups: those NPs include/exclude之 zhi (in Taiwan’s NP), or的de/之zhi (Mainland China’s NP). Second, the English translation patterns corresponding to Chinese NPs were neither always predictable. Filtering terms was proven to be even more difficult. As the length of an NP unit increases, mixture of terms and common words within an NP will more likely appear. Legal connotations are not entirely the same in Taiwan’s and Mainland China’s legal systems. If the word units of certain phrases are not extracted completely, falsely similar or seemingly different bilingual pairs will be created. The methodology and findings presented in this study are recommended to be applicable to other language pairs and different domain-specific genres, which may facilitate improvements in the compilation of bilingual term glossaries and localization in translation. | en |
dc.description.provenance | Made available in DSpace on 2021-06-15T11:16:09Z (GMT). No. of bitstreams: 1 ntu-105-R02147004-1.pdf: 10804565 bytes, checksum: 74a1e209960cecb1cdb81768bc3ddba8 (MD5) Previous issue date: 2016 | en |
dc.description.tableofcontents | VERIFICATION LETTER FROM THE ORAL EXAMINATION COMMITTEE i
誌謝 ii ACKNOWLEDGEMENT iii 中文摘要 iv ABSTRACT v TABLE OF CONTENTS vii LIST OF TABLES ix LIST OF FIGURES x CHAPTER ONE INTRODUCTION 1 1.1 Background and Motivation 2 1.2 Purpose of This Study 7 1.3 Research Questions 10 1.4 Significance of This Study 10 1.5 Definitions of Key Terms Used in This Study 11 1.6 Outline of This Study 13 CHAPTER TWO LITERATURE REVIEW 15 2.1 Definition of Noun (Phrases) 15 2.1.1 Syntactic Features of English Noun (Phrases) and Terms 15 2.1.2 Syntactic Features of Chinese Noun (Phrases) and Terms 18 2.1.3 Definition of a Term 25 2.1.4 Translation of a Term 27 2.1.5 Legal Translation Strategies 30 2.1.6 Definition of a Legal Term 31 2.2. Extracting Noun (Phrases) and Terms from Corpora 34 2.2.1 Types of Corpora 34 2.2.2 Extracting Words and Phrases from Corpora 35 2.2.3 Statistical Methods 39 2.2.4 Extraction Bilingual Terms from Parallel Corpora 44 2.3 Extracting Terms for Lexicographical Use 49 2.4 Summary 51 CHAPTER THREE METHODOLOGY 52 3.1 Research Data 52 3.2 Data Collection 52 3.2.1 Corpus Design 52 3.2.2 Introduction to the Texts in the Legal Corpora 54 3.3 Instruments 57 3.3.1 Operation Platform 57 3.3.2 Introduction to Natural Language Processing (NLP) Tools 58 3.3.3 Corpus Analysis Tools 63 3.4 Procedure 67 CHAPTER FOUR RESULTS AND DISCUSSIONS 86 4.1 English Noun (Phrases) for Taiwan’s and China’s Cases 86 4.1.1 Qualified English Noun (Phrase) Patterns in Taiwan 86 4.1.2 Qualified English Noun (Phrase) Patterns in China 101 4.1.3 Summary of Qualified English NP Patterns from Two to Seven Word Units 108 4.2 Chinese Noun Phrase Patterns for Taiwan’s and China’s Cases 119 4.2.1 The Features of Taiwan’s Chinese Noun Phrase Patterns 123 4.2.2 The Features of China’s Chinese Noun Phrase Patterns 125 4.3 Bilingual NP Candidate Patterns 127 4.4 Human Evaluation 144 4.5 Distributions of Noun (Phrases) and Terms 161 4.6 Similarities and Differences in Taiwan’s and China’s Legal Terms 168 4.6.1 Similarities Found in Taiwan’s and China’s Legal Terms 169 4.6.2 Differences in Taiwan’s and China’s Legal Terms 172 4.7 Summary 172 CHAPTER FIVE CONCLUSION 175 5.1 Summary of This Study 175 5.2 Limitations of This Study 179 5.3 Suggestions for Future Research 182 REFERENCES 184 APPENDICES 189 | |
dc.language.iso | en | |
dc.title | 從平行語料庫編纂漢英法律雙語詞彙 | zh_TW |
dc.title | Using Chinese-English Parallel Corpora for Compiling Bilingual Legal Glossaries | en |
dc.type | Thesis | |
dc.date.schoolyear | 104-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 王世平(Shih-ping Wang),陳雅齡(Ya-ling Chan) | |
dc.subject.keyword | 名詞組擷取,平行語料庫,法律用語,兩岸法律用語對比分析, | zh_TW |
dc.subject.keyword | noun phrase extraction,parallel corpora,legal expressions,contrastive analysis in Cross-Strait terminology, | en |
dc.relation.page | 244 | |
dc.identifier.doi | 10.6342/NTU201603217 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2016-08-21 | |
dc.contributor.author-college | 文學院 | zh_TW |
dc.contributor.author-dept | 翻譯碩士學位學程 | zh_TW |
顯示於系所單位: | 翻譯碩士學位學程 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-105-1.pdf 目前未授權公開取用 | 10.55 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。