請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/50924
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 蔡毓芬(Yvonne Tsai) | |
dc.contributor.author | Yan-juan He | en |
dc.contributor.author | 何艷娟 | zh_TW |
dc.date.accessioned | 2021-06-15T13:06:52Z | - |
dc.date.available | 2017-07-04 | |
dc.date.copyright | 2016-07-04 | |
dc.date.issued | 2016 | |
dc.date.submitted | 2016-07-01 | |
dc.identifier.citation | References
Alcaraz, E., & Hughes, B. (2002). Legal translation explained. Manchester: St. Jerome Publishing. Anthony, L. (2014). AntConc (version 3.4.4). Retrieved from http://www.laurenceanthony.net/software/antconc/ Archer, D. (2009). Does Frequency Really Matter? In D. Archer (Ed.), What's in a word-list? Investigating word frequency and keyword extraction. Surrey and Burlington: Ashgate. Baker, M. (1995). Corpora in translation studies: An overview and some suggestions for future research. Target, 7(2), 223-243. Baker, M. (1998). Routledge encyclopedia of translation studies (1st ed.). London: Routledge. Baker, M. (2004). A corpus-based view of similarity and difference in translation. International Journal of Corpus Linguistics, 9(2), 167-193. Baker, P. (2004). Querying keywords: questions of difference, frequency and sense in keyword analysis. Journal of English Linguistics, 32(4), 346-359. Baker, P. (2006). Using corpora in discourse analysis. London & New York: Continuum. Barnbrook, G. (1996). Language and computers. Edinburgh: Edinburgh University Press. Bondi, M. (2010). Perspectives on keywords and keyness: an introduction. In M. Bondi & M. Scott (Eds.), Keyness in texts (pp. 1). Philadelphia: John Benjamins B.V. Bowker, L., & Pearson, J. (2002). Working with specialized language: A practical guide to using corpora. London: Routledge. Chang, Q. (2010). Application of functional equivalence theory in English translation of Chinese idioms. Journal of Language Teaching and Research, 1(6), 880-888. Charniak, E., & Johnson, M. (2013). Introduction to computational linguistics. Retrieved from http://cs.brown.edu/courses/csci2951-k/papers/cl-intro.pdf Chen, C., & Lee, T. (2006). TW Patent No. I254355. TIPO. Chen, F., & Liang, S. (2015). CN Patent No. CN102856328. SIPO. Chen, P. (2012). Using comparable specialized corpora with machine translation for extracting N-gram translation equivalents: A case study of Chinese and English contracts. National Taiwan Normal University. Chen, W., & Lin, N. (2014). CN Patent No. CN102610533B. SIPO. Cheng, Y., Tsui, C., & Kao, c. (2015). Transliteration disparities in Taiwan and China: in the perspective of corpus and SMT. Paper presented at the 2015 Taiwan Translation Conference: Corpus and Translation. Chi, T. (2009). The problem of Simplified Chinese characters: a historical review of the developments of Simplified Chinese characters by the Communist China. Journal of National Taiwan Normal University: Linguistics & Literature, 54(2), 107﹣133. Chiang, K. (2015). [Filing a patent in China]. Chuang. (2008). Patent Translation. Beijing: Intellectual Property Publishing House. Church, K. W., & Hanks, P. (1990). Word association norms, mutual information, and lexicography. Computational Linguistics, 16(1), 22-29. Cross, M. (2007a). Types of patent translation. Retrieved from http://patenttranslations.com/Publications/ATAChronicalApril05.pdf Cross, M. (2007b). Literal translation of patents. Retrieved from http://patenttranslations.com/Publications/ATAPatentTranslatorsHandbook-Martin%20Cross.pdf Culpepper, J. (2002). Computer, language and characterization: an analysis of six characters in Romeo and Juliet. In U. Melander-Marttala, C. Ostman, & M. Kyto (Eds.), Conversation in life and literature. Uppsala: Universitetstryckeriet. Fano, R. (1961). Transmission of information: A statistical theory of communications. Cambridge: M.I.T Press. Firth, J. R. (1957). A synopsis of linguistic theory, 1930-1955. In J. R. Firth (Ed.), Studies in linguistic analysis (Vol. Special Volume of the Philological Society). Oxford: Blackwell. Flowerdew, J., & Forest, R. (2009). Schematic structure and lexical-grammatical realization in corpus-based genre analysis: The case of research in the PhD literature Review In M. P. Charles, Diane. & Hunston, Susan (Ed.), Academic writing: At the interface of corpus and discourse (pp. 21). New York: Continuum. Flowerdew, L. (2012). Corpora and language education. Hampshire: Palgrave Macmillan. Goh, G. (2011). Choosing a reference corpus for keyword calculation. Linguistic Research, 28(1), 239-256. Goldsmith, J. A. (2010). Segmentation and morphology. In A. Clark, C. Fox, & S. Lappin (Eds.), The handbook of computational linguistics and natural language processing (pp. 517). West Sussex: Wiley-Blackwell. Google. (2012). Google Translate. Retrieved from https://translate.google.com/ Guidelines on Examination of Patents. (2010). China: SIPO. He, M. (2009, April 9). Culture observation: should Chinese characters be in the Simplified or Traditional form? People's Daily Online. Retrieved from http://culture.people.com.cn/GB/9098068.html Hsieh, Y. (2008). A comparison of Two Chinese translations of Stephen King's Bag of Bones. National Taiwan Normal University. Hsun, J. (2009). jieba. Retrieved from http://jiebademo.ap01.aws.af.cm/ Huang, C., Chen, K., & Chang, L. (1996). Segmentation standard for Chinese natural language processing. Paper presented at the 1996 International Conference on Computational Linguistics Copenhagen. Huang, C., & Hsu, E. (2015, June 30). Nearly half of Taiwan's export orders made in China, Hong Kong. Focus Taiwan. Retrieved from http://focustaiwan.tw/news/aeco/201506300019.aspx Huang, W. (2010). Approaches to patent translation and relevant regulations. Intellectual Property Rights Journal (141), 60-89. Hunston, S. (2002). Corpora in applied linguistics. Cambridge: Cambridge University Press. Hunter, S. P. (2015). The importance of patents. Retrieved from LaboratoryNews website: http://www.labnews.co.uk/features/the-importance-of-patents-01-07-2005/ Hutchins, J. (2003). Machine translation. In R. Mitkov (Ed.), The Oxford handbook of computational linguistics (pp. 501). New York: Oxford University Press. Implementing Rules of the Patent Law of the People's Republic of China, N.306 C.F.R. (2010). Jacquemin, C., & Bourigault, D. (2003). Term extraction and automatic indexing. In R. Mitkov (Ed.), The Oxford handbook of computational linguistics (pp. 604). New York: Oxford University Press. Jakobson, R. (1959). On linguistic aspects of translation. In L. B. Venuti, Mona (Ed.), The translation studies reader (3rd, 2012 ed., pp. 113-118). London and New York: Routledge. Jiang, Y., Shi, Z., & Chen, M. (2010). CN Patent No. CN101221961B. SIPO. Joho, H., & Sanderson, M. (2007). Document frequency and term specificity. Paper presented at the RIAO '07 Large Scale Semantic Access to Content (Text, Image, Video, and Sound), Paris. Kao, C., Lin, C., Ting, Y., Liu, P., & Cheng, Y. (2012). Investigating translation types of scientific terminologies in cross-strait primary and secondary school textbooks. Paper presented at the 2012 Taiwan Translation Conference. Kaur, G., Jain, S. K., Parmar, S., & Kumar, A. (2011). Extraction of domain-specific concepts to create expertise profiles. In P. V. Krishna, M. R. Babu, & E. Ariwa (Eds.), Global trends in computing and communication systems (pp. 767). Berlin: Springer. Laviosa, S. (1997). How comparable can 'comparable corpora' be? Target, 9(2), 289-319. Lee, H. (2006). Investigating Traditional Chinese and Simplified Chinese: from an academic perspective. Journal of Chinese Language, 98, 42﹣54. Leech, G. (1991). The state of the art in corpus linguistics. In K. Aijmer & B. Altenberg (Eds.), English corpus linguistics (pp. 8-29). London: Longman. Lewis, M. P., Simons, F. G., & Fennig, D. C. (2016). Ethnologue: languages of the world (19th ed.). Retrieved from https://www.ethnologue.com/statistics/size Liao, H., & Chen, C. (2009). Practices in applying and drafting patent specifications of invention patents: machinery, electronics and software related inventions. Taipei: TIPO. Liao, M., Hong, M., Ko, H., Chen, K., & Wang, Y. (2015). TW Patent No. I511199. TIPO. Lin, B., Chen, C., Gau, T., Lin, C., Liu, R., & Shih, J. (2006). TW Patent No. I259517. TIPO. Liu, T. (2014). PTT corpus: construction, design and application. (Master), National Taiwan University. Liu, Y., Tan, L., & Song, S. (2013). Chinese patent explosion factors: an empirical analysis based on system and policy Paper presented at the PICMET 2013: Technology Management in the IT-Driven Services, California. Machine translation: what is statistical machine translation (SMT)? (2015). Retrieved from http://www.asiaonline.net/EN/MachineTranslation/default.aspx?QID=16#QID16 McCarthy, M., & O'Keeffe, A. (2010). Historical perspective: what are corpora and how have they evolved? In M. McCarthy & A. O'Keeffe (Eds.), The Routledge handbook of corpus linguistics. New York: Routledge. McEnery, T. (2003). Corpus linguistics. In R. Mitkov (Ed.), The Oxford handbook of computational linguistics. New York: Oxford University Press. McEnery, T., Xiao, R., & Tono, Y. (2006). Corpus-based language studies: an advanced resource book. New York: Routledge. Microsoft. (2016). Localization. Retrieved from https://msdn.microsoft.com/en-us/library/5839we2z(v=vs.110).aspx Mikheev, A. (2003). Text segmentation. In R. Mitkov (Ed.), The Oxford handbook of computational linguistics. New York: Oxford University Press. Morah, C. (2015). Patents are assets, so learn how to value them. Retrieved from investopedia website: http://www.investopedia.com/articles/fundamental-analysis/09/valuing-patent.asp Nesselhauf, N. (2011). Corpus linguistics: a practical introduction. Retrieved from Heidelberg University website: http://www.as.uni-heidelberg.de/personen/Nesselhauf/files/Corpus%20Linguistics%20Practical%20Introduction.pdf Nida, E. (1964). Toward a science of translating. Leiden: E.J.Brill. Pan, W., Lin, C., Rao, B., Chen, C., Ting, Y., Liu, P., & Hsiao, J. (2012). Preliminary study on the translation of academic vocabularies and terminologies in Taiwan and China: a complete report of comparison and analysis. Taipei: National Academy for Educational Research. Patent Law of the People's Republic of China, (2008). Peng, Z. (2009). Main Chinese characters, auxiliary Chinese characters and Simplified Chinese, Traditional Chinese characters. Journal of Beihua University (Social Sciences), 10(5), 61-63. Reppen, R. (2010). Building a corpus: What are the key considerations? In M. McCarthy & A. O'Keeffe (Eds.), The Routledge Handbook of Corpus Linguistics (pp. 32). New York: Routledge. Regular expressions. (2016). Retrieved from https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Regular_Expressions Ryu, T., & Han, Y. (2012). Indicator for evaluating national patent performance: comparative analysis among the 300 OECD countries. Journal of Intellectual Property Rights, 17, 103-110. Scott, M. (1997). PC analysis of key words - and key key words. System, 25 (1), 1-13. Scott, M. (2006). In search of a bad reference corpus. Paper presented at the AHRC ICT Methods Network Expert Seminar on Linguistics, Lancaster University. Scott, M. (2010). WordSmith Tools (Version 5.0). Liverpool: Lexical Analysis Software Scott, M., & Tribble, C. (2006). Textual Patterns: keywords and corpus analysis in language education. Philadelphia: John Benjamins B.V. SDL. (2015). Trados 2007. Retrieved from http://www.sdl.com/about/sdl/global-cxm/ SDL. (2016a). How does translation memory differ from machine translation? Retrieved from http://www.translationzone.com/products/translation-memory/ SDL. (2016b). What is a translation memory? Retrieved from http://www.translationzone.com/solutions/translation-memory/ Shiflett, M. (2012). Functional equivalence and its role in legal translation. English Matters III, 29-33. Shih, Z., Guo, S., & Chen, W. (2015). CN Patent No. CN102790035B. SIPO. Sinclair, J. (1991). Corpus, Concordance and Collocation. Oxford: Oxford University Press. Sinclair, J. (1996). EAGLS - Preliminary recommendations on corpus typology. EAG--TCWG--CTYP/P. Retrieved from EAGLS website: http://www.ilc.cnr.it/EAGLES96/corpustyp/node5.html#SECTION00041000000000000000 Sinclair, J. (2004). Trust the text - language, corpus and discourse. London and New York: Routledge. SIPO. (2011). 2010 annual report of SIPO. Retrieved from http://www.sipo.gov.cn/gk/ndbg/2010/201310/t20131025_859522.html. Speelman, D., Grondelaers, S., & Geeraerts, D. (2008). Variation in the choice of adjectives in the two main national varieties Dutch. In G. Kristiansen & R. Dirven (Eds.), Cognitive Sociolinguistics: Language variation, cultural models, social systems. Berlin: Mouton de Gruyter. Taiwan Patent Act, (2014). Tang, W. M. (2016). Corpus Linguistics - A Short Introduction. Retrieved from http://wmtang.org/resources/corpus-resources/corpus-linguistics/ The China that works. (2015, September 12). The Economist. Retrieved from http://www.economist.com/news/leaders/21664143-if-economic-miracle-continue-officials-must-give-private-sector-more-freedom Thomas, R. (1990). Bible translations: the link between exegesis and expository preaching. The Master's Journal 1, 53-79. TIPO. (2013). 2013 annual report of TIPO. Retrieved from http://www.tipo.gov.tw/lp.asp?CtNode=6721&CtUnit=3231&BaseDSD=7&mp=1. TIPO. (2014). 2014 annual report of TIPO. Retrieved from http://www.tipo.gov.tw/lp.asp?CtNode=6721&CtUnit=3231&BaseDSD=7&mp=1. TIPO. (2015). Chinese-English Technical Patent Glossary. Retrieved from http://paterm.tipo.gov.tw/IPOTechTerm/doIPOTechTermIndex.do Toll, S. G. (2014). U.S Patent No. 8,845,018. USPTO. Translator's Workbench Users Guide. (2007). (DC-T27-TWB-02-EN). Tribble, C. (1999). Writing difficult texts. (Ph.D), Lancaster University. Tribble, C. (2010). What are concordances and how are they used? In A. O'Keeffe & M. McCarthy (Eds.), The Routledge handbook of corpus linguistics. New York: Routledge. Tsai, Y. (2015). A study of intralingual association of patent texts in China and Taiwan. Studies in English Language and Literature (35), 115-129. Way, A. (2010). Machine translation. In A. Clark, C. Fox, & S. Lappin (Eds.), The Handbook of Computational Linguistics and Natural Language Processing (pp. 531-532). West Sussex: Wiley-Blackwell. Wheatley, J. K. (2011). A foundation course in learning Mandarin Chinese. New Heaven: Yale University Press. WIPO. (2015a). What is patent? Retrieved from http://www.wipo.int/patents/en/ WIPO. (2015b). International Treaties and Conventions on Intellectual Property. Retrieved from http://www.wipo.int/export/sites/www/about-ip/en/iprm/pdf/ch5.pdf WIPO. (2015c). Regulations under the PCT Rule 6: the claims. Retrieved from http://www.wipo.int/pct/en/texts/rules/r6.htm. WIPO. (2015d). WIPO patent draft manual. Retrieved from http://www.wipo.int/edocs/pubdocs/en/patents/867/wipo_pub_867.pdf WIPO. (2015e). About the international patent classification. Retrieved from http://www.wipo.int/classifications/ipc/en/preface.html WIPO. (2015f). WIPO Translate. Retrieved from http://www.wipo.int/patentscope/en/news/pctdb/2015/news_0004.html Wu, W., & Li, C. (2009). Linguistics and teaching Chinese as a second language. Hongkong: The University of Hong Kong. Xiao, H. (2012). An introduction to the Modern Chinese Corpus of China's State Language Affairs Commission. Retrieved from CNCorpus: http://www.cncorpus.org/Resources.aspx Xiao, Z., & McEnery, A. (2005). Two approaches to genre analysis: Three genres in modern American English. Journal of English Linguistics, 33(1), 62-82. Xu, C. (2009). Ma Yingjeou's views on standard Chinese characters and Simplified Chinese characters. Journal of Beihua University (Social Sciences), 10(5), 56-60. [Yuanhe Partners: Intralingual translation of patent specifications from Traditional Chinese into Simplified Chinese] (2015). Zethsen, K. K. (2009). Intralingual translation: an attempt at description. Meta: Translators' Journal, 54(4), 795-812. Zhang, C., Hsieh, C., & Shue, S. (2006). TW Patent No. I256677. TIPO. Zheng, H. (2015). CN Patent No. CN102931156B. SIPO. Zhou, Y. (2015). CN Patent No. CN102654636B. SIPO. | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/50924 | - |
dc.description.abstract | 隨著兩岸交流活動的日益增長,越來越多台灣企業前往中國申請專利,但依據中國專利法規定,只能以簡體中文撰寫申請文件。現階段在實務層面,台灣申請人會提供以繁體中文撰寫的說明書給中國專利代理機構,代理機構再根據中國專利法規定調整格式,修正繁、簡體中文之間的用詞差異。此一修正詞彙的過程,乃是發生在同種語言內的語內翻譯,特別值得加以探討。
為輔助專利譯者在語內翻譯過程中修正用詞差異,本文以語料庫為基礎,對比兩岸申請專利範圍(權利要求書)的詞彙搭配及使用習慣。具體步驟包括:(1) 建置專利公報繁、簡體中文語料庫;(2) 處理生語料並生成繁、簡體詞頻表;(3) 自動提取n連詞;(4) 使用線上翻譯系統分別將提取到的繁、簡體中文名詞組翻譯為英文;(5) 利用簡體中文名詞組和對應的英文建置翻譯記憶庫 (TM) ;(6) 把TM和繁體中文名詞組的翻譯導入機器輔助翻譯工具(Trados 2007),即可得出英譯相同的繁、簡體中文名詞對照組 。 | zh_TW |
dc.description.abstract | Traditional Chinese and Simplified Chinese are respectively used as the official languages in Taiwan and China. Although both are classified as Mandarin Chinese, there are differences in form, pronunciation, and wording. The warming cross-strait exchanges prompted more and more Taiwan applicants to file patents in China, which in turn, triggered discussions on the phraseological differences between the two variants.
To facilitate intralingual translation of Chinese patent documents, a corpus-based method is proposed to identify collocation patterns and lexical preferences of Traditional Chinese and Simplified Chinese patent claims. Steps are as follows: (1) collect data and compile comparable corpora; (2) process raw data and generate wordlists; (3) extract n-grams automatically; (4) translate Traditional Chinese and Simplified Chinese nominal n-grams into English with Patent Glossary and WIPO Translate; (5) align Simplified Chinese n-grams and their translations to compile a TM; (6) import the TM and translations of Traditional Chinese nominal n-grams into Trados 2007, to automatically identify intralingual equivalents which share the same English translation. | en |
dc.description.provenance | Made available in DSpace on 2021-06-15T13:06:52Z (GMT). No. of bitstreams: 1 ntu-105-R02147012-1.pdf: 2004647 bytes, checksum: 523bcabe2f2aca8cfdb11d74c3ee339b (MD5) Previous issue date: 2016 | en |
dc.description.tableofcontents | Table of Contents
Figures and Tables vii List of Abbreviations viii Chapter 1 Introduction 1 1.1 Differences between Traditional Chinese and Simplified Chinese 1 1.2 Patents and patent filing in China 3 1.2.1 The trend of Taiwan-based applicants filing patents in China 3 1.2.2 Language requirements for filing patents in China 4 1.2.3 Intralingual translation of Taiwan-based patent filing documents 4 1.3 Characteristics of patent specifications 5 1.3.1 Linguistic characteristics of patent claims 6 1.3.2 Structural characteristics of patent claims 7 1.4 Research questions 8 1.5 Research overview 8 Chapter 2 Literature Review 10 2.1 Intralingual Translation 10 2.2 Contrastive studies on Traditional Chinese and Simplified Chinese 12 2.3 Studies on patent translation 15 2.4 Studies on the application of corpus linguistics 17 2.4.1 Criteria for defining a corpus 18 2.4.2 Types of corpora 19 2.4.3 Corpus-driven analysis 20 2.4.4 The industrial application of corpora 25 2.5 Summary 27 Chapter 3 Methodology 29 3.1 Corpus construction criteria 29 3.1.1 Sampling based on patent type 31 3.1.2 Sampling based on IPC 32 3.1.3 Sampling based on region and issue date 34 3.2 Data collection and processing 34 3.2.1 Segmentation 35 3.2.2 Part-of-Speech (POS) Tagging 37 3.3 Identifying prominent n-grams 38 3.3.1 Comparing wordlists 39 3.3.2 N-gram extraction 40 3.3.3 Concordance 41 3.3.4 Collocational strength of n-grams 42 3.4 Identifying nominal equivalents 45 3.4.1 Online MT and TM 45 3.4.2 Building a TM 48 3.4.3 Identifying translation equivalents with Trados 2007 48 3.5 Summary 49 Chapter 4 Discussion 51 4.1 Functional n-grams in Traditional Chinese and Simplified Chinese claims 52 4.1.1其 (qi, its/pronoun) and 其中 (qi zhong, wherein/conjunction) as transitional words 53 4.1.2 的 (de, of/possessive) and 之 (zhi, of/possessive) in the preamble 56 4.1.3 Quantifiers in Traditional Chinese and Simplified Chinese claims 62 4.2 Nominal equivalents in Traditional Chinese and Simplified Chinese claims 64 4.2.1 Multi-translations of nominal n-grams 66 4.2.2 Document frequency indicating lexical preferences 67 4.3 Summary 68 Chapter 5 Conclusion 72 5.1 Review of this study 72 5.1.1 Syntactic function words in patent claims 73 5.1.2 Quantifiers in patent claims 74 5.1.3 Translation equivalents in patent claims 74 5.2 Limitations of this study 76 5.3 Recommendations for future research 77 References 79 Appendices 88 | |
dc.language.iso | en | |
dc.title | 兩岸申請專利範圍之語內翻譯:基於語料庫之搭配及詞彙比較 | zh_TW |
dc.title | Intralingual Translation of Patent Claims in Taiwan and China: A Corpus-based Comparison of Collocations and Word Use | en |
dc.type | Thesis | |
dc.date.schoolyear | 104-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 耿筠(Yun Keng),高照明(Chaoming Kao) | |
dc.subject.keyword | 繁體中文,簡體中文,語料庫,語內翻譯,申請專利範圍, | zh_TW |
dc.subject.keyword | Traditional Chinese,Simplified Chinese,patent claims,corpus,intralingual translation, | en |
dc.relation.page | 103 | |
dc.identifier.doi | 10.6342/NTU201600621 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2016-07-01 | |
dc.contributor.author-college | 文學院 | zh_TW |
dc.contributor.author-dept | 翻譯碩士學位學程 | zh_TW |
顯示於系所單位: | 翻譯碩士學位學程 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-105-1.pdf 目前未授權公開取用 | 1.96 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。