Concept Representation and Its Application (概念表徵及其應用)

Chi-Hsin Yu; 游基鑫

Please use this Handle URI to cite this document: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/5881

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	陳信希(Hsin-Hsi Chen)
dc.contributor.author	Chi-Hsin Yu	en
dc.contributor.author	游基鑫	zh_TW
dc.date.accessioned	2021-05-16T16:18:00Z	-
dc.date.available	2017-08-28
dc.date.available	2021-05-16T16:18:00Z	-
dc.date.copyright	2013-08-28
dc.date.issued	2013
dc.date.submitted	2013-08-16
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/5881	-
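dc.description.abstract	In this dissertation, we define the notion of a concept and, based on this definition, propose a framework for building concept representations for a system. We apply the framework to two applications: commonsense knowledge classification and word sense disambiguation. In addition, we verify two hypotheses related to knowledge extraction: whether commonsense knowledge appears in text, and whether a small-scale collection of web documents is sufficient to support important natural language processing tasks. Finally, we present preprocessing results for ClueWeb09, a web-scale dataset, in the hope of providing more usable resources to other researchers. Our definition of concept meets three criteria: it is inherently computable, it has no undefined components, and it has built-in properties that can be analyzed by humans or by the machine itself. We define a concept as a continuation, which can be viewed as a temporary state in the process of concept computation; this temporary state is interpreted within the framework of the evolutionary language game. On the basis of this definition, we divide concept representation into static and dynamic aspects, and use machine learning theory to examine many facets of the system theoretically. When applying concept representation to commonsense knowledge classification, we build the representation with a vector space model and show how our definition of concept can be used to interpret the general machine learning process. In word sense disambiguation, we go a step further and use the concepts we developed to introduce two aspects, context appropriateness and concept fitness, which we use to construct novel word sense disambiguation algorithms. To support building concepts for machines through automatic knowledge extraction in the future, we examine two basic questions: the content and the size of knowledge sources. To confirm that documents are a good source of knowledge content, we show that even commonsense knowledge appears in documents. Furthermore, using the word ordering error problem, we indirectly verify that although ClueWeb09 covers only a small fraction of the pages on the web, its scale already yields the same experimental results as the Google Web 5-gram and can well support important natural language processing tasks. Finally, we preprocess the web-scale ClueWeb09 dataset and produce many useful resources for researchers, including (1) an English corpus with part-of-speech tagging, phrase chunking, and sentence parsing; (2) a Chinese corpus with word segmentation, part-of-speech tagging, and discourse marker labeling; and (3) a Chinese part-of-speech n-gram dataset (NTU Chinese POS 5-gram).	zh_TW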
dc.description.abstract	In this dissertation, we propose a definition of concept in language, derive a concept representation scheme from this definition, and apply the resulting framework to two applications: commonsense knowledge classification and word sense disambiguation (WSD). In addition, we verify two assumptions that are important for building concept representations through knowledge extraction: that commonsense knowledge appears in texts, and that a small part of the Web is sufficient to support important NLP tasks. Last, we introduce the processed ClueWeb09 datasets, which we hope can boost NLP research. We give a definition of concept that meets three criteria: it originates natively from a computational perspective, it contains no undefined terms, and it has a built-in nature that allows deep analysis, by humans and by the intelligent system itself, of the internal structures of an intelligent system. We define a concept as a continuation, which is a temporary state in the concept computation process. This temporary state is interpreted within the context of the evolutionary language game. Based on this definition, we define concept representation as having two parts: a static part and a dynamic part. We investigate some theoretical aspects of the definition using theories from the machine learning literature. In the application to commonsense knowledge classification, we adopt a vector space model to build the representation and interpret this machine learning process within our framework. In WSD, we further apply our framework to develop two new concepts: context appropriateness and concept fitness. We use these two concepts to build many new algorithms for the WSD problem. To prepare for using knowledge extraction to build concept representations in the future, we verify two important aspects: the content of knowledge and the size of knowledge sources. We find that commonsense knowledge is recorded in texts, which supports the view that the Web is a good source from which to extract human knowledge. We use a word ordering error task to verify indirectly that a small part of the Web, such as the ClueWeb09 dataset, can support NLP applications with results comparable to those obtained from larger datasets, such as the Google Web 5-gram dataset. These two findings give us confidence that knowledge extracted from a smaller dataset can be used to build concept representations. Lastly, we preprocess the English and Chinese web pages in ClueWeb09 and produce many resources for researchers, including (1) a POS-tagged, phrase-chunked, and partly parsed English dataset, (2) a segmented, POS-tagged Chinese dataset with discourse markers identified, and (3) the NTU Chinese POS 5-gram dataset.	en
dc.description.provenance	Made available in DSpace on 2021-05-16T16:18:00Z (GMT). No. of bitstreams: 1 ntu-102-D95922002-1.pdf: 1287631 bytes, checksum: e9ac12b0cfdfe4c6fac1ff03418e63da (MD5) Previous issue date: 2013	en
dc.description.tableofcontents	Chapter 1. Introduction 1
1.1 Motivation 1
1.2 Overview of this Dissertation 3
Chapter 2. Concept as Continuation 5
2.1 Concept Theory 5
2.2 Concept and Language 8
2.3 Defining Concept as a Continuation 11
2.4 Some Theoretical Aspects of the Definition 15
2.5 Related Work in Concept Definition 21
2.6 Considerations of Implementation 24
Chapter 3. Concept Representation 26
3.1 Representation of Continuation 26
3.2 Related Work 28
3.3 Framework of Knowledge Extraction 31
3.4 Applications of Concept Representation 32
Chapter 4. Commonsense Knowledge Classification 34
4.1 OMCS Database 35
4.2 Related Work 35
4.3 Concept Representation Scheme for Phrase 37
4.4 CSK Classification Algorithm 38
4.5 Experiment Settings 39
4.6 Experiment Results 40
4.7 Interpretation of Feature Engineering Process 41
Chapter 5. Word Sense Disambiguation 42
5.1 Introduction 42
5.2 Related Work 46
5.3 Context Appropriateness and Concept Fitness 50
5.4 Problem Formulations in WSD 54
5.4.1 Multi-class Classification (Baseline) 55
5.4.2 Multi-class Classification with Meaning Composition 55
5.4.3 Binary Classification with Meaning Composition 57
5.4.4 Ranking 2-Level with Meaning Composition 58
5.4.5 Ranking 3-Level with Meaning Composition 60
5.5 Feature Extraction and Experiment Settings 61
5.6 Experiment Results 64
Chapter 6. Knowledge Sources 67
6.1 ClueWeb09 Dataset 68
6.2 Commonsense Knowledge in the Web 68
6.3 Preprocessing of English Web Pages 70
6.4 Preprocessing of Chinese Web Pages 71
6.5 A Verification of ClueWeb09 Dataset 75
Chapter 7. Conclusion and Future Work 78
REFERENCE 80
APPENDIX I. The Definition of definition 89
APPENDIX II. The Filtering of Noise Texts 91
APPENDIX III. English POS Tag Distribution 94
dc.language.iso	en
dc.title	概念表徵及其應用	zh_TW
dc.title	Concept Representation and Its Application	en
dc.type	Thesis
dc.date.schoolyear	101-2
dc.description.degree	Doctoral
dc.contributor.oralexamcommittee	張俊盛(Jason S. Chang),盧文祥(Wen-Hsiang Lu),鄭卜壬(PJ Cheng),陳建錦(Chien Chin Chen),曾元顯(Yuen-Hsien Tseng)
dc.subject.keyword	Concept representation,commonsense knowledge,word sense disambiguation,ranking	zh_TW
dc.subject.keyword	Concept Representation,Continuation,Commonsense,WSD,Ranking	en
dc.relation.page	95
dc.rights.note	Authorization granted (open access worldwide)
dc.date.accepted	2013-08-16
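dc.contributor.author-college	College of Electrical Engineering and Computer Science	zh_TW
dc.contributor.author-dept	Graduate Institute of Computer Science and Information Engineering	zh_TW
Appears in Collections:	Department of Computer Science and Information Engineering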

Files in This Item:

File	Size	Format
ntu-102-1.pdf	1.26 MB	Adobe PDF

Items in the system are protected by copyright, with all rights reserved, unless their copyright terms are specifically indicated otherwise.