Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊工程學系
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/70854
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor李允中
dc.contributor.authorPo-Yee Liuen
dc.contributor.author劉鎛漪zh_TW
dc.date.accessioned2021-06-17T04:41:06Z-
dc.date.available2019-08-08
dc.date.copyright2018-08-08
dc.date.issued2018
dc.date.submitted2018-08-06
dc.identifier.citation[1] Penn treebank, [online] https://web.archive.org/web/19970614160127/ http://www.cis.upenn.edu/˜treebank/. 2018.
[2] Universal dependencies, [online] http://universaldependencies.org. 2018.
[3] Wikipedia data dump, [online] https://dumps.wikimedia.org/. 2018.
[4] J. Allan. Topic detection and tracking: event-based information organization, volume 12. Springer Science & Business Media, 2012.
[5] J. Allan, R. Papka, and V. Lavrenko. On-line new event detection and tracking. In Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, pages 37–45. ACM, 1998.
[6] G. Angeli, M. J. J. Premkumar, and C. D. Manning. Leveraging linguistic structure for open domain information extraction. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), volume 1, pages 344–354, 2015.
[7] K. Bollacker, C. Evans, P. Paritosh, T. Sturge, and J. Taylor. Freebase: a collaboratively created graph database for structuring human knowledge. In Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pages 1247–1250. AcM, 2008.
[8] A. Bordes, N. Usunier, A. Garcia-Duran, J. Weston, and O. Yakhnenko. Translating embeddings for modeling multi-relational data. In Advances in neural information processing systems, pages 2787–2795, 2013.
[9] K. Clark and C. D. Manning. Deep reinforcement learning for mention-ranking coreference models. In Empirical Methods on Natural Language Processing, 2016.
[10] S. Cucerzan. Large-scale named entity disambiguation based on wikipedia data. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), 2007.
[11] M.-C. De Marneffe, B. MacCartney, C. D. Manning, et al. Generating typed dependency parses from phrase structure parses. In Proceedings of LREC, volume 6, pages 449–454. Genoa Italy, 2006.
[12] C. Fellbaum. Wordnet: an eletronic lexical database. cambridge, massachusetts, eua, 1998.
[13] J. R. Finkel, T. Grenager, and C. Manning. Incorporating non-local information into information extraction systems by gibbs sampling. In Proceedings of the 43rd annual meeting on association for computational linguistics, pages 363–370. Association for Computational Linguistics, 2005.
[14] O.-E. Ganea, M. Ganea, A. Lucchi, C. Eickhoff, and T. Hofmann. Probabilistic bag-of-hyperlinks model for entity linking. In Proceedings of the 25th International Conference on World Wide Web, pages 927–938. International World Wide Web Conferences Steering Committee, 2016.
[15] M. Girvan and M. E. Newman. Community structure in social and biological networks. Proceedings of the national academy of sciences, 99(12):7821–7826, 2002.
[16] S. Gregory. Finding overlapping communities in networks by label propagation. New Journal of Physics, 12(10):103018, 2010.
[17] Z. Guo and D. Barbosa. Robust entity linking via random walks. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, pages 499–508. ACM, 2014.
[18] R. Jenatton, N. L. Roux, A. Bordes, and G. R. Obozinski. A latent factor model for highly multi-relational data. In Advances in Neural Information Processing Systems, pages 3167–3175, 2012.
[19] C. D. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. J. Bethard, and D. McClosky. The Stanford CoreNLP natural language processing toolkit. In Association for Computational Linguistics (ACL) System Demonstrations, pages 55–60, 2014.
[20] D. Milne and I. H. Witten. Learning to link with wikipedia. In Proceedings of the 17th ACM conference on Information and knowledge management, pages 509–518. ACM, 2008.
[21] M. Nickel, K. Murphy, V. Tresp, and E. Gabrilovich. A review of relational machine learning for knowledge graphs. Proceedings of the IEEE, 104(1):11–33, 2016.
[22] H. Sayyadi, M. Hurst, and A. Maykov. Event detection and tracking in social streams. In Icwsm, 2009.
[23] H. Sayyadi, M. Hurst, and A. Maykov. Event detection and tracking in social streams. In Icwsm, 2009.
[24] R. Socher, D. Chen, C. D. Manning, and A. Ng. Reasoning with neural tensor networks for knowledge base completion. In Advances in neural information processing systems, pages 926–934, 2013.
[25] F. M. Suchanek, G. Kasneci, and G. Weikum. Yago: a core of semantic knowledge. In Proceedings of the 16th international conference on World Wide Web, pages 697–706. ACM, 2007.
[26] K. Toutanova, D. Klein, C. D. Manning, and Y. Singer. Feature-rich part-of-speech tagging with a cyclic dependency network. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1, pages 173–180. Association for Computational Linguistics, 2003.
[27] J. Weng and B.-S. Lee. Event detection in twitter. ICWSM, 11:401–408, 2011.
[28] J. J. Whang, D. F. Gleich, and I. S. Dhillon. Overlapping community detection using seed set expansion. In Proceedings of the 22nd ACM international conference on Conference on information & knowledge management, pages 2099–2108. ACM, 2013.
[29] J. Yang and J. Leskovec. Structure and overlaps of communities in networks. arXiv preprint arXiv:1205.6228, 2012.
[30] J. Yang and J. Leskovec. Overlapping community detection at scale: a nonnegative matrix factorization approach. In Proceedings of the sixth ACM international conference on Web search and data mining, pages 587–596. ACM, 2013.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/70854-
dc.description.abstract新聞在生活中扮演重要的角色,而新聞中的事件卻是一個抽象的概念,我們定義事件是一群實體的集合。利用 Wikipedia 的資訊,我們整合三個 entity linking 系統 Wiki, PBoH 和 RandomWalk,並將新聞中的實體名字對應到 Wikipedia 的實體。並使用 TransE 模型將知識圖譜向量化,保存實體和實體之間的關係。最後,我們從新聞中建立一實體圖,用辨識圖型中重疊社群 (overlapping community) 的方法,使用 Nonnegative Matrix Factorization 的模型,辨識出新聞中的事件。zh_TW
dc.description.abstractNowadays, news is an important part of our daily life. But the event in news is an abstract concept. We define an event as a group of entities. By leveraging the information from Wikipedia, we integrate three entity linking system Wiki, PBoH and RandomWalk to link the entity mention in documents to correspondent entities in Wikipedia. We also use TransE model to embed each entity as a vector, which contains the relational information between entities. Finally, we build an entity graph from news document and use Nonnegative Matrix Factorization approach to find the overlapping community in our entity graph. The community is a group of vertices(entities) in the entity graph, which corresponds to our entity definition.en
dc.description.provenanceMade available in DSpace on 2021-06-17T04:41:06Z (GMT). No. of bitstreams: 1
ntu-107-R05922118-1.pdf: 1889505 bytes, checksum: b5dbfd3342fff3c3f9904557f6898c30 (MD5)
Previous issue date: 2018
en
dc.description.tableofcontents致謝 ii
摘要 iii
Abstracts iv
List of Figures viii
List of Tables ix
Chapter 1 Introduction 1
Chapter 2 Related Work 4
2.1 Stanford Core NLP . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2.1.1 Part-Of-Speech Tagger . . . . . . . . . . . . . . . . . . . . . . 4
2.1.2 Dependency Parser . . . . . . . . . . . . . . . . . . . . . . . . 5
2.1.3 Named Entity Recognizer . . . . . . . . . . . . . . . . . . . . 5
2.1.4 Coreference Resolution . . . . . . . . . . . . . . . . . . . . . . 5
2.1.5 Open Information Extraction . . . . . . . . . . . . . . . . . . 5
2.2 Entity Linking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.2.1 Task Description . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.2.2 Knowledge Base . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.2.3 Entity Linking Modules . . . . . . . . . . . . . . . . . . . . . 7
2.3 Knowledge Graph Embedding . . . . . . . . . . . . . . . . . . . . . . 8
2.3.1 Translating Embeddings . . . . . . . . . . . . . . . . . . . . . 9
2.4 Community Detection . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.4.1 Overlapping Community Detection . . . . . . . . . . . . . . . 10
2.4.2 Nonnegative Matrix Factorization . . . . . . . . . . . . . . . . 11
Chapter 3 News Pipeline 12
Chapter 4 Parse News Document 14
Chapter 5 Entity Linking 15
5.1 Candidate Entity Generation . . . . . . . . . . . . . . . . . . . . . . 16
5.2 Candidate Entity Ranking . . . . . . . . . . . . . . . . . . . . . . . . 17
5.3 Experiment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
5.3.1 Benchmark . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
5.3.2 Evaluation Matrics . . . . . . . . . . . . . . . . . . . . . . . . 24
5.3.3 Result . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
5.3.4 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
Chapter 6 Entity Embedding 27
6.1 Establish Knowledge Graph . . . . . . . . . . . . . . . . . . . . . . . 27
6.2 Knowledge Graph Embedding . . . . . . . . . . . . . . . . . . . . . . 28
6.3 Experiment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
Chapter 7 Event Discovery 31
7.1 Build Entity Graph . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
7.2 Community Detection . . . . . . . . . . . . . . . . . . . . . . . . . . 32
7.2.1 Event Discovery Demonstration . . . . . . . . . . . . . . . . . 33
7.2.2 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
Chapter 8 Conclusion 34
Bibliography 36
dc.language.isozh-TW
dc.subject新聞zh_TW
dc.subject事件zh_TW
dc.subject實體連結zh_TW
dc.subject知識圖譜向量化zh_TW
dc.subject社群辨識zh_TW
dc.subjectknowledge graph embeddingen
dc.subjecteventen
dc.subjectentity linkingen
dc.subjectcommunity detectionen
dc.subjectnewsen
dc.title使用事件向量解析新聞zh_TW
dc.titleDeciphering News with Event Embeddingen
dc.typeThesis
dc.date.schoolyear106-2
dc.description.degree碩士
dc.contributor.oralexamcommittee鄭有進,郭忠義,李信杰,薛念林
dc.subject.keyword新聞,事件,實體連結,知識圖譜向量化,社群辨識,zh_TW
dc.subject.keywordnews,event,entity linking,knowledge graph embedding,community detection,en
dc.relation.page39
dc.identifier.doi10.6342/NTU201802570
dc.rights.note有償授權
dc.date.accepted2018-08-06
dc.contributor.author-college電機資訊學院zh_TW
dc.contributor.author-dept資訊工程學研究所zh_TW
顯示於系所單位:資訊工程學系

文件中的檔案:
檔案 大小格式 
ntu-107-1.pdf
  未授權公開取用
1.85 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved