Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊工程學系
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/5164
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor陳信希(Hsin-Hsi Chen)
dc.contributor.authorQing-Cheng Lien
dc.contributor.author李卿澄zh_TW
dc.date.accessioned2021-05-15T17:52:50Z-
dc.date.available2014-08-12
dc.date.available2021-05-15T17:52:50Z-
dc.date.copyright2014-08-12
dc.date.issued2014
dc.date.submitted2014-08-08
dc.identifier.citationAdolphs, P., Theobald, M., Schafer, U., Uszkoreit, H., and Weikum, G. (2011). Yago-qa: Answering questions by structured knowledge queries. In Semantic Computing (ICSC), 2011 Fifth IEEE International Conference on, pages 158–161.
Berant, J., Chou, A., Frostig, R., and Liang, P. (2013). Semantic parsing on freebase from question-answer pairs. In Proceedings of ACL, pages 1533–1544.
Bollacker, K., Evans, C., Paritosh, P., Sturge, T., and Taylor, J. (2008). Freebase: A collaboratively created graph database for structuring human knowledge. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, SIGMOD ’08, pages 1247–1250, New York, NY, USA. ACM.
Bonnefoy, L., Bouvier, V., and Bellot, P. (2013). A weakly-supervised detection of entity central documents in a stream. In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’13, pages 769–772, New York, NY, USA. ACM.
Fader, A., Soderland, S., and Etzioni, O. (2011). Identifying relations for open information extraction. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’11, pages 1535–1545, Stroudsburg, PA, USA. Association for Computational Linguistics.
Frank, J. R., Bauer, S. J., KleimanAWeine, M., Roberts, D. A., Tripuraneni, N., Zhang, C., Re, C., Voorhees, E. M., and Soboroff, I. (2013). Evaluating stream filtering for entity profile updates for trec 2013. In Proceedings of The 22th Text REtrieval Conference (TREC 2013).
Frank, J. R., Kleiman-Weiner, M., Roberts, D. A., Niu, F., Zhang, C., Re, C., and Soboroff, I. (2012). Building an entity-centric stream filtering test collection for trec 2012. In Proceedings of The 21th Text REtrieval Conference (TREC 2012).
Kjersten, B. and McNamee, P. (2012). The hltcoe approach to the trec 2012 kba track. In Proceedings of The 21th Text REtrieval Conference (TREC 2012).
Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P. N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S., and Bizer, C. (2014). DBpedia - a large-scale, multilingual knowledge base extracted from wikipedia. Semantic Web Journal.
Mendes, P. N., Jakob, M., Garcia-Silva, A., and Bizer, C. (2011). Dbpedia spotlight: Shedding light on the web of documents. In Proceedings of the 7th International Conference on Semantic Systems, I-Semantics ’11, pages 1–8, New York, NY, USA. ACM.
Moro, A. and Navigli, R. (2012). Wisenet: Building a wikipedia-based semantic network with ontologized relations. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM ’12, pages 1672–1676, New York, NY, USA. ACM.
Nakashole, N., Weikum, G., and Suchanek, F. (2012a). Discovering and exploring relations on the web. Proc. VLDB Endow., 5(12):1982–1985.
Nakashole, N., Weikum, G., and Suchanek, F. (2012b). Patty: A taxonomy of relational patterns with semantic types. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL ’12, pages 1135–1145, Stroudsburg, PA, USA. Association for Computational Linguistics.
Suchanek, F. M., Kasneci, G., and Weikum, G. (2007). Yago: A Core of Semantic Knowledge. In 16th international World Wide Web conference (WWW 2007), New York, NY, USA. ACM Press.
Takaku, Y., Kaji, N., Yoshinaga, N., and Toyoda, M. (2012). Identifying constant and unique relations by using time-series text. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL ’12, pages 883–892, Stroudsburg, PA, USA. Association for Computational Linguistics.
Wang, J., Song, D., Lin, C.-Y., and Liao, L. (2013). Bit and msra at trec kba ccr track 2013. In Proceedings of The 22th Text REtrieval Conference (TREC 2013).
Yao, X. and Van Durme, B. (2014). Information extraction over structured data: Question answering with freebase. In Proceedings of ACL.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/5164-
dc.description.abstract世界上的知識日新月異,透過志願編輯者更新的知識庫無法跟上知識產生與改變的速度,如何縮短知識產生與知識庫更新間的差距,也就是知識庫加速,便成為了重要的議題。
知識庫中記載的實體與其特性也是相當重要的知識,本研究提出了基於樣式,自資訊匯集而成之內容串流中快速地偵測文件是否包含特定實體特性的方法。偵測流程包含了樣式比對、樣式篩選與特性消歧義等步驟。透過樣式比對與實體特性與樣式的關聯偵測實體特性,存在樣式的品質、可信賴度、對映特性的歧義等問題,本研究於樣式比對前進行樣式篩選,比對後進行特性消歧義以降低上述問題的影響。
實驗結果分析了樣式信心值、可信賴度、特性歧義度對效能造的影響,發現特性消歧義的步驟中,引入實體類型資訊與使用簡單貝氏分類器後,偵測效能有顯著的提升。
透過實體特性的偵測,有助於自內容串流中篩選對知識庫更新有幫助的文章,以供志願編輯者作為更新與維護知識庫的依據。
zh_TW
dc.description.abstractWorld knowledge varies with time, but the change of knowledge about an entity often waits for a long time before a human editor update it in knowledge base (KB). How to accelerate the update of KB is an important problem, it’s also called knowledge base acceleration (KBA).
In this paper, we propose a method that detects entity’s properties in content stream efficiently and effectively base on patterns. The detection process has three phases including pattern selection phase, pattern matching phase and property disambiguation phase. pattern quality, reliability and ambiguity are three major issues in the process.
The experimental results show the impact of patterns’ confidence value,
reliability and ambiguity degree. We found that using the entity type information and naive bayes classifier improve the performance of the detection system.
Detection of entity’s properties filters documents from content stream. It’s
helpful for human editors to use the information in those documents to update the KB.
en
dc.description.provenanceMade available in DSpace on 2021-05-15T17:52:50Z (GMT). No. of bitstreams: 1
ntu-103-R01922024-1.pdf: 5984321 bytes, checksum: f0708fdfd3609643c33c07f3d6ea3cca (MD5)
Previous issue date: 2014
en
dc.description.tableofcontents口試委員會審定書 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i
誌謝 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ii
摘要 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii
Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv
目錄 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
圖目錄 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
表目錄 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . viii
第一章 緒論 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 背景介紹 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 研究動機 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 研究目標 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4 論文架構 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
第二章 相關研究與文獻 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.1 知識庫 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.1.1 結構化知識庫 . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.1.2 知識庫加速 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.1.3 知識庫的應用 . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.2 樣式與實體間關係 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
第三章 研究方法 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.1 以樣式偵測特性 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.2 樣式比對 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.3 樣式篩選 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
3.4 特性消歧義 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
第四章 實驗結果與分析 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
4.1 測試資料集 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
4.2 評估標準 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
4.3 實驗結果 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
4.3.1 效率 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
4.3.2 原始效能 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
4.3.3 樣式篩選 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
4.3.4 特性消歧義 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
4.3.5 錯誤分析 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
第五章 結論與未來展望 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
5.1 結論 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
5.2 未來展望 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
參考文獻 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
dc.language.isozh-TW
dc.subject實體特性偵測zh_TW
dc.subject知識庫加速zh_TW
dc.subject樣式比對zh_TW
dc.subjectKnowledge Base Accelerationen
dc.subjectEntity’s Property Detectionen
dc.subjectPattern Matchingen
dc.title內容串流中實體特性偵測之研究zh_TW
dc.titleDetection of Entity Properties in Content Streamen
dc.typeThesis
dc.date.schoolyear102-2
dc.description.degree碩士
dc.contributor.oralexamcommittee鄭卜壬(Pu-Jen Cheng),蔡銘峰(Ming-Feng Tsai),郭俊桔(June-Jei Kuo)
dc.subject.keyword知識庫加速,樣式比對,實體特性偵測,zh_TW
dc.subject.keywordKnowledge Base Acceleration,Pattern Matching,Entity’s Property Detection,en
dc.relation.page44
dc.rights.note同意授權(全球公開)
dc.date.accepted2014-08-08
dc.contributor.author-college電機資訊學院zh_TW
dc.contributor.author-dept資訊工程學研究所zh_TW
顯示於系所單位:資訊工程學系

文件中的檔案:
檔案 大小格式 
ntu-103-1.pdf5.84 MBAdobe PDF檢視/開啟
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved