請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/27005
標題: | 中文資訊檢索之詞彙資源效益 Effectiveness of Vocabulary Resources in Chinese Information Retrieval |
作者: | Fu-chia Lin 林孚嘉 |
指導教授: | 陳光華 |
關鍵字: | 資訊檢索,查詢詞彙擴張,詞彙輔助資源,知識本體, Chinese Information Retrieval,Query Expansion,Vocabulary Resources,Ontology, |
出版年 : | 2008 |
學位: | 碩士 |
摘要: | 摘要
本研究以NTCIR第五次檢索會議實驗所提供之標準文件集與問題集為實驗測試環境,挑選問題集內容敘述全為中文之檢索問題進行檢索實驗。同時提供受測者三種不同觀念與技術所產生之詞彙輔助資源:傳統式索引典、統計式索引典、知識本體,經相關詞回饋處理後,供受測者選取查詢擴張詞辭彙。研究結果發現以增進檢索效益之題數來看,統計式索引典詞彙輔助資源提升檢索效益的題目數略多於知識本體輔助資源;但以增進檢索效應之幅度來說,則是知識本體詞彙輔助資源最好。本次實驗發現,傳統式索引典詞彙輔助資源提升檢索效益的表現最差。 Abstract In this study, the effectiveness of different kinds of vocabulary resources for Chinese information retrieval are examined and compared based on interactions between users and the information retrieval system. We use traditional thesaurus, statistical thesaurus, and ontology to carry out a series of experiments for detailed investigation. The NTCIR5 test collection is used as the benchmark, which is composed of topic set, document set, and answer set. In order to make the study much more targeted, 25 queries with Chinese only are extracted and examined from totally 50 queries in NTCIR5 topic sets. The experimental results show that the statistical thesaurus greatly increases the number of improved queries, but ontology greatly increases the retrieval performance. Traditional thesaurus shows the poorest performance among these vocabulary resources. We also find that the users with good experience in information retrieval do well utilize vocabulary resources, and produce good retrieval results. In addition, all vocabulary resources do help Type-II queries, i.e., queries with simple concepts and non-specific temporal and spacial scope. |
URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/27005 |
全文授權: | 有償授權 |
顯示於系所單位: | 圖書資訊學系 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-97-1.pdf 目前未授權公開取用 | 827.14 kB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。