請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/17058
標題: | 以自然語言處理分析社群網路願望之研究 Detecting Chinese Wish Messages in Social Media and Categorizing into Knowledge Base |
作者: | George Chang 張衡 |
指導教授: | 許永真(Jane Yung-Jen Hsu) |
關鍵字: | 自然語言處理,社群網路,願望, linkwish,Social Media,NLP, |
出版年 : | 2013 |
學位: | 碩士 |
摘要: | 夢想與希望自古以來象徵著人類的價值與前進的動力。在網路社群(Social Media)與微網誌盛行之時,網路的隔世造就世人輕易於雲端坦白露念,因此期許與願望不再受限於噴泉、宗教神氏或流星殞落之時。收集分析為網誌之中的研究願望,不但能從中發探究商場產品的趨勢與潛在市場,也能挖掘特殊需求並提供解決方案,受惠企業、百姓與弱勢族群。藉由分析一個活躍於港澳台區域的 Linkwish 行動願望社群網站。我們得以歸納了解願望之特性與內容,並以支援向量機(Support Vector Machine)搭配多種語言特徵作為依據,偵測網誌是否為願望,並藉由圖樣分析取得其目標資訊,終將願望分類至知識庫(Knowledge Base)做為具有認知意義的分類。以便用於檢索與統計。本篇論文使用語言特徵能提升願望偵測準確達0.95 AUC,對於精簡明確的願望能準確分析出願望目標資訊,並分類至知識庫。 People have wishes and sometimes share their wishes in social media, hoping to get greetings or to find partners with the same wishes. By collecting and analyzing those wishes, we may find out not only the trend of common wishes, but also the needs of individuals. This paper presents a preliminary study of Chinese wish analysis. We provide analysis on the data from Linkwish, which is a micro social network for wish sharing with users mainly from Taiwan, Hong Kong, and Macao. Then, we use SVM with various types of features to classify these messages as wish or not, extract wish target information, and categorized wish into knowledge base. Our experimental results show that some features in wish detector can achieve average areas under precision-recall curves higher than 0.95 in 10-fold cross validation, And extract target, link into knowledge base from simple wishes. |
URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/17058 |
全文授權: | 未授權 |
顯示於系所單位: | 資訊網路與多媒體研究所 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-102-1.pdf 目前未授權公開取用 | 3 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。