Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊網路與多媒體研究所
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/53968
標題: 分析顧客評論之情景及其情緒分析之研究
Analyzing Context in Customer Reviews for Sentiment
Analysis
作者: Jun-Jie Wang
王俊杰
指導教授: 陳信希(Hsin-Hsi Chen)
關鍵字: 自然語言處理,顧客評論之意見探勘,情境相關之情緒分析,
Natural Language Processing,Opinion Mining on Customer Reviews,Context-Aware Sentiment Analysis,
出版年 : 2015
學位: 碩士
摘要: In this “Big Data Era”, mining valuable information from customer reviews can benefit both companies and customers. Sentiment analysis on customer reviews remains to be challenging due to the variety of human language
usages.
This thesis proposes a new concept of context in sentiment analysis by exploring real data and referencing works in linguistics. In spite of the popular definition of context which refers to the surroundings of a word and is often
used in word sense disambiguation, our context refers to a fragment of texts which only contain ambiguous opinion words or even no opinion words. To conduct the sentiment analysis on customer reviews, a new Chinese customer review dataset in the hotel domain has is on both the snippet level and the clause level. For clause segmentation, two strategies, i.e., punctuation-based method and parsing-based method, have been proposed, and the polarity shift caused by discourse markers and negation operators is also considered in the parsing-based method.
To extract the context, two types of the opinion word lexicons are first built and then checked and filtered manually. Thus, four types of lexicons are constructed. We then apply these four lexicons to extracting context on both the snippet level and the context level. In the sentiment classification experiments, Support Vector Machine (SVM) model with bag-of-words features and Convolutional Neuron Network (CNN) model with word embedding vectors are employed. The experimental results show that CNN model can always achieve the highest performance on both levels. Besides, the results also indicate that considering context in the process of sentiment analysis on customer reviews is necessary. In addition, detail error analysis and case study are also performed to understand the experimental results.
URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/53968
全文授權: 有償授權
顯示於系所單位:資訊網路與多媒體研究所

文件中的檔案:
檔案 大小格式 
ntu-104-1.pdf
  目前未授權公開取用
1.14 MBAdobe PDF
顯示文件完整紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved