Please use this Handle URI to cite this item:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/691
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 鄭卜壬(Pu-Jen Cheng) | |
dc.contributor.author | Hao-Ming Fu | en |
dc.contributor.author | 傅浩明 | zh_TW |
dc.date.accessioned | 2021-05-11T04:58:02Z | - |
dc.date.available | 2019-08-28 | |
dc.date.available | 2021-05-11T04:58:02Z | - |
dc.date.copyright | 2019-08-28 | |
dc.date.issued | 2019 | |
dc.date.submitted | 2019-08-07 | |
dc.identifier.citation | Sanjeev Arora, Yingyu Liang, and Tengyu Ma. 2017. A simple but tough-to-beat baseline for sentence embeddings. In ICLR.
Minmin Chen. 2017. Efficient vector representation for documents through corruption. In ICLR.
R. Kiros, Y. Zhu, R. Salakhutdinov, R. Zemel, R. Urtasun, A. Torralba, and S. Fidler. 2015. Skip-thought vectors. In Advances in Neural Information Processing Systems.
Q. V. Le and T. Mikolov. 2014. Distributed representations of sentences and documents. In ICML, volume 14.
Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Ng, and Christopher Potts. 2011. Learning word vectors for sentiment analysis. In ACL.
Julian McAuley, Jure Leskovec, and Dan Jurafsky. 2012. Learning attitudes and attributes from multi-aspect reviews. In Proceedings of ICDM. IEEE.
T. Mikolov, K. Chen, G. Corrado, and J. Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781.
P. Vincent, H. Larochelle, Y. Bengio, and P. Manzagol. 2008. Extracting and composing robust features with denoising autoencoders. In ICML.
Lingfei Wu, Ian En-Hsu Yen, Kun Xu, Fangli Xu, Avinash Balakrishnan, Pin-Yu Chen, Pradeep Ravikumar, and Michael J. Witbrock. 2018. Word mover's embedding: From word2vec to document embedding. In EMNLP. | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/handle/123456789/691 | - |
dc.description.abstract | 文件的向量表達方式在自然語言處理的許多應用上扮演核心角色。尤其,以非監督學習所得到的一般性向量表達在這些應用中更是一大助益。在實務上,情緒分析是一個縱使困難,卻被認為非常語意層面的應用,也因此常被用來當作檢測向量品質的工具。目前以非監督方式學習文件向量的方法主要可分為以下兩類:序列式的,他們直接把字彙間的排列順序納入考慮,以及非序列式的,他們不直接考慮字彙間的順序。然而,他們各自都有各自的問題仍待解決。在這篇論文中,我們提出一個模型,可以同時解決這兩種主要方法所面臨的難處。實驗證明我們所提出的方法在常見的情緒分析和同時考量多層面的細緻情緒分析上,都遠遠優於現有的最佳方法。 | zh_TW |
dc.description.abstract | Document representation is at the core of many NLP tasks involving machine understanding. A general representation learned in an unsupervised manner preserves generality and can be reused across applications. In practice, sentiment analysis (SA) is a challenging task that is regarded as deeply semantic and is therefore often used to assess the quality of general representations. Existing methods for unsupervised document representation learning fall into two families: sequential ones, which explicitly take the ordering of words into consideration, and non-sequential ones, which do not. However, each family suffers from its own weaknesses. In this paper, we propose a model that overcomes the difficulties encountered by both families of methods. Experiments show that our model outperforms state-of-the-art methods on popular SA datasets and on a fine-grained aspect-based SA task by a large margin. | en |
dc.description.provenance | Made available in DSpace on 2021-05-11T04:58:02Z (GMT). No. of bitstreams: 1 ntu-108-R06922092-1.pdf: 1159027 bytes, checksum: 0485a7d456581a17b98bb068644bb7e3 (MD5) Previous issue date: 2019 | en |
dc.description.tableofcontents | 口試委員會審定書 ... #
Acknowledgement ... i
中文摘要 ... ii
ABSTRACT ... iii
CONTENTS ... iv
LIST OF FIGURES ... vi
LIST OF TABLES ... vii
Chapter 1 Introduction ... 1
Chapter 2 Related Works ... 3
Chapter 3 Our approach ... 5
3.1 Overview ... 5
3.2 Architecture ... 7
3.2.1 Model ... 7
3.2.2 Sentence encoders ... 9
3.3 Training ... 9
3.3.1 Context loss ... 9
3.3.2 Document loss ... 10
3.3.3 Total loss ... 10
3.4 Inference of document representation ... 11
Chapter 4 Experiments ... 12
4.1 Sentiment analysis ... 12
4.1.1 Dataset ... 12
4.1.2 Experiment design ... 12
4.1.3 Results and discussion ... 14
4.2 Aspect-based sentiment analysis ... 15
4.2.1 Dataset and Experiment design ... 15
4.2.2 Results and discussion ... 16
Chapter 5 Conclusions ... 18
REFERENCE ... 19 | |
dc.language.iso | en | |
dc.title | 以非監督學習之語義文件向量進行細緻多層面情緒分析 | zh_TW |
dc.title | Learning Unsupervised Semantic Document Representation for Fine-grained Aspect-based Sentiment Analysis | en |
dc.date.schoolyear | 107-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 陳信希(Hsin-Hsi Chen),陳柏琳(Berlin Chen),林守德(Shou-De Lin) | |
dc.subject.keyword | 文件向量,句子向量,非監督學習,情緒分析,語義學習,文字分類, | zh_TW |
dc.subject.keyword | Document representation,Sentence embedding,Unsupervised learning,Sentiment analysis,Semantic learning,Text classification, | en |
dc.relation.page | 19 | |
dc.identifier.doi | 10.6342/NTU201902410 | |
dc.rights.note | 同意授權(全球公開) | |
dc.date.accepted | 2019-08-07 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 資訊工程學研究所 | zh_TW |
Appears in Collections: | 資訊工程學系 |
Files in This Item:
File | Size | Format | |
---|---|---|---|
ntu-108-1.pdf | 1.13 MB | Adobe PDF | View/Open |
All items in this repository are protected by copyright, with all rights reserved, unless otherwise indicated.