Skip navigation

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More
DSpace logo
English
中文
  • Browse
    • Communities
      & Collections
    • Publication Year
    • Author
    • Title
    • Subject
    • Advisor
  • Search TDR
  • Rights Q&A
    • My Page
    • Receive email
      updates
    • Edit Profile
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊網路與多媒體研究所
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/71351
Full metadata record
???org.dspace.app.webui.jsptag.ItemTag.dcfield???ValueLanguage
dc.contributor.advisor許永真
dc.contributor.authorSzu-Hung Wangen
dc.contributor.author王斯泓zh_TW
dc.date.accessioned2021-06-17T05:59:15Z-
dc.date.available2019-02-15
dc.date.copyright2019-02-15
dc.date.issued2019
dc.date.submitted2019-02-13
dc.identifier.citation[1] R. Agerri, M. Cuadros, S. Gaines, and G. Rigau. Opener: Open polarity enhanced named entity recognition. Procesamiento del Lenguaje Natural, 51:215–218, 2013.
[2] M. Allamanis, H. Peng, and C. A. Sutton. A convolutional attention network for extreme summarization of source code. In M. Balcan and K. Q. Weinberger, editors, Proceedings of the 33nd International Conference on Machine Learning, ICML 2016, New York City, NY, USA, June 19-24, 2016, volume 48 of JMLR Workshop and Conference Proceedings, pages 2091–2100. JMLR.org, 2016.
[3] P. Anderson, X. He, C. Buehler, D. Teney, M. Johnson, S. Gould, and L. Zhang. Bottom-up and top-down attention for image captioning and visual question answer- ing. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, June 18-22, 2018, pages 6077–6086. IEEE Computer Society, 2018.
[4] P. Arthur, G. Neubig, and S. Nakamura. Incorporating discrete translation lexi- cons into neural machine translation. In J. Su, X. Carreras, and K. Duh, editors, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Pro- cessing, EMNLP 2016, Austin, Texas, USA, November 1-4, 2016, pages 1557–1567. The Association for Computational Linguistics, 2016.
[5] D. Bahdanau, K. Cho, and Y. Bengio. Neural machine translation by jointly learning to align and translate. CoRR, abs/1409.0473, 2014.
[6] J. Barnes, R. Klinger, and S. Schulte im Walde. Assessing state-of-the-art sentiment models on state-of-the-art sentiment datasets. In A. Balahur, S. M. Mohammad, and E. van der Goot, editors, Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, WASSA@EMNLP 2017, Copenhagen, Denmark, September 8, 2017, pages 2–12. Association for Com- putational Linguistics, 2017.
[7] X. Ding, B. Liu, and P. S. Yu. A holistic lexicon-based approach to opinion mining. In M. Najork, A. Z. Broder, and S. Chakrabarti, editors, Proceedings of the Inter- national Conference on Web Search and Web Data Mining, WSDM 2008, Palo Alto, California, USA, February 11-12, 2008, pages 231–240. ACM, 2008.
[8] B. Felbo, A. Mislove, A. Søgaard, I. Rahwan, and S. Lehmann. Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm. In M. Palmer, R. Hwa, and S. Riedel, editors, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017, Copenhagen, Denmark, September 9-11, 2017, pages 1615–1625. Association for Computational Linguistics, 2017.
[9] F. A. Gers, J. Schmidhuber, and F. A. Cummins. Learning to forget: Continual prediction with LSTM. Neural Computation, 12(10):2451–2471, 2000.
[10] A. Graves and J. Schmidhuber. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks, 18(5-6):602–610, 2005.
[11] J. Gu, Z. Lu, H. Li, and V. O. K. Li. Incorporating copying mechanism in sequence- to-sequence learning. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016, August 7-12, 2016, Berlin, Germany, Volume 1: Long Papers. The Association for Computer Linguistics, 2016.
[12] K. M. Hermann, T. Kocisky ́, E. Grefenstette, L. Espeholt, W. Kay, M. Suleyman, and P. Blunsom. Teaching machines to read and comprehend. In C. Cortes, N. D. Lawrence, D. D. Lee, M. Sugiyama, and R. Garnett, editors, Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Pro- cessing Systems 2015, December 7-12, 2015, Montreal, Quebec, Canada, pages 1693– 1701, 2015.
[13] M. Hu and B. Liu. Mining and summarizing customer reviews. In W. Kim, R. Ko- havi, J. Gehrke, and W. DuMouchel, editors, Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, Wash- ington, USA, August 22-25, 2004, pages 168–177. ACM, 2004.
[14] M. Jabreel and A. Moreno. Eitaka at semeval-2018 task 1: An ensemble of n-channels convnet and xgboost regressors for emotion analysis of tweets. In M. Apidianaki, S. M. Mohammad, J. May, E. Shutova, S. Bethard, and M. Carpuat, editors, Proceedings of The 12th International Workshop on Semantic Evaluation, SemEval@NAACL-HLT, New Orleans, Louisiana, June 5-6, 2018, pages 193–199. Association for Computa- tional Linguistics, 2018.
[15] S. Kim and E. H. Hovy. Determining the sentiment of opinions. In COLING 2004, 20th International Conference on Computational Linguistics, Proceedings of the Con- ference, 23-27 August 2004, Geneva, Switzerland, 2004.
[16] Y. Kim. Convolutional neural networks for sentence classification. In A. Moschitti, B. Pang, and W. Daelemans, editors, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014, October 25-29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL, pages 1746–1751. ACL, 2014.
[17] Z. Lei, Y. Yang, and M. Yang. SAAN: A sentiment-aware attention network for sentiment analysis. In K. Collins-Thompson, Q. Mei, B. D. Davison, Y. Liu, and E. Yilmaz, editors, The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, SIGIR 2018, Ann Arbor, MI, USA, July 08- 12, 2018, pages 1197–1200. ACM, 2018.
[18] B. Liu. Sentiment Analysis - Mining Opinions, Sentiments, and Emotions. Cam- bridge University Press, 2015.
[19] T. Luong, H. Pham, and C. D. Manning. E ective approaches to attention-based neural machine translation. In L. M`arquez, C. Callison-Burch, J. Su, D. Pighin, and Y. Marton, editors, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015, Lisbon, Portugal, September 17-21, 2015, pages 1412–1421. The Association for Computational Linguistics, 2015.
[20] T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed rep- resentations of words and phrases and their compositionality. In C. J. C. Burges, L. Bottou, Z. Ghahramani, and K. Q. Weinberger, editors, Advances in Neural In- formation Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013, Lake Tahoe, Nevada, United States., pages 3111–3119, 2013.
[21] S. Mohammad, F. Bravo-Marquez, M. Salameh, and S. Kiritchenko. Semeval-2018 task 1: A ect in tweets. In M. Apidianaki, S. M. Mohammad, J. May, E. Shutova, S. Bethard, and M. Carpuat, editors, Proceedings of The 12th International Workshop on Semantic Evaluation, SemEval@NAACL-HLT, New Orleans, Louisiana, June 5-6, 2018, pages 1–17. Association for Computational Linguistics, 2018.
[22] S. Mohammad, S. Kiritchenko, and X. Zhu. Nrc-canada: Building the state-of- the-art in sentiment analysis of tweets. In M. T. Diab, T. Baldwin, and M. Ba- roni, editors, Proceedings of the 7th International Workshop on Semantic Evaluation, SemEval@NAACL-HLT 2013, Atlanta, Georgia, USA, June 14-15, 2013, pages 321– 327. The Association for Computer Linguistics, 2013.
[23] H. Mulki, C. B. Ali, H. Haddad, and I. Babaoglu. Tw-star at semeval-2018 task 1: Preprocessing impact on multi-label emotion classification. In M. Apidianaki, S. M. Mohammad, J. May, E. Shutova, S. Bethard, and M. Carpuat, editors, Proceedings of The 12th International Workshop on Semantic Evaluation, SemEval@NAACL-HLT, New Orleans, Louisiana, June 5-6, 2018, pages 167–171. Association for Computa- tional Linguistics, 2018.
[24] T. Mullen and N. Collier. Sentiment analysis using support vector machines with diverse information sources. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing , EMNLP 2004, A meeting of SIGDAT, a Special Interest Group of the ACL, held in conjunction with ACL 2004, 25-26 July 2004, Barcelona, Spain, pages 412–418. ACL, 2004.
[25] B. Pang and L. Lee. Seeing stars: Exploiting class relationships for sentiment catego- rization with respect to rating scales. In K. Knight, H. T. Ng, and K. Oflazer, editors, ACL 2005, 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, 25-30 June 2005, University of Michigan, USA, pages 115–124. The Association for Computer Linguistics, 2005.
[26] B. Pang and L. Lee. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2(1-2):1–135, 2007.
[27] B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up? sentiment classification using machine learning techniques. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, EMNLP 2002, Philadelphia, PA, USA, July 6-7, 2002, 2002.
[28] Q.Qian,M.Huang,J.Lei,andX.Zhu.LinguisticallyregularizedLSTMforsentiment classification. In R. Barzilay and M. Kan, editors, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 1: Long Papers, pages 1679–1689. Association for Computational Linguistics, 2017.
[29] M. J. Seo, A. Kembhavi, A. Farhadi, and H. Hajishirzi. Bidirectional attention flow for machine comprehension. CoRR, abs/1611.01603, 2016.
[30] B. Shin, T. Lee, and J. D. Choi. Lexicon integrated CNN models with attention for sentiment analysis. In A. Balahur, S. M. Mohammad, and E. van der Goot, editors, Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sen- timent and Social Media Analysis, WASSA@EMNLP 2017, Copenhagen, Denmark, September 8, 2017, pages 149–158. Association for Computational Linguistics, 2017.
[31] R. Socher, A. Perelygin, J. Wu, J. Chuang, C. D. Manning, A. Y. Ng, and C. Potts. Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Pro- cessing, EMNLP 2013, 18-21 October 2013, Grand Hyatt Seattle, Seattle, Wash- ington, USA, A meeting of SIGDAT, a Special Interest Group of the ACL, pages 1631–1642. ACL, 2013.
[32] M. Speriosu, N. Sudan, S. Upadhyay, and J. Baldridge. Twitter polarity classification with label propagation over lexical links and the follower graph. In O. Abend, A. Ko- rhonen, A. Rappoport, and R. Reichart, editors, Proceedings of the First workshop on Unsupervised Learning in NLP@EMNLP 2011, Edinburgh, Scotland, July 30, 2011, pages 53–63. Association for Computational Linguistics, 2011.
[33] M. Taboada, J. Brooke, M. Tofiloski, K. D. Voll, and M. Stede. Lexicon-based methods for sentiment analysis. Computational Linguistics, 37(2):267–307, 2011.
[34] K. S. Tai, R. Socher, and C. D. Manning. Improved semantic representations from tree-structured long short-term memory networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, ACL 2015, July 26-31, 2015, Beijing, China, Volume 1: Long Papers, pages 1556–1566. The Association for Computer Linguistics, 2015.
[35] O. Uryupina, B. Plank, A. Severyn, A. Rotondi, and A. Moschitti. Sentube: A corpus for sentiment analysis on youtube social media. In N. Calzolari, K. Choukri, T. De- clerck, H. Loftsson, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, and S. Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, Reykjavik, Iceland, May 26-31, 2014., pages 4244–4249. European Language Resources Association (ELRA), 2014.
[36] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin. Attention is all you need. In I. Guyon, U. von Luxburg, S. Bengio, H. M. Wallach, R. Fergus, S. V. N. Vishwanathan, and R. Garnett, editors, Advances in Neural Information Processing Systems 30: Annual Conference on Neural Infor- mation Processing Systems 2017, 4-9 December 2017, Long Beach, CA, USA, pages 6000–6010, 2017.
[37] T. Wilson, J. Wiebe, and P. Ho mann. Recognizing contextual polarity in phrase- level sentiment analysis. In HLT/EMNLP 2005, Human Language Technology Con- ference and Conference on Empirical Methods in Natural Language Processing, Pro- ceedings of the Conference, 6-8 October 2005, Vancouver, British Columbia, Canada, pages 347–354. The Association for Computational Linguistics, 2005.
[38] K. Xu, J. Ba, R. Kiros, K. Cho, A. C. Courville, R. Salakhutdinov, R. S. Zemel, and Y. Bengio. Show, attend and tell: Neural image caption generation with vi- sual attention. In F. R. Bach and D. M. Blei, editors, Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, Lille, France, 6-11 July 2015, volume 37 of JMLR Workshop and Conference Proceedings, pages 2048–2057. JMLR.org, 2015.
[39] T. Yanase, K. Yanai, M. Sato, T. Miyoshi, and Y. Niwa. bunji at semeval-2016 task 5: Neural and syntactic models of entity-attribute relationship for aspect-based sentiment analysis. In S. Bethard, D. M. Cer, M. Carpuat, D. Jurgens, P. Nakov, and T. Zesch, editors, Proceedings of the 10th International Workshop on Semantic Evaluation, SemEval@NAACL-HLT 2016, San Diego, CA, USA, June 16-17, 2016, pages 289–295. The Association for Computer Linguistics, 2016.
[40] Z. Yang, X. He, J. Gao, L. Deng, and A. J. Smola. Stacked attention networks for image question answering. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016, pages 21–29. IEEE Computer Society, 2016.
[41] W. Yin and H. Schu ̈tze. An exploration of embeddings for generalized phrases. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014, June 22-27, 2014, Baltimore, MD, USA, Student Research Workshop, pages 41–47. The Association for Computer Linguistics, 2014.
[42] J. Yoon and H. Kim. Multi-channel lexicon integrated cnn-bilstm models for senti- ment analysis. In L. Ku and Y. Tsao, editors, Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, ROCLING 2017, Taipei, Taiwan, November 27-28, 2017, pages 244–253. The Association for Computational Linguis- tics and Chinese Language Processing (ACLCLP), 2017.
[43] L. Zhang, S. Wang, and B. Liu. Deep learning for sentiment analysis: A survey. Wiley Interdiscip. Rev. Data Min. Knowl. Discov., 8(4), 2018.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/71351-
dc.description.abstract情緒分析 (sentiment analysis) 是一個找出文字中的情緒與情感的重要任務,常用於分析句子中的情緒與感情。此問題常被視為一種分類的問題,利用深度神經網路模型可以達到很好的成果,注意力機制也被證實有很好的效果。再者,先前研究也指出情緒辭典對情緒分析問題上有很好的成效。然而,情緒辭典並沒有適當地被應用在先前的研究中。
本篇論文探索了情緒引導之注意力機制,以完整利用情緒辭典並將情緒辭典結合在注意力機制中,藉此幫助分類。我們提出兩種結合方法,第一,為了有效利用情緒辭典,我們轉換情緒詞典中的情緒值,使其成為一組增強注意力權重係數,以最小化原本模型內的注意力權重係數之錯誤。第二,我們提出了情緒多頭注意力機制,我們使用從情緒值轉換而來的注意力權重係數,做為第二組注意力頭,以協助模型關注更多資訊。我們實驗在六組情緒分析資料集上,結果顯示此方法準確度皆超越先前最佳的模型,相較於先前的分數提升0.12%到8.12%。
zh_TW
dc.description.abstractSentiment analysis is an important task, which extracts sentiment, emotion or affect in text. The problem is often treated as a classification problem for which deep neural methods have been well explored and attention mechanisms have generated promising performance. Studies have shown that lexicon is highly effective for sentiment analysis. However, lexicon has not been fully utilized by the previous methods. No existing method integrates lexicon into the attention mechanism effectively to solve the problem.
This thesis explores the sentiment-guided attention mechanism, which integrates lexicon into attention mechanism and proposes two approaches. First, to utilize sentiment lexicons, we transform lexicon values into guiding weights to minimize the error of attention weights. Second, we propose sentiment multi-head attention to help the model jointly attend to sentiment information provided by the transformed lexicon values.
Experiments show that our models outperform state-of-the-art models on six sentiment analysis benchmarks with improved accuracy of 0.12% to 8.12%.
en
dc.description.provenanceMade available in DSpace on 2021-06-17T05:59:15Z (GMT). No. of bitstreams: 1
ntu-108-R05944025-1.pdf: 2459603 bytes, checksum: 33c534ff8408c961359fb7046ab9fc6a (MD5)
Previous issue date: 2019
en
dc.description.tableofcontentsAcknowledgments ................................ i
Abstract ................................ iii
List of Figures ................................ viii
List of Tables ................................ x
Chapter 1 Introduction ................................ 1
1.1 Background ................................ 1
1.2 Motivation................................. 3
1.3 Objective ................................. 4
1.4 Outline of the Thesis........................... 4
Chapter 2 Related Work ................................ 5
2.1 Sentiment Analysis ............................ 5
2.2 Attention Mechanism ........................... 6
2.3 Neural Model with Sentiment Lexicon .................. 7
Chapter 3 Sentiment Analysis with Lexicon ................................ 9
3.1 Problem Statement ............................ 9
3.2 Proposed Solution ............................. 11
Chapter 4 Sentiment-Guided Attention Mechanism ................................ 14
4.1 System Architecture ........................... 14
4.2 The Baseline Attention-Based Model .................. 15
4.3 Sentiment-Guided Weights Generation Model ................................ 17
4.4 Sentiment Boosted Attention Approach ................................ 19
4.4.1 Hard Boost Method ....................... 19
4.4.2 Soft Boost Method ........................ 21
4.4.3 Merge Boost Method ................................ 23
4.5 Sentiment Multi-Head Attention Approach ................................ 24
Chapter 5 Experiments ................................ 27
5.1 Lexicons and Datasets .......................... 27
5.1.1 Lexicons ................................ 27
5.1.2 Stanford Sentiment Treebank ................................ 29
5.1.3 OpenNER ................................ 29
5.1.4 Sentube Datasets ................................ 29
5.1.5 SemEval-2018 Task1: Affect in Tweets ................................ 30
5.2 Experimental Setup ................................ 30
5.3 Experimental Results ................................ 31
5.4 Model Analysis ................................ 33
5.4.1 Effect of Sentiment Boosted Attention ................................ 33
5.4.2 Effect of Sentiment Multi-Head Attention ................................ 36
5.4.3 Randomness in Deep Learning ................................ 38
5.4.4 Robustness of the Model ................................ 40
Chapter 6 Conclusion ................................ 42
6.1 Summary of Contributions ................................ 42
6.2 Future Work ................................ 43
Bibliography ................................ 44
dc.language.isoen
dc.subject情感分析zh_TW
dc.subject情緒分類zh_TW
dc.subject情緒辭典zh_TW
dc.subject注意力機制zh_TW
dc.subject類神經網路zh_TW
dc.subjectLexiconen
dc.subjectAttention Mechanismen
dc.subjectSentiment Analysisen
dc.subjectEmotion Classificationen
dc.subjectNeural Networksen
dc.title以情緒導引強化注意力機制於情緒分析之研究zh_TW
dc.titleSentiment-Guided Attention Mechanism for Sentiment Analysisen
dc.typeThesis
dc.date.schoolyear107-1
dc.description.degree碩士
dc.contributor.oralexamcommittee劉昭麟,李宏毅,古倫維,陳維超
dc.subject.keyword情感分析,情緒分類,情緒辭典,注意力機制,類神經網路,zh_TW
dc.subject.keywordSentiment Analysis,Emotion Classification,Lexicon,Attention Mechanism,Neural Networks,en
dc.relation.page52
dc.identifier.doi10.6342/NTU201900557
dc.rights.note有償授權
dc.date.accepted2019-02-14
dc.contributor.author-college電機資訊學院zh_TW
dc.contributor.author-dept資訊網路與多媒體研究所zh_TW
Appears in Collections:資訊網路與多媒體研究所

Files in This Item:
File SizeFormat 
ntu-108-1.pdf
  Restricted Access
2.4 MBAdobe PDF
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved