一種加強網路服務描述語言配對的管線調合方法

Chih-Yu Shao; 邵志宇

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/74355

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.advisor	李允中
dc.contributor.author	Chih-Yu Shao	en
dc.contributor.author	邵志宇	zh_TW
dc.date.accessioned	2021-06-17T08:31:25Z	-
dc.date.available	2024-08-15
dc.date.copyright	2019-08-15
dc.date.issued	2019
dc.date.submitted	2019-08-12
dc.identifier.citation	[1] M. M. Breunig, H.-P. Kriegel, R. T. Ng, and J. Sander. Lof: identifying density-based local outliers. In ACM sigmod record, volume 29, pages 93–104. ACM, 2000. [2] R. Chinnici, M. Gudgin, J. J. Moreau, and S. Weerawarana. Web services description language (WSDL) version 1.2 w3c working draft. W3C, 9 July 2002. [3] M. Crasso, A. Zunino, and M. Campo. A survey of approaches to web service dis- covery in service-oriented architectures. Journal of Database Management (JDM), 22(1):102–132, 2011. [4] M. Fabian, K. Gjergji, W. Gerhard, et al. Yago: A core of semantic knowledge unifying wordnet and wikipedia. In 16th International World Wide Web Conference, WWW, pages 697–706, 2007. [5] M. Faruqui, J. Dodge, S. K. Jauhar, C. Dyer, E. Hovy, and N. A. Smith. Retrofitting word vectors to semantic lexicons. arXiv preprint arXiv:1411.4166, 2014. [6] C. Fellbaum, A. Osherson, and P. Clark. Putting semantics into wordnet’s ”mor- phosemantic” links. pages 350–358, 10 2007. [7] M. Friedman. The use of ranks to avoid the assumption of normality implicit in the analysis of variance. Journal of the american statistical association, 32(200):675–701, 1937. [8] J. Ganitkevitch, B. Van Durme, and C. Callison-Burch. Ppdb: The paraphrase database. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 758–764, 2013. [9] J. Goikoetxea, E. Agirre, and A. Soroa. Single or multiple? combining word repre- sentations independently learned from text and wordnet. In AAAI, pages 2608–2614, 2016. [10] M. A. Hearst, S. T. Dumais, E. Osuna, J. Platt, and B. Scholkopf. Support vector machines. IEEE Intelligent Systems and their applications, 13(4):18–28, 1998. [11] M. Klusch. Overview of the s3 contest: Performance evaluation of semantic service matchmakers. In Semantic web services, pages 17–34. Springer, 2012. [12] M. Klusch and P. Kapahnke. The isem matchmaker: A flexible approach for adaptive hybrid semantic service selection. Web Semantics: Science, Services and Agents on the World Wide Web, 15:1–14, 2012. [13] M. Klusch, P. Kapahnke, S. Schulte, F. Lecue, and A. Bernstein. Semantic web service search: a brief survey. KI-Ku ̈nstliche Intelligenz, 30(2):139–147, 2016. [14] Y.-Y. Lee, T.-Y. Yen, H.-H. Huang, and H.-H. Chen. Structural-fitting word vectors to linguistic ontology for semantic relatedness measurement. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, CIKM ’17, pages 2151–2154, New York, NY, USA, 2017. ACM. [15] L. McInnes, J. Healy, and S. Astels. hdbscan: Hierarchical density based clustering. The Journal of Open Source Software, 2(11):205, 2017. [16] T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed repre- sentations of words and phrases and their compositionality. In Advances in neural information processing systems, pages 3111–3119, 2013. [17] G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. J. Miller. Introduction to wordnet: An on-line lexical database. International journal of lexicography, 3(4):235–244, 1990. [18] N. Mrkˇsi ́c, D. O. S ́eaghdha, B. Thomson, M. Gaˇsi ́c, L. Rojas-Barahona, P.-H. Su, D. Vandyke, T.-H. Wen, and S. Young. Counter-fitting word vectors to linguistic constraints. arXiv preprint arXiv:1603.00892, 2016. [19] A. Murom ̈agi, K. Sirts, and S. Laur. Linear ensembles of word embedding models. arXiv preprint arXiv:1704.01419, 2017. [20] Owls-tc. http://projects.semwebcentral.org/projects/owls-tc/. [21] J. Pennington, R. Socher, and C. Manning. Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pages 1532–1543, 2014. [22] P. J. Rousseeuw. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of computational and applied mathematics, 20:53–65, 1987. [23] S. S. Shapiro and M. B. Wilk. An analysis of variance test for normality (complete samples). Biometrika, 52(3/4):591–611, 1965. [24] V. Torra. The weighted owa operator. International Journal of Intelligent Systems, 12(2):153–166, 1997. [25] Wikipedia. Plagiarism — Wikipedia, the free encyclopedia, 2004. [Online; accessed 22-July-2004]. [26] R. R. Yager. On ordered weighted averaging aggregation operators in multicriteria decisionmaking. IEEE Transactions on Systems, Man, and Cybernetics, 18(1):183–190, Jan 1988. [27] M. Yu, M. Gormley, and M. Dredze. Factor-based compositional embedding models. In NIPS Workshop on Learning Semantics, pages 95–101, 2014. [28] M. Yu, M. R. Gormley, and M. Dredze. Combining word embeddings and feature embeddings for fine-grained relation extraction. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 1374–1379, 2015.
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/74355	-
dc.description.abstract	隨著網路服務的數目急遽地增加，如何有效率地從眾多的網路服務中找出符合使用者需求的服務變得很重要。在純文本服務匹配中，網路服務描述語言被視為純文本，會從中提取關鍵字來表示服務，然後通過預先訓練的Word2Vec模型轉換成文字向量。然而，在Word2Vec的模型中存在一個問題，因為文字向量是用文字的上下文來訓練，這會導致兩個意義不同的文字有相似的表示方式。此研究中，使用了管線調和方法讓文字向量和文字關係組合來改善文字向量，而此方法是會賦予每個管線Retrofitting和Counter-fitting，並檢查這些管線的內容和組合順序之後，才是最後使用的方法。我們的方法在OWLS-TC V4的表現為MAP=0.9307，跟先前的研究比起來這是最好的結果。	zh_TW
dc.description.abstract	Recently, as the number of web services has been increasing tremendously, it becomes essential to find a web service from numerous service providers to meet users’ needs in a more effective way. In plain text service matchmaking, WSDL is treated as a plain text, keywords are extracted from WSDL, used as the service representation, and then converted into vector by means of pre-trained Word2Vec model. However, there is a main problem with Word2Vec model, that is, the pre- trained word vectors are based upon the context of words which will result in similar representations for two very different words. In this work, Word2Vec word vectors are improved by combining word relation information by means of a pipeline fitting approach to assigning to each pipe the Retrofitting and Counter-fitting with a density-based definition of neighbors and to combining these pipes into a pipeline by examing the content and the order of each pipe before adding to the pipeline. The performance of our approach to the benchmark OWLS-TC V4 is MAP=0.9307, which is the best performance comparing with previous research works.	en
dc.description.provenance	Made available in DSpace on 2021-06-17T08:31:25Z (GMT). No. of bitstreams: 1 ntu-108-R06944036-1.pdf: 2153301 bytes, checksum: 9e9ee42011ab3e20e1498e212a736cb7 (MD5) Previous issue date: 2019	en
dc.description.tableofcontents	誌謝 ii 摘要 iii Abstract iv List of Figures viii List of Tables x Chapter 1 Introduction 1 Chapter 2 Related Work 5 2.1 WordRepresentations .......................... 5 2.2 VectorCombination............................ 7 2.3 ReferenceData .............................. 9 2.4 PipelinePattern.............................. 11 2.5 OWAOperator .............................. 11 2.6 ServiceMatchmaking........................... 12 Chapter 3 Pipeline Fitting 13 3.1 EntityLinking............................... 15 3.2 RelationAnalysis............................. 21 3.3 PipeAnalysis ............................... 23 Chapter 4 Service Matchmaker 31 4.1 KeywordExtractor ............................ 32 4.2 VectorCombiner ............................. 33 4.3 SimilarityCalculator ........................... 34 Chapter 5 Experiments 36 5.1 EvaluationBenchmark .......................... 36 5.2 Word Representation and Relational Information . . . . . . . . . . . 38 5.3 ExperimentResults............................ 45 5.4 PerformanceAnalysis........................... 47 5.5 Discussion................................. 52 Chapter 6 Conclusion 53 Bibliography 54
dc.language.iso	en
dc.subject	網路服務	zh_TW
dc.subject	服務比對	zh_TW
dc.subject	文字關係	zh_TW
dc.subject	文字向量	zh_TW
dc.subject	向量結合	zh_TW
dc.subject	Word Relation	en
dc.subject	Service Matchmaking	en
dc.subject	Vector Combination	en
dc.subject	Web Service	en
dc.subject	Word Vector	en
dc.title	一種加強網路服務描述語言配對的管線調合方法	zh_TW
dc.title	A Pipeline Fitting Approach to Enhancing WSDL Matchmaking	en
dc.type	Thesis
dc.date.schoolyear	107-2
dc.description.degree	碩士
dc.contributor.oralexamcommittee	蘇木春,郭忠義,劉建宏,馬尚彬
dc.subject.keyword	網路服務,服務比對,文字關係,文字向量,向量結合,	zh_TW
dc.subject.keyword	Web Service,Service Matchmaking,Word Relation,Word Vector,Vector Combination,	en
dc.relation.page	57
dc.identifier.doi	10.6342/NTU201903020
dc.rights.note	有償授權
dc.date.accepted	2019-08-12
dc.contributor.author-college	電機資訊學院	zh_TW
dc.contributor.author-dept	資訊網路與多媒體研究所	zh_TW
顯示於系所單位：	資訊網路與多媒體研究所

文件中的檔案：

檔案	大小	格式
ntu-108-1.pdf 未授權公開取用	2.1 MB	Adobe PDF

顯示文件簡單紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。