請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/74355
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 李允中 | |
dc.contributor.author | Chih-Yu Shao | en |
dc.contributor.author | 邵志宇 | zh_TW |
dc.date.accessioned | 2021-06-17T08:31:25Z | - |
dc.date.available | 2024-08-15 | |
dc.date.copyright | 2019-08-15 | |
dc.date.issued | 2019 | |
dc.date.submitted | 2019-08-12 | |
dc.identifier.citation | [1] M. M. Breunig, H.-P. Kriegel, R. T. Ng, and J. Sander. Lof: identifying density-based local outliers. In ACM sigmod record, volume 29, pages 93–104. ACM, 2000.
[2] R. Chinnici, M. Gudgin, J. J. Moreau, and S. Weerawarana. Web services description language (WSDL) version 1.2 w3c working draft. W3C, 9 July 2002. [3] M. Crasso, A. Zunino, and M. Campo. A survey of approaches to web service dis- covery in service-oriented architectures. Journal of Database Management (JDM), 22(1):102–132, 2011. [4] M. Fabian, K. Gjergji, W. Gerhard, et al. Yago: A core of semantic knowledge unifying wordnet and wikipedia. In 16th International World Wide Web Conference, WWW, pages 697–706, 2007. [5] M. Faruqui, J. Dodge, S. K. Jauhar, C. Dyer, E. Hovy, and N. A. Smith. Retrofitting word vectors to semantic lexicons. arXiv preprint arXiv:1411.4166, 2014. [6] C. Fellbaum, A. Osherson, and P. Clark. Putting semantics into wordnet’s ”mor- phosemantic” links. pages 350–358, 10 2007. [7] M. Friedman. The use of ranks to avoid the assumption of normality implicit in the analysis of variance. Journal of the american statistical association, 32(200):675–701, 1937. [8] J. Ganitkevitch, B. Van Durme, and C. Callison-Burch. Ppdb: The paraphrase database. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 758–764, 2013. [9] J. Goikoetxea, E. Agirre, and A. Soroa. Single or multiple? combining word repre- sentations independently learned from text and wordnet. In AAAI, pages 2608–2614, 2016. [10] M. A. Hearst, S. T. Dumais, E. Osuna, J. Platt, and B. Scholkopf. Support vector machines. IEEE Intelligent Systems and their applications, 13(4):18–28, 1998. [11] M. Klusch. Overview of the s3 contest: Performance evaluation of semantic service matchmakers. In Semantic web services, pages 17–34. Springer, 2012. [12] M. Klusch and P. Kapahnke. The isem matchmaker: A flexible approach for adaptive hybrid semantic service selection. Web Semantics: Science, Services and Agents on the World Wide Web, 15:1–14, 2012. [13] M. Klusch, P. Kapahnke, S. Schulte, F. Lecue, and A. Bernstein. Semantic web service search: a brief survey. KI-Ku ̈nstliche Intelligenz, 30(2):139–147, 2016. [14] Y.-Y. Lee, T.-Y. Yen, H.-H. Huang, and H.-H. Chen. Structural-fitting word vectors to linguistic ontology for semantic relatedness measurement. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, CIKM ’17, pages 2151–2154, New York, NY, USA, 2017. ACM. [15] L. McInnes, J. Healy, and S. Astels. hdbscan: Hierarchical density based clustering. The Journal of Open Source Software, 2(11):205, 2017. [16] T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed repre- sentations of words and phrases and their compositionality. In Advances in neural information processing systems, pages 3111–3119, 2013. [17] G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. J. Miller. Introduction to wordnet: An on-line lexical database. International journal of lexicography, 3(4):235–244, 1990. [18] N. Mrkˇsi ́c, D. O. S ́eaghdha, B. Thomson, M. Gaˇsi ́c, L. Rojas-Barahona, P.-H. Su, D. Vandyke, T.-H. Wen, and S. Young. Counter-fitting word vectors to linguistic constraints. arXiv preprint arXiv:1603.00892, 2016. [19] A. Murom ̈agi, K. Sirts, and S. Laur. Linear ensembles of word embedding models. arXiv preprint arXiv:1704.01419, 2017. [20] Owls-tc. http://projects.semwebcentral.org/projects/owls-tc/. [21] J. Pennington, R. Socher, and C. Manning. Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pages 1532–1543, 2014. [22] P. J. Rousseeuw. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of computational and applied mathematics, 20:53–65, 1987. [23] S. S. Shapiro and M. B. Wilk. An analysis of variance test for normality (complete samples). Biometrika, 52(3/4):591–611, 1965. [24] V. Torra. The weighted owa operator. International Journal of Intelligent Systems, 12(2):153–166, 1997. [25] Wikipedia. Plagiarism — Wikipedia, the free encyclopedia, 2004. [Online; accessed 22-July-2004]. [26] R. R. Yager. On ordered weighted averaging aggregation operators in multicriteria decisionmaking. IEEE Transactions on Systems, Man, and Cybernetics, 18(1):183–190, Jan 1988. [27] M. Yu, M. Gormley, and M. Dredze. Factor-based compositional embedding models. In NIPS Workshop on Learning Semantics, pages 95–101, 2014. [28] M. Yu, M. R. Gormley, and M. Dredze. Combining word embeddings and feature embeddings for fine-grained relation extraction. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 1374–1379, 2015. | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/74355 | - |
dc.description.abstract | 隨著網路服務的數目急遽地增加,如何有效率地從眾多的網路服務中找出符合 使用者需求的服務變得很重要。 在純文本服務匹配中,網路服務描述語言被視為 純文本,會從中提取關鍵字來表示服務, 然後通過預先訓練的Word2Vec模型轉 換成文字向量。 然而,在Word2Vec的模型中存在一個問題,因為文字向量是用 文字的上下文來訓練,這會導致兩個意義不同的文字有相似的表示方式。
此研究中,使用了管線調和方法讓文字向量和文字關係組合來改善文字向量, 而此方法是會賦予每個管線Retrofitting和Counter-fitting,並檢查這些管線的內容 和組合順序之後,才是最後使用的方法。 我們的方法在OWLS-TC V4的表現為MAP=0.9307,跟先前的研究比起來這是 最好的結果。 | zh_TW |
dc.description.abstract | Recently, as the number of web services has been increasing tremendously, it becomes essential to find a web service from numerous service providers to meet users’ needs in a more effective way. In plain text service matchmaking, WSDL is treated as a plain text, keywords are extracted from WSDL, used as the service representation, and then converted into vector by means of pre-trained Word2Vec model. However, there is a main problem with Word2Vec model, that is, the pre- trained word vectors are based upon the context of words which will result in similar representations for two very different words.
In this work, Word2Vec word vectors are improved by combining word relation information by means of a pipeline fitting approach to assigning to each pipe the Retrofitting and Counter-fitting with a density-based definition of neighbors and to combining these pipes into a pipeline by examing the content and the order of each pipe before adding to the pipeline. The performance of our approach to the benchmark OWLS-TC V4 is MAP=0.9307, which is the best performance comparing with previous research works. | en |
dc.description.provenance | Made available in DSpace on 2021-06-17T08:31:25Z (GMT). No. of bitstreams: 1 ntu-108-R06944036-1.pdf: 2153301 bytes, checksum: 9e9ee42011ab3e20e1498e212a736cb7 (MD5) Previous issue date: 2019 | en |
dc.description.tableofcontents | 誌謝 ii
摘要 iii Abstract iv List of Figures viii List of Tables x Chapter 1 Introduction 1 Chapter 2 Related Work 5 2.1 WordRepresentations .......................... 5 2.2 VectorCombination............................ 7 2.3 ReferenceData .............................. 9 2.4 PipelinePattern.............................. 11 2.5 OWAOperator .............................. 11 2.6 ServiceMatchmaking........................... 12 Chapter 3 Pipeline Fitting 13 3.1 EntityLinking............................... 15 3.2 RelationAnalysis............................. 21 3.3 PipeAnalysis ............................... 23 Chapter 4 Service Matchmaker 31 4.1 KeywordExtractor ............................ 32 4.2 VectorCombiner ............................. 33 4.3 SimilarityCalculator ........................... 34 Chapter 5 Experiments 36 5.1 EvaluationBenchmark .......................... 36 5.2 Word Representation and Relational Information . . . . . . . . . . . 38 5.3 ExperimentResults............................ 45 5.4 PerformanceAnalysis........................... 47 5.5 Discussion................................. 52 Chapter 6 Conclusion 53 Bibliography 54 | |
dc.language.iso | en | |
dc.title | 一種加強網路服務描述語言配對的管線調合方法 | zh_TW |
dc.title | A Pipeline Fitting Approach to Enhancing WSDL Matchmaking | en |
dc.type | Thesis | |
dc.date.schoolyear | 107-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 蘇木春,郭忠義,劉建宏,馬尚彬 | |
dc.subject.keyword | 網路服務,服務比對,文字關係,文字向量,向量結合, | zh_TW |
dc.subject.keyword | Web Service,Service Matchmaking,Word Relation,Word Vector,Vector Combination, | en |
dc.relation.page | 57 | |
dc.identifier.doi | 10.6342/NTU201903020 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2019-08-12 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 資訊網路與多媒體研究所 | zh_TW |
顯示於系所單位: | 資訊網路與多媒體研究所 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-108-1.pdf 目前未授權公開取用 | 2.1 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。