Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 電機工程學系
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/31110
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor顏嗣鈞
dc.contributor.authorYU-FANG WUen
dc.contributor.author吳於芳zh_TW
dc.date.accessioned2021-06-13T02:30:19Z-
dc.date.available2007-02-02
dc.date.copyright2007-02-02
dc.date.issued2007
dc.date.submitted2007-01-25
dc.identifier.citation[1] P. Bille. A Survey on Tree Edit Distance and Related Problems. Theoretical Computer Science, 337(1-3): 217-239, 2005.
[2] Kaizhong Zhang and Dennis Shasha. Simple Fast Algorithms for the Editing Distance between Trees and Related Problems. SIAM Journal of Computing, 18: 1245–1262, 1989.
[3] P.N. Klein. Computing the Edit-distance between Unrooted Ordered Trees. Proceedings of the 6th annual European Symposium on Algorithms (ESA) 1998., pages 91–102. Springer-Verlag, 1998.
[4] A. Micheli and D. Rossin. Edit Distance Between Unlabeled Ordered Trees. RAIRO - Theoretical Informatics and Applications, 40: 593-609, 2006,
[5] http://www.w3.org/XML
[6] M. Garofalakis and A. Kumar. XML Stream Processing Using Tree-Edit Distance Embeddings. ACM Transactions on Database Systems, 30(1): 279-332, 2005.
[7] G. Cormode and S. Muthukrishnan. The String Edit Distance Matching Problem with Moves. Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms, 667-676, 2002.
[8] T. Grust and M. van Keulen. Tree Awareness for Relational DBMS Kernels: Staircase Join. Intelligent Search on XML Data, 231-245, 2003.
[9] P. Boncz, T. Grust, M. van Keulen, S. Manegold, J. Rittinger, and J. Teubner. MonetDB/XQuery: A Fast XQuery Processor Powered by a Relational Engine. Proceedings of the 2006 ACM SIGMOD international conference on Management of data, 479-490, 2006.
[10] P. Boncz, S. Manegold, J. Rittinger. Updating the Pre/Post Plane in MonetDB/XQuery. Informal Proceedings of the Second International Workshop on XQuery Implementation, Experience, and Perspectives (XIME-P), June 16- 17, 2005.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/31110-
dc.description.abstract在資訊科學研究中,樹狀結構比較問題廣泛地出現在各種領域,其中也包括了可擴展標記語言文件的處理。而在解決樹狀結構比較問題時,樹狀結構編輯距離是一個常見且重要,並可用來量化樹狀結構差異的方法。因此,在比較大量串流可擴展標記語言文件時,有效率的樹狀結構嵌入演算法就相當重要。
在本篇論文中,我們提出一個新的樹狀距離嵌入演算法,用來比較從串流可擴展標記語言文件中取得的無標記有序樹狀結構。與最近的研究相較,我們的貢獻是在不增加時間及空間複雜度的情況下,簡化得到樹狀距離的過程。同時我們也分析了這個演算法在樹狀結構編輯距離上造成失真的上下界以及誤差的機率。
zh_TW
dc.description.abstractThe problem of comparing tree structures occurs in various areas in computer science and engineering, including the application to XML data processing. To solve this problem, tree edit distance is a common and significant measurement defining the difference between two tree structures quantitatively. Efficient tree edit distance embedding algorithms are therefore of significant importance in comparing large streaming XML document trees. In this thesis, we propose a new algorithm to obtain edit distance between unlabeled ordered trees derived from streaming XML data. In comparison with the previous work, our contribution lies in simplifying the procedure of obtaining the tree edit distance without increasing the time and space complexities. The upper and lower bounds as well as the error probability of our algorithm are also analyzed in this thesis.en
dc.description.provenanceMade available in DSpace on 2021-06-13T02:30:19Z (GMT). No. of bitstreams: 1
ntu-96-R93921142-1.pdf: 513817 bytes, checksum: 5bb5b6f526607ee067e7cf975a6ac8da (MD5)
Previous issue date: 2007
en
dc.description.tableofcontentsAcknowledgement……i
Abstract (Chinese)……ii
Abstract (English)……iii
Contents……iv
List of Figures……vi
List of Tables……viii
Chapter 1 Introduction……1
Chapter 2 Preliminaries……5
Section 2.1 Tree Edit Distance……5
2.1.1 Definitions……6
2.1.2 Algorithms to Obtain Tree Edit Distance……9
2.1.3 Edit Distance between Unlabeled Trees……9
Section 2.2 XML Stream Processing……11
2.2.1 XML and Tree Structures……12
2.2.2 Data Stream and Pseudorandom Sketching……13
2.2.3 A Tree Edit Distance Embedding Algorithm……15
Section 2.3 Staircase Join……20
2.3.1 Pre/Post Plane Encoding of XML Document Trees……20
2.3.2 Relational XML Algebra……23
Chapter 3 Modified Tree Edit Distance Embedding Algorithm……27
Section 3.1 Motif of Modification……27
Section 3.2 The First Modified Algorithm……28
Section 3.3 Upper and Lower Bounds……29
3.3.1 Effect from Insertion and Deletion……30
3.3.2 Effect from Subtree-Moving……34
3.3.3 Ambiguities……40
Section 3.4 The Second Modified Algorithm……43
3.4.1 Definition……43
3.4.2 Analysis of Error Probabilities……44
3.4.3 Comparison with the Original Algorithm……47
Chapter 4 Tree Edit Distance on Pre/Post Plane Encoding……48
Section 4.1 Effect from Insertion and Deletion……48
Section 4.2 Effect from Subtree-Swapping……50
Section 4.3 Analysis……52
Chapter 5 Combination of Modified Tree Edit Distance Embedding Algorithm and Pre/Post Plane Encoding……55
Chapter 6 Conclusions and Future Works……60
References……62
dc.language.isoen
dc.subject可擴展標記語言文件zh_TW
dc.subject無標記有序樹狀結構zh_TW
dc.subject串流zh_TW
dc.subject演算法zh_TW
dc.subject樹狀結構編輯距離zh_TW
dc.title樹狀結構編輯距離在可擴展標記語言文件上之分析研究zh_TW
dc.titleAnalysis of Tree Edit Distance on XML Dataen
dc.typeThesis
dc.date.schoolyear95-1
dc.description.degree碩士
dc.contributor.oralexamcommittee雷欽隆,王勝德,郭斯彥,王柏堯
dc.subject.keyword樹狀結構編輯距離,可擴展標記語言文件,演算法,串流,無標記有序樹狀結構,zh_TW
dc.subject.keywordTree Edit Distance,XML,Algorithm,Streaming,Unlabeled Ordered Trees,en
dc.relation.page64
dc.rights.note有償授權
dc.date.accepted2007-01-25
dc.contributor.author-college電機資訊學院zh_TW
dc.contributor.author-dept電機工程學研究所zh_TW
顯示於系所單位:電機工程學系

文件中的檔案:
檔案 大小格式 
ntu-96-1.pdf
  未授權公開取用
501.77 kBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved