Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 電機工程學系
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/48133
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor王勝德(Sheng-De Wang)
dc.contributor.authorCheng-Han Youen
dc.contributor.author游承翰zh_TW
dc.date.accessioned2021-06-15T06:47:00Z-
dc.date.available2011-07-06
dc.date.copyright2011-07-06
dc.date.issued2011
dc.date.submitted2011-06-15
dc.identifier.citation[1] A. Slominski, 'Xml pull parsing,' http://http://www.xmlpull.org/, 2004.
[2] M. L. Noga, et al., 'Lazy XML processing,' presented at the Proceedings of the 2002 ACM symposium on Document engineering, McLean, Virginia, USA, 2002.
[3] W. M. Löwe, et al., 'Foundations of Fast Communication via XML,' Annals of Software Engineering, vol. 13, pp. 357-379, 2002.
[4] Z. Dai, et al., 'A 1 cycle-per-byte XML parsing accelerator,' presented at the Proceedings of the 18th annual ACM/SIGDA international symposium on Field programmable gate arrays, Monterey, California, USA, 2010.
[5] N. Bruno, et al., 'Holistic twig joins: optimal XML pattern matching,' presented at the Proceedings of the 2002 ACM SIGMOD international conference on Management of data, Madison, Wisconsin, 2002.
[6] S. Chen, et al., 'Twig2Stack: bottom-up processing of generalized-tree-pattern queries over XML documents,' presented at the Proceedings of the 32nd international conference on Very large data bases, Seoul, Korea, 2006.
[7] L. Qin, et al., 'Twiglist: make twig pattern matching fast,' presented at the Proceedings of the 12th international conference on Database systems for advanced applications, Bangkok, Thailand, 2007.
[8] W. Lu, et al., 'A Parallel Approach to XML Parsing,' in Grid Computing, 7th IEEE/ACM International Conference on, 2006, pp. 223-230.
[9] Y. Pan, et al., 'A Static Load-Balancing Scheme for Parallel XML Parsing on Multicore CPUs,' in Cluster Computing and the Grid, 2007. CCGRID 2007. Seventh IEEE International Symposium on, 2007, pp. 351-362.
[10] Y. Pan, et al., 'Simultaneous transducers for data-parallel XML parsing,' in Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on, 2008, pp. 1-12.
[11] Y. Pan, et al., 'Hybrid Parallelism for XML SAX Parsing,' in Web Services, 2008. ICWS '08. IEEE International Conference on, 2008, pp. 505-512.
[12] Y. Pan, et al., 'Parallel XML Parsing Using Meta-DFAs,' in e-Science and Grid Computing, IEEE International Conference on, 2007, pp. 237-244.
[13] X. Li, et al., 'Key Elements Tracing Method for Parallel XML Parsing in Multi-Core System,' in Parallel and Distributed Computing, Applications and Technologies, 2009 International Conference on, 2009, pp. 439-444.
[14] C. Rongxin and C. Weibin, 'A parallel solution to XML query application,' in Computer Science and Information Technology (ICCSIT), 2010 3rd IEEE
30
International Conference on, 2010, pp. 542-546.
[15] R. Bordawekar, et al., 'Parallelization of XPath queries using multi-core processors: challenges and experiences,' presented at the Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, Saint Petersburg, Russia, 2009.
[16] S. B. A. Berglund, D. Chamberlin, et al, 'XML path language(XPath)2.0,' http://www.w3.org/TR/XPath20, 2007.
[17] J. Zhang, 'VTD-XML: The Future of XML Processing,' http://vtd-xml.sourceforge.net/, 2004.
[18] D. Veillard, 'Libxml2 project web page,' http://xmlsoft.org/, 2004.
[19] G. Miklau, 'UW XML Repository,' http://www.cs.washington.edu/research/xmldatasets, 2001.
[20] W3c, 'Extensible Markup Language (XML) 1.0 (Third Edition),' http://www.w3.org/TR/2004/REC-xml-20040204/, 2004.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/48133-
dc.description.abstract目前在使用數據平行方法來改善可擴展標記語言(XML)之語法分析上遇到了一個很大的問題,就是在分割文件上。現今的辦法都需要一個預先處理器來對文件做一個初步了解,才能解決這個問題。本文提出了不需使用預先處理的直接平行方法來解決這個問題。在直接平行方法中我們藉由找尋燈塔來開始語法分析,而所謂燈塔指的是一種只有少數例外的特定字元。接著我們在語法分析中藉由找尋線索以及在某些情況下重新解析必要的區間來處理例外狀況。接著我們提出了非同步之分割器來改善查詢可擴展標記語言之路徑語言的速度。在非同步之分割器中我們在不需要各個線程溝通的情況下查詢,並且藉由一種名稱為先代表格的資料結構來幫助各個線程獨立運作,讓我們可以把查詢路徑語言的工作分散到各個線程中。實驗數據顯示出我們提出的這兩個方法不論是用在空間小的文件或是空間大的文件上,都可以有效的利用多核心架構的資源。zh_TW
dc.description.abstractData-parallel XML parsing has a crucial problem in partitioning XML documents. Existing approaches need a pre-parse step to determine the partitions. In this paper, we propose a direct parallel method to solve this problem without pre-parsing. In the direct parallel method, we directly start the parallel parsing by finding the “light tower”, which is a particular character with some exceptions, called clues. We handle the exceptions by watching the clues and reparsing the partition if it is required in the parsing stage. We also propose a non-synchronized splitter approach to the parallel XML querying using XPath expressions. In the non-synchronized splitter approach, we split an XPath expression into pieces to be executed by threads and we use a data structure, called the ancestor table, to help each thread handle its part of XPath expression independently without communications between threads. Our experiments show that our approach scales well from small sized files to huge sized files.en
dc.description.provenanceMade available in DSpace on 2021-06-15T06:47:00Z (GMT). No. of bitstreams: 1
ntu-100-R98921069-1.pdf: 1339898 bytes, checksum: 0434e511aced962be0fac39634430025 (MD5)
Previous issue date: 2011
en
dc.description.tableofcontents口試委員審定書 ................................................................................................................ i
誌謝 .................................................................................................................................. ii
摘要 ................................................................................................................................. iii
Abstract ............................................................................................................................. iv
Chapter 1 Introduction ....................................................................................................... 1
1.1 Background .......................................................................................................... 1
1.2 Related Work ....................................................................................................... 2
1.3 Contributions ....................................................................................................... 3
1.4 Thesis Organization ............................................................................................. 5
Chapter 2 Direct Parallel Parsing ............................................................................... 6
2.1 Overview ................................................................................................... 6
2.2 Parallel Parsing .......................................................................................... 7
2.3 Well-formed Checking ............................................................................ 13
2.4 Table Merging .......................................................................................... 14
Chapter 3 Multi-core XPATH Architecture .............................................................. 15
3.1 Overview ................................................................................................. 15
3.2 The Ancestor Table .................................................................................. 15
3.3 Non-synchronized Splitter ....................................................................... 17
Chapter 4 System Architecture ................................................................................. 18
4.1 Environment ............................................................................................ 18
4.2 XML Parsing ........................................................................................... 18
4.3 XML Querying ........................................................................................ 22
Chapter 5 Discussion ................................................................................................ 26
vi
Chapter 6 Conclusions ............................................................................................. 27
Chapter 7 Future Work ............................................................................................. 28
References ....................................................................................................................... 29
dc.language.isoen
dc.subject虛擬標記描述子zh_TW
dc.subject平行處理zh_TW
dc.subject直接平行處理zh_TW
dc.subject可擴展標記語言zh_TW
dc.subject可擴展標記語言的路徑語言zh_TW
dc.subject多核心架構zh_TW
dc.subjectdirect parallelen
dc.subjectXML parsingen
dc.subjectXML queryingen
dc.subjectXPathen
dc.subjectmulti-coreen
dc.subjectparallelen
dc.subjectVTD-XMLen
dc.subjectreparsingen
dc.title可擴展標記語言之語法分析及查詢之數據平行方法zh_TW
dc.titleA Data Parallel Approach to XML Parsing and Queryen
dc.typeThesis
dc.date.schoolyear99-2
dc.description.degree碩士
dc.contributor.oralexamcommittee雷欽隆(Chin-Laung Lei),洪士灝(Shih-Hao Hung),鄭振牟(Chen-Mou Cheng)
dc.subject.keyword可擴展標記語言,可擴展標記語言的路徑語言,多核心架構,虛擬標記描述子,平行處理,直接平行處理,zh_TW
dc.subject.keywordXML parsing,XML querying,XPath,multi-core,parallel,VTD-XML,reparsing,direct parallel,en
dc.relation.page30
dc.rights.note有償授權
dc.date.accepted2011-06-15
dc.contributor.author-college電機資訊學院zh_TW
dc.contributor.author-dept電機工程學研究所zh_TW
顯示於系所單位:電機工程學系

文件中的檔案:
檔案 大小格式 
ntu-100-1.pdf
  未授權公開取用
1.31 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved