Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 管理學院
  3. 資訊管理學系
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/55669
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor孫雅麗
dc.contributor.authorChang-min Wuen
dc.contributor.author吳張民zh_TW
dc.date.accessioned2021-06-16T04:16:10Z-
dc.date.available2017-08-25
dc.date.copyright2014-08-25
dc.date.issued2014
dc.date.submitted2014-08-20
dc.identifier.citation[1] F. CHANG, J. DEAN, S. GHEMAWAT, W. HSIEH, D. WALLACH, M. BURROWS, T. CHANDRA, A. FIKES, and R. GRUBER, “Bigtable: A distributed storage system for structured data.” In Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation OSDI’06, 2006
[2] http://hbase.apache.org/
[3] http://cassandra.apache.org/
[4] S. Melnik et al. “Dremel : interactive analysis of web-scale datasets. In Proceedings of the VLDB Endowment”, pages 330–339, 2010
[5] Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. “The Google file system.” In 19th Symposium on Operating Systems Principles, pages 29-43, 2003.
[6] Michael Hausenblas and Jacques Nadeau, “Apache Drill: Interactive ad-hoc analysis at scale”. Big Data, 2013.
[7] M. Kornacker and J. Erickson. Cloudera Impala: Real-Time Queries in Apache Hadoop, for Real. http://www.cloudera.com/content/cloudera/ en/products-and-services/cdh/impala.html, 2012.
[8] D.J. Abadi, S.R. Madden, N. Hachem, “Column-stores vs. row-stores: how different are they really?” In: SIGMOD 2008: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp. 967–980.ACM, 2008.
[9] Z. Liu, B. He, H. Hsiao, Y. Chen, “Efficient and scalable data evolution with column oriented databases.” In: EDBT , 2011.
[10] Amit Kumar Dwivedi, C.S. Lamba, Shweta Shukla, “Performance Analysis of Column Oriented Database versus Row Oriented Database.” In International Journal of Computer Application, July 2012.
[11] D.J. Abadi,”Column-stores for wide and sparse data.”In Proceedings of the Conference on Innovative Data Systems Research, 2007.
[12] Giura, Paul, and Nasir Memon. 'Netstore: An efficient storage infrastructure for network forensics and monitoring.' Recent Advances in Intrusion Detection. Springer Berlin Heidelberg, 2010.
[13] Taylor, Teryl, et al. 'Toward Efficient Querying of Compressed Network Payloads.' USENIX Annual Technical Conference, 2012.
[14] http://www.cisco.com/
[15] M. Berger, “IP lookup with low memory requirement and fast update”, In: Proceedings of IEEE High Performance Switching and Routing,p. 287–291,2003.
[16]Symantec,W32.Sasser.B.Worm,http://securityresponse.symantec.com/avcenter/venc/data/w32.sasser.b.worm.html , 2004.
[17] Carrie Gates, Michael Collins, Michael Duggan, Andrew Kompanek, and Mark Thomas. “More NetFlow Tools: For Performance and Security.” In Proceedings of the USENIX18th Systems Administration Conference, p.121–131, November 2004
[18] Shvachko, Konstantin, et al. 'The hadoop distributed file system.' Mass Storage Systems and Technologies (MSST), 2010 IEEE 26th Symposium on. IEEE, 2010.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/55669-
dc.description.abstract網路流量隨著科技的進步和普及有逐漸成長的趨勢,其勢必會帶動更複雜的網路活動,和資安攻擊事件的增長,如何儲存這巨量的網路流量並彈性且快速地取得提供作資安分析成為一個具有挑戰性的課題。例如在分析即時的資安事件上,這需求對於過往龐大的流量資料進行快速的分析,在面對成長趨勢有著猛烈成長的流量資料來說,資安管理人員需要一套新的資料管理與計算架構與模式及方法才來達到管理者的要求。
本論文將提出一個基於雲端架構的巨量網路通訊活動資料管理系統,”搭配上行導向儲存”、”階層式檔案路徑”、”資料重複壓縮技術”,提出一個完善的架構,幫助管理者根據資料分析的巨、微觀維度,迅速地找出相關過往的NetFlow資料。此系統能夠針對使用者所提出的查詢,迅速並且及時地提供相對應的資料供前端呈現有意義的結果給使用者。
關鍵詞:雲端計算、行導向儲存、階層式檔案路徑、資料重複壓縮、巨量資料
zh_TW
dc.description.abstractAs the network volume growing rapidly, there should have much more complicated network activities and security problems. How to store the big volume of network traffic and to access the data as soon as possible for security analysis is a challenging work. For example, in the analysis of real-time information security events, it requires past huge NetFlow data for analysis. In the face of rapidly growing trends for NetFlow data, information security managers need a new set of data management, computing architectures, models and methods to achieve the requirements.
This paper will present a cloud-based architecture, network communications activity massive data management system, with “Column-based Segmentation”, “Directory path hierachy”, “Deduplication”. Proposed a comprehensive framework to help managers in accordance with giant, micro-dimensional of data analysis to rapidly retrieve relative past NetFlow data. This network communications activity massive data management system could provide queries and show views of network communication activities easily, clearly and quickly.
Keywords: could computing, Column-based Segmentation, Directory path hierachy, Deduplication, big data
en
dc.description.provenanceMade available in DSpace on 2021-06-16T04:16:10Z (GMT). No. of bitstreams: 1
ntu-103-R01725027-1.pdf: 1298794 bytes, checksum: 0926d8f910a4dc80558298f80716cf23 (MD5)
Previous issue date: 2014
en
dc.description.tableofcontents目錄
謝詞 III
論文摘要 IV
THESIS ABSTRACT V
圖片索引 VII
表格索引 VIII
一、簡介 1
1.1 動機 1
1.2 目標 1
二、相關文獻 2
三、系統模型 7
四、巨量網路通訊活動資料管理系統 10
4.1 目標應用資料 10
4.2設計 14
4.3 系統架構 17
(1). toolNetData: 19
(2). 行導向儲存(Column-based Segmentation): 19
(3). 重複資料壓縮(Deduplication) 22
(4). 建立Metadata file 26
(5). 階層式檔案路徑(directory hierarchy path) 28
(6). Data finder 31
(7). Reconstructor 33
五、實驗 35
5.1.實驗環境 35
5.2.實驗設計 37
5.3實驗結果 38
六、未來展望與結論 51
七、參考資料 52
dc.language.isozh-TW
dc.title巨量網路活動資料之即時運算與視覺化呈現zh_TW
dc.titleVisually Interactive Security Analysis of BigIPen
dc.typeThesis
dc.date.schoolyear102-2
dc.description.degree碩士
dc.contributor.oralexamcommittee陳孟彰,謝錫?,李漢銘,潘育群
dc.subject.keyword雲端計算,行導向儲存,階層式檔案路徑,資料重複壓縮,巨量資料,zh_TW
dc.subject.keywordcould computing,Column-based Segmentation,Directory path hierachy,Deduplication,big data,en
dc.relation.page54
dc.rights.note有償授權
dc.date.accepted2014-08-20
dc.contributor.author-college管理學院zh_TW
dc.contributor.author-dept資訊管理學研究所zh_TW
顯示於系所單位:資訊管理學系

文件中的檔案:
檔案 大小格式 
ntu-103-1.pdf
  目前未授權公開取用
1.27 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved