Please use this Handle URI to cite this document:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/9716
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 洪士灝(Shih-Hao Hung) | |
dc.contributor.author | Chang-Chun Huang | en |
dc.contributor.author | 黃昶竣 | zh_TW |
dc.date.accessioned | 2021-05-20T20:37:14Z | - |
dc.date.available | 2010-07-30 | |
dc.date.available | 2021-05-20T20:37:14Z | - |
dc.date.copyright | 2008-07-30 | |
dc.date.issued | 2008 | |
dc.date.submitted | 2008-07-29 | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/9716 | - |
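dc.description.abstract | With advances in networking technology, networked storage devices have become increasingly common, Network-Attached Storage (NAS) in particular. As NAS has spread, its price and performance have drawn more attention. In addition, as network bandwidth grows, lower-end processors can no longer keep up with network protocol processing and data movement. To address these problems, researchers have proposed technologies such as Remote Direct Memory Access (RDMA) and the TCP Offload Engine (TOE). However, these solutions require additional hardware support, and they also need cooperating software before they can reduce the CPU load efficiently.
This thesis uses performance tracing and measurement tools and techniques to locate the performance bottlenecks that need improvement on a NAS, and uses the traced data to assist hardware-software integration. We successfully migrated the main data-processing functions of the Samba NAS server into the Linux operating system kernel to reduce the CPU's data-movement burden. With this change in software architecture, we obtained a 1.01x to 1.31x improvement in file write performance. Combined with a TOE in the future, the improvement for large-file writes is expected to reach 2.62x. | zh_TW |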
dc.description.abstract | As Ethernet performance increases, network-enabled storage solutions have become popular, especially Network Attached Storage (NAS) appliances. Some low-end NAS servers come at affordable prices, and thus the trade-off between performance and cost in such servers becomes an important issue. In fact, while processing I/O requests from client machines, the control processor in a low-cost NAS server spends most of its time handling data copy operations and network protocols. To boost performance, software (zero-copy) and hardware (TCP Offload Engine) solutions have been proposed to remove data copy operations and to handle network protocols, respectively. However, few attempts have been made to address the hardware-software codesign issue of what software effort is needed to take advantage of the offload engines.
In this thesis, we tackle the hardware-software codesign issue by presenting a thorough performance study of the target system. We first diagnose the performance of the target system. The results show that data copy operations and data processing in the network protocol stack are the two major performance bottlenecks. For the software part, we migrate the Samba engine into the kernel and remove unnecessary data copy operations. For the hardware part, we predict the performance after adding an offload engine to handle data processing in the network protocol stack. We show that the software changes achieve a speedup of 1.01x to 1.31x over the original code, and that a 1.51x to 2.62x performance improvement is predicted if network protocol processing can be offloaded to dedicated hardware. | en |
dc.description.provenance | Made available in DSpace on 2021-05-20T20:37:14Z (GMT). No. of bitstreams: 1 ntu-97-R95922158-1.pdf: 4979100 bytes, checksum: 2d12c93f3bbb4076a245ff485b8cb35a (MD5) Previous issue date: 2008 | en |
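dc.description.tableofcontents | Chapter 1 Introduction
1.1 Overview
1.2 Motivation
1.3 Thesis Organization
Chapter 2 Related Work and Background
2.1 Related Work
2.2 Background
2.2.1 Introduction to the Samba Service Routines
2.2.2 Samba Read Performance
2.2.3 TCP/IP Offload Engines (TOE)
Chapter 3 Experimental Design and Performance Measurement Framework
3.1 Experimental Environment
3.2 Measurement Tools
3.2.1 Iometer
3.2.2 Mpstat
3.2.3 VTune Performance Analyzer
3.2.4 Strace
3.2.5 Valgrind
3.2.6 Pvtrace
3.2.7 Dtrace
3.2.8 Systemtap
Chapter 4 Probing Performance Bottlenecks
4.1 Measurements under Different Workloads
4.2 Examining System State
4.3 Evaluating System Calls
4.4 Examining smbd's Network Processing
4.5 Tracing the Flow
4.5.1 Dtrace Trace Data
4.5.2 SystemTap Trace Data
Chapter 5 Performance Improvement Mechanism
5.1 Evaluating the Migration Path
5.2 Estimating the Degree of Performance Improvement
Chapter 6 Implementation and Experimental Results
6.1 System Performance Improvement with 1 Outstanding I/O
6.2 System Performance Improvement with a Fully Loaded Server
Chapter 7 Predicting Performance after Adding a TOE
Chapter 8 Conclusions and Future Work
References | |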
dc.language.iso | zh-TW | |
dc.title | Improving the File Write Performance of the Samba Server on Linux Network Storage Systems | zh_TW |
dc.title | Performance Optimization of the Samba write service on Linux-based Network-Attached Storage Systems | en |
dc.type | Thesis | |
dc.date.schoolyear | 96-2 | |
dc.description.degree | Master's | |
dc.contributor.oralexamcommittee | 郭大維(Tei-Wei Kuo),施吉昇(Chi-Sheng Shih),陳任凱(Ren-Kai Chen) | |
dc.subject.keyword | Network storage system, TCP offload engine | zh_TW |
dc.subject.keyword | Samba, NAS, zero-copy, TOE | en |
dc.relation.page | 49 | |
dc.rights.note | Authorization granted (open access worldwide) | |
dc.date.accepted | 2008-07-29 | |
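dc.contributor.author-college | College of Electrical Engineering and Computer Science | zh_TW |
dc.contributor.author-dept | Graduate Institute of Computer Science and Information Engineering | zh_TW |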
Appears in Collections: | Department of Computer Science and Information Engineering |
Files in This Item:
File | Size | Format | |
---|---|---|---|
ntu-97-1.pdf | 4.86 MB | Adobe PDF | View/Open |
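Items in this system are protected by copyright, with all rights reserved, unless their copyright terms are specifically indicated otherwise.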