Skip navigation

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More
DSpace logo
English
中文
  • Browse
    • Communities
      & Collections
    • Publication Year
    • Author
    • Title
    • Subject
    • Advisor
  • Search TDR
  • Rights Q&A
    • My Page
    • Receive email
      updates
    • Edit Profile
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊工程學系
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/57149
Title: 加速多核系統模擬暨減少硬體共享資源競爭
HCOREMU: Accelerating Multicore System Emulation
and Reducing Hardware Shared Resource Contention
Authors: Pei-Chi Chen
陳培基
Advisor: 劉邦鋒(Pangfeng Liu)
Co-Advisor: 吳真貞(Jan-Jan Wu)
Keyword: 平行模擬,系統模式模擬,多核心,底層虛擬機器,多線程,追蹤式動態二元碼轉換最佳化,共享資源競爭,
Parallel Emulation,System Mode Emulation,Multicores,LLVM,Multi-Threaded,Trace-based Dynamic Binary Translation Optimization,Shared Resources Contention,
Publication Year : 2014
Degree: 碩士
Abstract: 我們提出一個高效能的平行系統模擬器,命名為 HCOREMU。現有的系統模擬器主要都關注執行的正確性以及 VCPU 間同步的機制,但是有兩個重要的因素會降低他們的效能。分別是模擬器產生的機器碼的優劣,以及用來模擬的多線程會去競爭有限的共享硬體資源。在提升模擬機器碼的品質方面,我們利用現在普遍存在的多核心機器,再根基於 HQEMU 提出的追蹤式 多線程最佳化,提出了兩種引入HCOREMU 的方法。在多線程競爭共享硬體資源的方面,我們減少了三種因為競爭 而造成的效能降低的情形。第一個情況是我們發現了在非均勻訪存機器 (NUMA)上 預設的 Linux 排程器與記憶體分配的行為會有所出入。第二個情況是我們用來幫助 提高模擬機器碼品質的線程干擾模擬的線程。第三種情況則是,我們發現 某些特定的應用程式會讓多個線程一直存取某段特定的記憶體位置。我們藉由硬體的幫助來偵測上述的情況,同時也提出了對應的解決方式。HCOREMU 的效能相較於 COREMU 在單一核心模擬有 1.8 倍的提升,在多核心模擬則有 1.3 倍的提升。我們的排程方法則是相較於預設的 Linux 排程器有了 1.1 倍的提升。
We present the high performance parallel system mode emulator, HCOREMU. Existing parallel system mode emulators focus on the correctness and synchronization mechanisms of emulation. However, there are two important factors that usually impede the performance: (1) the quality of
emulation code and (2) threads contention on shared hardware resources. In this thesis, we take advantage of the ubiquitous multi-core platforms to improve our emulation code quality. We also propose two designs to accelerate multi-core system mode emulation based on the trace-based multi-threaded optimization in HQEMU.
We reduce shared resource contention in three ways. First, We reduce the interconnect traffic and access latency of our threads due to the inconsistency of default Linux scheduler and memory allocator on NUMA platform. Second, we reduce the contention between optimization threads and emulation threads. Third, we find out that some workloads have a hotspot when accessing memory. We use hardware performance counters to detect this situation. We reduce the interconnect traffic and access latency of emulation threads in workloads having this characteristics.
HCOREMU improves the performance of COREMU by a factor of 1.8X in uni-processor emulation, 1.3X in multi-core emulation. Threads contention on shared resources are reduced by our scheduling, for that our scheduling outperforms the default Linux scheduling by a factor of 1.1X.
URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/57149
Fulltext Rights: 有償授權
Appears in Collections:資訊工程學系

Files in This Item:
File SizeFormat 
ntu-103-1.pdf
  Restricted Access
1.05 MBAdobe PDF
Show full item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved