最佳化超字組平行化中儲存指令向量化的切片選擇策略

陳至成; Chih-Cheng Chen

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/96172

標題:	最佳化超字組平行化中儲存指令向量化的切片選擇策略 Optimizing Slice Selection Strategy for Store Vectorization in Superword Level Parallelism
作者:	陳至成 Chih-Cheng Chen
指導教授:	廖世偉 Shih-Wei Liao
關鍵字:	單指令多資料流,超字組平行化,自動向量化,LLVM, SIMD,SuperwordLevelParallelism(SLP),Auto-vectorization,LLVM,
出版年 :	2024
學位:	碩士
摘要:	在現今由資料驅動計算的時代，處理大型資料集的需求日漸增長，使得程序運行效率成為研究的重點。當代的處理器普遍搭載了單指令多資料流(SIMD)處理單元，並且多數指令集架構也都支持各自的向量擴展指令集。例如，RISC-V支援RVV，而Arm架構則支援Neon和SVE。自動向量化是編譯器的優化技術之一，它能夠自動地在編譯過程中將純量指令轉換為向量指令，讓開發者能夠利用向量處理單元的性能潛力，同時減少開發者撰寫程式的負擔。本研究旨在透過優化儲存指令向量化的切片選擇策略，來增強LLVM中實作的超字組平行化(SLP)，從而發掘更多潛在的向量化機會。此外，還在Arm處理器上進行了效能模擬，以檢驗此設計的實際增益。 In the current era of data-driven computing, the need for processing large datasets has grown exponentially, making program execution efficiency a critical focus of research. Most processors are equipped with Single Instruction, Multiple Data (SIMD) units, and modern instruction set architectures also support vector extensions, such as RVV for RISC-V, and Neon and SVE for Arm. Auto-vectorization, a compiler optimization technique, transforms scalar code into vector instructions during compilation, allowing developers to fully exploit the performance potential of vector processing units while minimizing manual effort. This study aims to enhance the Superword Level Parallelism (SLP) auto-vectorization implemented in the LLVM compiler by optimizing slice selection for Store vectorization, thereby uncovering more potential vectorization opportunities. Additionally, performance simulations were conducted on Arm processors to verify the practical benefits of the optimized algorithm.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/96172
DOI:	10.6342/NTU202404591
全文授權:	未授權
顯示於系所單位：	資訊工程學系

文件中的檔案：

檔案	大小	格式
ntu-113-1.pdf 未授權公開取用	1.33 MB	Adobe PDF

顯示文件完整紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料（如：文字、圖片、PDF）並使其易於取用。