Memory Bandwidth Reduction Technique with Visibility Testing Engine for 3D Graphics Rendering Systems

Chi-Ling Wu; 吳其玲

To cite this document, use this Handle URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/31982

Title:	Memory Bandwidth Reduction Technique with Visibility Testing Engine for 3D Graphics Rendering Systems (original title: 三維圖學渲染系統可見度測試引擎之記憶體頻寬減少技術)
Author:	Chi-Ling Wu 吳其玲
Advisor:	簡韶逸
Keywords:	3D graphics rendering, graphics hardware, visibility
Year of publication:	2006
Degree:	Master's
Abstract:	In recent years the 3D graphics industry has grown rapidly, and demand for graphics hardware accelerators has grown with it. Driven by the entertainment industry, 3D graphics applications have become widespread, appearing in both personal computers and consumer electronics. However, as expectations for rendering quality rise, models and scenes become more complex, so 3D graphics rendering systems must process more data. This data keeps external memory bandwidth high, which limits rendering performance. In this thesis, we propose a hardware-oriented visibility testing algorithm to reduce unnecessary external memory bandwidth. At the algorithm level, the proposed visibility testing algorithm adopts coverage masks to reduce the visibility data bandwidth; coverage masks also make it easy to integrate antialiasing by oversampling. In addition, the coverage masks are stored in a hierarchical structure, which allows the visibility of some regions to be decided quickly and so speeds up visibility testing. Importantly, the visibility tests are performed during rendering and do not require a hardware mechanism that reports visibility back (occlusion queries). Because the algorithm uses coverage masks, incoming primitives must be sorted in front-to-back order. Static scenes and models are often pre-sorted with BSP trees to accelerate rendering in conventional Z-buffer systems. For such sorted triangle queues, the proposed algorithm not only accelerates rendering but also removes much of the unnecessary external memory bandwidth. The proposed algorithm therefore provides a low-bandwidth acceleration method for sorted static scenes and models. At the architecture level, the proposed algorithm can be integrated into 3D graphics hardware rendering systems, and the hardware implementation can be designed for scalability, so the proposed rendering system can be extended to various graphics applications. The experimental results show that up to 80% of the external memory bandwidth can be saved without antialiasing, and up to 97% with antialiasing. A prototype chip of the proposed 3D graphics rendering system with the visibility testing engine is fabricated with TSMC 0.18 µm 1P6M technology, with a chip size of 2.57 × 2.57 mm².
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/31982
Full-text authorization:	Paid authorization
Appears in collections:	Graduate Institute of Electronics Engineering
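
The front-to-back coverage-mask test described in the abstract can be illustrated with a minimal sketch. This is not the thesis implementation: the tile size, the two-level layout, and the names `MaskHierarchy`, `test_and_update`, `TILE_BITS`, and `full_tiles` are all assumptions made for illustration, and primitives are assumed to be opaque and already sorted front to back.

```python
# Hypothetical sketch: one bit per pixel (or per subsample when antialiasing)
# in a small screen tile. A set bit means an earlier, nearer primitive
# already covers that sample.
TILE_BITS = 16                    # assumed 4x4 samples per tile
FULL = (1 << TILE_BITS) - 1       # mask of a completely covered tile


class MaskHierarchy:
    """Two-level coverage-mask store: per-tile masks plus a coarse summary."""

    def __init__(self, num_tiles):
        self.tile_masks = [0] * num_tiles  # fine level: one mask per tile
        self.full_tiles = 0                # coarse level: bit t set if tile t is full

    def test_and_update(self, tile, prim_mask):
        """Return the samples of prim_mask still visible, then mark them covered."""
        prim_mask &= FULL
        # Coarse level: a fully covered tile rejects the fragment at once,
        # without reading the fine-level mask.
        if (self.full_tiles >> tile) & 1:
            return 0
        # Fine level: only samples not covered by nearer primitives are visible.
        visible = prim_mask & ~self.tile_masks[tile]
        self.tile_masks[tile] |= prim_mask
        if self.tile_masks[tile] == FULL:
            self.full_tiles |= 1 << tile
        return visible


# Fragments arrive front to back as (tile index, coverage mask).
h = MaskHierarchy(num_tiles=2)
frags = [(0, 0x00FF), (0, 0xFF00), (0, 0x0F0F), (1, 0x000F)]
visible = [h.test_and_update(t, m) for t, m in frags]
```

In this sketch the third fragment is rejected at the coarse level because the first two already fill tile 0, so no per-sample data would need to be fetched or written for it. A hardware design would keep the masks in on-chip storage; the list and integer here only stand in for that.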

Files in this item:

File	Size	Format
ntu-95-1.pdf (currently not authorized for public access)	3.25 MB	Adobe PDF
