適用於數位相機之彩色濾波陣列內插演算法及雙資料流可重組化影像處理器之設計與實作

Tsung-Huang Chen; 陳宗煌

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/27598

標題:	適用於數位相機之彩色濾波陣列內插演算法及雙資料流可重組化影像處理器之設計與實作 Hardware-Oriented Demosaicking Algorithm and Design of Dual-Stream Reconfigurable Image Signal Processor for Digital Still Cameras
作者:	Tsung-Huang Chen 陳宗煌
指導教授:	簡韶逸(Shao-Yi Chien)
關鍵字:	彩色影像內插,可重組化,影像處理, demosaicking,reconfigurable,image processing,
出版年 :	2007
學位:	碩士
摘要:	近年來，隨著科技的快速發展以及對於拍照的需求，數位相機已經成為人們生活中不可或缺的一部分。一般來說，數位相機是經由鏡頭擷取影像資料，並再經由一連串的處理程序後，送到壓縮處理單元並且儲存在記憶體中。而為了降低成本與體積，目前數位相機都是採用單一影像感測器，也就是在影像感測器前面放上彩色濾波片，使得每一像素只有得到紅、綠、藍其中一種顏色，再藉由彩色濾波片內插演算法透過周圍像素的值來預測而其他兩種顏色。一般來說，經由此內插後而得到的圖常會出現兩種缺陷，分別是拉鍊效應(zipper effect)以及不真實顏色(false color)。許多著名的內插演算法都是要去解決這兩種缺陷來達到較好的影像品質。然而，在這些之中，並沒有任何一個演算法有針對硬體的考量來設計。因此，在本篇論文中，我們分析了這些演算法的硬體成本並且根據分析所得到的結果，提出了以硬體設計為導向的彩色濾波片內插演算法。這個方法是採用了以彩度變化量為權重的內插方式。實驗結果顯示在所提出的方法不但能達到低成本硬體的考量，並且仍然可以達到好的影像品質。這也顯示了所提出的方法對於影像品質與硬體成本之間是很好的平衡點。此外，我們也針對一般數位相機影像處理的解法進行評估，包括數位訊號處理器(DSP)、特定應用積體電路(ASIC)、混合式的架構以及粗顆粒可重組化影像處理器(CRISP).數位訊號處理器擁有高彈性可支援多樣的影像處理演算法，然而並不能達到預覽模式下即時處理的需求。而ASIC 則是對於特定的演算法針對面積、功率以及速度進行最佳化，因此擁有高效能以及低成本的優勢，但是並不具備架構的彈性，無法達到對於照相時所需不同高畫質影像處理管線演算法的特性。而混合式的架構則是結合了上述兩種方式的優點，然而硬體成本過高而且使用效率也相當低。因此，粗顆粒可重組化影像處理器(CRISP)則是被提出來解決傳統影像處理器所遇到的問題。根據影像處理演算法之高度運算相似性及硬體資源共享性，CRISP是以預覽模式所需要的影像處理管線為硬體投資的標準，使用足夠的硬體資源來滿足及時化的需求並且投資更多適當的硬體和繞線來增加架構的彈性和處理能力。如此除了兼顧ASIC和DSP 的優點外，CRISP 還是一個擁有低成本及高效率的影像處理器。因此在本篇論文中，為了提升在預覽模式下所得到的影像與視訊的品質，我們採用了CRISP 的設計概念並且提出了雙資料流影像處理器(CRISP-DS)，其全新的設計概念稱作Dual-Stream with Context Switch。這概念則是結合了CRISP 在預覽模式及拍照模式下的特性。在預覽模式下，透過CRISP-DS 連結影像感測器介面中的鎖相迴路(PLL)來產生比影像感測器快一倍的頻率，並且與原本影像感測器的頻率進行同步的動作。然後輸入的資料流與同步訊號在此介面中則被修改成每兩個時脈(clock cycle)來處理一筆資料，也就是說對於每個的處理單元來說都有一個時脈是沒有動作的。因此，藉由增加每個處理單元的context 使得新增加的context 可以在這個時脈中對處理單元進行重組。如此每個處理單元就有如變成兩倍一樣，而所能支援的影像管線處理演算法的能力也就因此大大提升。也就是說在此設計概念下，對於同樣一套較好的影像處理管線演算法，相較於CRISP 必須透過增加處理單元來達成，CRISP-DS 只需要增加一組新的context 就可以達到相同的效果。如此就可以大幅度的降低硬體成本並且維持低成本的優勢。在硬體實作方面，CRISP-DS 已經實作成晶片，採用TSMC 0.13 um 1P8M 的製程。晶片的面積為3.18mm x 3.18mm 而晶片核心大小為 2.2mm x 2.2mm。最高的頻率可以達到200Mhz，而在預覽模式下則是跑在比影像感測器的頻率還要快兩倍的頻率。當跑在200Mhz時，平均消耗的功率為314mW。在CRISP-DS 中我們總共使用了104,192bits 的SRAM。在比較上，透過所提出的測試影像處理管線演算法，實作結果證明CRISP-DS 在面積與使用功率的效率上都是比CRISP 還來的優異，處理速度也快上六倍。而與TI 的TMS320C64x相比，處理速度則是快了250 倍以上。 In recent years, the Digital Still Camera (DSC) has been widely adopted as an image capture device for PC-based multi-media system. A digital still camera can capture images by using a CCD or CMOS sensor and then compress the data to store the images on a memory card. In order to reduce the cost and size, many digital cameras use a single image sensor with a color filter array (CFA) to capture images. Because each pixel only contains one of the three primary colors in CFA, the others must be estimated from the neighboring pixels. This process is called as CFA demosaicking. Generally speaking, there are two artifacts generated in the process of interpolation,including zipper effect and false color artifacts. Many state-of-the-art demosaicking methods are proposed to reduce these artifacts for achieving better image quality perceptually and in PSNR. However, hardware cost for VLSI implementation is not considered in most of them. Therefore, a hardware-oriented CFA interpolation algorithm is developed in this thesis according to the hardware cost analysis results. The proposed algorithm is by use of chrominance variance weighting scheme interpolation. Experimental results show that our method can achieve good image quality in PSNR than the existing methods on variety of test images while low hardware cost is still maintained. It shows that this method can be a good compromise between image quality and hardware cost. Besides, we also evaluate mostly used existing solutions to image processor, including DSP, ASIC, hybrid solution and CRISP [2]. DSP owns high flexibility and can handle almost all kinds of image pipeline tasks in a DSC system, but the cost is much higher and it can’t meet the real-time requirement in the preview mode. The application-specific-integrated-circuit (ASIC) solution is highly optimized in terms of area, power and speed to perform its designated task, but it, however, doesn’t have much flexibility for different algorithms in the picture-taking mode. The hybrid solution combines the advantages of DSP and ASIC but the hardware utilization is low and the cost is too high. Therefore, a coarse-grain reconfigurable image processor, CRISP, has been proposed to solve this problem. It can approach the hardware cost lower bound of preview engine caused by the real-time constraints in the preview mode. In addition, the flexibility requirement in the picture-taking mode can be achieved by the reconfigurability of CRISP. The high processing speed can also be achieved by the processing elements specially designed for image processing tasks by utilizing the algorithmic similarity. In summary, CRISP combines the advantages of ASIC and DSP into a single hardware by proper time-space tradeoff in different modes. The high flexibility and efficiency of CRISP is very suitable for image pipeline in DSC. Therefore, in order to upgrade the image or video quality in the preview mode, we follow the concept of CRISP and propose a dual-stream reconfigurable image stream processor (CRISP-DS) with a new design idea, called dual stream with context switch. The concept of dual-stream with context switch is to combining these characteristics in the preview and picture-taking mode. In the preview mode, inside the interface of CRISP-DS, there is a PLL (Phase lock loop) to generate the frequency twice than sensor frequency and to synchronize it. Then the data stream and synchronization signal are modified such that there are two cycle to process a pixel, which concept is similar to data stream rate adjustment in the picture-taking mode. Then we double the context for some reconfigurable stage processing elements (RSPEs) and each context is corresponding to the each one of two cycles, which means each RSPE can be configured as two different image pipeline algorithms. Then a better image pipeline can be employed in the concept meanwhile the overhead of gate count to implement this pipeline is mush smaller than original CRISP design. The CRISP-DS chip is fabricated with TSMC 0.13um 1P8M CMOS process via CIC. The chip die size is 3.18 mm x 3.18 mm and the core size is 2.2 mm x 2.2 mm. The max working frequency in preview mode is twice than the frequency of the sensor and 200Mhz in picture-taking mode. The average total power consumption at 200 MHz is 314 mW. The total on-chip SRAM bit number is 104,192 bits. For the golden test image pipeline, the implementation results demonstrate that the area and power efficiency of CRISP-DS are better than CRIPS. In additions, the processing speed of CRISP-DS is 6 times faster than CRISP and over 250 times fasters than TMS320C64x DSP.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/27598
全文授權:	有償授權
顯示於系所單位：	電子工程學研究所

文件中的檔案：

檔案	大小	格式
ntu-96-1.pdf 目前未授權公開取用	1.91 MB	Adobe PDF

顯示文件完整紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料（如：文字、圖片、PDF）並使其易於取用。