Variable-Rate Deep Image Compression based on Low-Rank Adaptation by Progressive Learning

Xingyu Xu (許興宇)

Please use this Handle URI to cite this document: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/94359

Title:	Variable-Rate Deep Image Compression based on Low-Rank Adaptation by Progressive Learning
Author:	Xingyu Xu (許興宇)
Advisor:	Ja-Ling Wu (吳家麟)
Keywords:	Deep image compression, Variable rate image compression, Low rank adaptation, Progressive learning
Publication year:	2024
Degree:	Master's
Abstract:	In the digital age, image compression is crucial to numerous applications, including web media, streaming services, high-resolution medical imaging, and connected vehicle networks, because it enables efficient data storage and transmission. As demand for high-quality image communication grows, the need for advanced compression techniques becomes increasingly pressing. Several learned image compression methods have been proposed in recent years, showing impressive performance compared with traditional standards. However, variable-rate image compression remains an open problem. Some learned image compression methods deploy multiple networks to attain different compression rates, whereas others use a single model, which can increase computational complexity and reduce performance. In this thesis, we propose a progressive learning approach to variable-rate image compression based on a parameter-efficient fine-tuning method, Low-Rank Adaptation (LoRA). Because LoRA's re-parameterized weights can be merged, the proposed method adds no computational complexity at inference time. Comprehensive experiments show that, compared with methods that use multiple models, our approach achieves similar performance while reducing parameter storage by 99%, the dataset by 90%, and training steps by 97%.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/94359
DOI:	10.6342/NTU202402012
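Full-text authorization:	Authorized (on-campus access only)
Full-text public release date:	2029-07-21
Appears in department:	Graduate Institute of Networking and Multimedia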
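
The abstract's claim that merging adds no inference cost can be illustrated with a small sketch. This is not the thesis's implementation: it only shows the general LoRA identity, in which a frozen weight W plus a scaled low-rank update (alpha / rank) * B * A is folded into a single matrix of the same shape. The matrices, the scaling values, and the helper names (`matmul`, `add`, `scale`, `merge_lora`) are all assumptions chosen for illustration.

```python
# Illustrative sketch only: names, shapes, and values are assumptions,
# not taken from the thesis.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def add(a, b):
    """Element-wise sum of two equally shaped matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(a, s):
    """Multiply every entry of a matrix by the scalar s."""
    return [[s * x for x in row] for row in a]

def merge_lora(W, B, A, alpha, rank):
    """Fold the low-rank update into the base weight: W + (alpha / rank) * B @ A."""
    return add(W, scale(matmul(B, A), alpha / rank))

# Frozen base weight (2x3).
W = [[1.0, 0.0, 2.0],
     [0.0, 1.0, -1.0]]
# Hypothetical rank-1 adapter: B is 2x1, A is 1x3.
B = [[0.5], [-1.0]]
A = [[2.0, 0.0, 4.0]]
alpha, rank = 2.0, 1

x = [[1.0], [2.0], [3.0]]  # input as a 3x1 column vector

# Unmerged: the base path plus a separate adapter path (extra multiplies).
y_unmerged = add(matmul(W, x), scale(matmul(B, matmul(A, x)), alpha / rank))

# Merged: a single matrix multiply with the same shape as the base layer.
W_merged = merge_lora(W, B, A, alpha, rank)
y_merged = matmul(W_merged, x)
```

Because `W_merged` has the same shape as `W`, a layer that uses it performs exactly the work of the original layer, while only the small `B` and `A` matrices need to be stored for each adapted configuration.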

Files in this document:

File	Size	Format
ntu-112-2.pdf (not currently authorized for public access)	1.16 MB	Adobe PDF

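Unless their copyright terms are specifically indicated otherwise, all documents in this system are protected by copyright, with all rights reserved.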
