用於多任務邊緣-雲端部署的架構與切割點聯合搜尋方法

張祐綸; Yu-Lun Chang

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/99169

標題:	用於多任務邊緣-雲端部署的架構與切割點聯合搜尋方法 SplitNAS: Joint Architecture and Partition Search for Multi-Task Edge-Cloud Deployment
作者:	張祐綸 Yu-Lun Chang
指導教授:	簡韶逸 Shao-Yi Chien
關鍵字:	神經架構搜尋,邊雲協作,多任務學習,延遲感知,知識蒸餾, Neural Architecture Search,edge-cloud collaboration,multi-task learning,latency-aware loss,
出版年 :	2025
學位:	碩士
摘要:	為了滿足邊緣裝置上即時且低延遲執行電腦視覺任務的需求，如何高效部署多任務學習（MTL）模型於邊雲協作架構中，成為當前的重要挑戰。由於邊緣裝置的運算資源有限，若僅在邊緣端執行複雜的多任務模型，或完全依賴雲端推論，往往難以在準確率與延遲之間取得理想的平衡。為此，本文提出 SplitNAS，一個統一式架構搜尋框架，可聯合探索任務專屬的分支結構與模型切割點，以支援協同式的邊雲部署。 SplitNAS 採用可微分的神經架構搜尋（Neural Architecture Search）流程，聯合優化任務分支結構與模型切割位置。為了平衡預測效能與系統延遲，SplitNAS 設計了延遲感知的損失函數，綜合考量邊緣運算、雲端推論與資料傳輸的成本。在此基礎上，模型切割點進一步結合 autoencoder 壓縮模組以降低中間特徵的傳輸負擔，並應用知識蒸餾（KD）技術以提升模型準確率。在 PASCAL-Context 與 NYUD-v2 兩個資料集上的實驗結果顯示，SplitNAS能在不同頻寬與硬體條件下取得優異的準確率與延遲權衡。與純邊緣端或純雲端推論相比，SplitNAS 在 MobileNetV2 與 ResNet34 架構下皆可減少超過 50% 的總體延遲，展現其於實際應用場景中的效能與實用價值。 To meet the demand for real-time and low-latency execution of computer vision tasks on edge devices, efficient deployment of multi-task learning (MTL) models in edge-cloud collaborative settings has become a critical challenge. Due to the limited computational capacity of edge devices, deploying complex multi-task models solely on the edge or fully on the cloud often results in suboptimal tradeoffs between accuracy and latency. To address this, we propose SplitNAS, a unified framework that jointly searches for task-specific branching architectures and optimal partition points for collaborative edge-cloud deployment. SplitNAS adopts a differentiable neural architecture search (NAS) process to jointly optimize task-specific branching structures and model partition points. To balance prediction performance and system latency, it integrates latency-aware loss functions that consider both edge and cloud execution costs, as well as transmission overhead. On top of this, an autoencoder-based compression module is introduced at the partition point to reduce feature transmission cost, and knowledge distillation (KD) is applied to improve the accuracy of the model. Experimental results on the PASCAL-Context and NYUD-v2 datasets demonstrate that SplitNAS achieves superior accuracy-latency trade-offs under various bandwidth and hardware conditions. Compared to pure edge or cloud inference, SplitNAS reduces total latency by over 50% on both MobileNetV2 and ResNet34 backbones, highlighting its effectiveness and practical value for real-world deployment.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/99169
DOI:	10.6342/NTU202502858
全文授權:	未授權
電子全文公開日期:	N/A
顯示於系所單位：	電子工程學研究所

文件中的檔案：

檔案	大小	格式
ntu-113-2.pdf 未授權公開取用	4.04 MB	Adobe PDF

顯示文件完整紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料（如：文字、圖片、PDF）並使其易於取用。