請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/73438
標題: | 基於 PySyft 安全聚集之聯盟式學習應用的效能分析與優化 Performance Analysis and Optimization for Federated Learning Applications with PySyft-based Secure Aggregation |
作者: | Po-Sheng Lin 林伯陞 |
指導教授: | 洪士灝(Shih-Hao Hung) |
關鍵字: | 資料隱私,聯盟式學習,安全聚集,效能建模, Data Privacy,Federated Learning,Secure Aggregation,Performance Modeling, |
出版年 : | 2020 |
學位: | 碩士 |
摘要: | 為了顧及隱私的考量,聯盟式學習成為一個可以使多個分散式客戶端合作地訓練一個共享模型同時保護他們自己訓練資料的一個新興機器學習技術。雖然聯盟式學習能降低資料外洩的風險,但駭客仍可對客戶端訓練過的模型作逆向工程來獲取客戶端訓練資料的相關資訊。為了避免這樣的風險,透過安全聚集的方法可以私密地結合各個客戶端訓練好的模型並且更新共享的模型。然而,安全聚集往往會因為需要額外的加密運算甚至是安全多方計算使用所需的通訊而帶來效能上的負擔。在這篇論文中,我們分析透過 PySyft 實現的安全聚集之聯盟式學習,PySyft 是一個包含聯盟式學習實作的開源程式碼框架,除此之外,我們提出非同步的聯盟式學習機制來改進整體效能。我們可以發現整體效能取決於客戶端的運算能力以及網路通訊的特性,我們因此提出一個效能建模的方法來幫助系統設計師了解整體執行時間的分配來對隱私、效率以及精確度作適當的取捨,設計一個平衡的系統。 To address privacy concerns, federated learning (FL) is becoming a promising machine learning technique which enables multiple decentralized clients to train a shared model collaboratively while preserving their private training data. Although FL may reduce the risks of data leak, it is still possible for hackers to reverse-engineer a trained model and figure out the information in the original training dataset provided by a FL client. In order to avoid such risks, secure aggregation (SA) can be used to privately combine the trained models of the clients to update the shared model. However, SA usually introduces performance overhead as it requires additional computation for encryption operations and even communications when secure multi-party computation (SMPC) is used. In this paper, we analyze the performance of FL with SA using PySyft, an open source framework including FL implementation, and propose an asynchronous FL mechanism to improve the overall performance. It turns out that the performance depends on the computational capabilities of the clients and the characteristics of the communication network, and we propose a performance modeling method to help system designers break down the execution time and decide on suitable trade-offs between privacy, efficiency, and accuracy for a balanced system. |
URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/73438 |
DOI: | 10.6342/NTU202004396 |
全文授權: | 有償授權 |
顯示於系所單位: | 資訊工程學系 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
U0001-0512202002355600.pdf 目前未授權公開取用 | 2.61 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。