Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/79482

Full metadata record (each entry: DC field, value, language)
dc.contributor.advisor: 劉邦鋒(Pangfeng Liu)
dc.contributor.author: Chang-Han Chiang (en)
dc.contributor.author: 江昶翰 (zh_TW)
dc.date.accessioned: 2022-11-23T09:01:35Z
dc.date.available: 2022-10-12
dc.date.available: 2022-11-23T09:01:35Z
dc.date.copyright: 2021-10-21
dc.date.issued: 2021
dc.date.submitted: 2021-10-14
dc.identifier.uri: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/79482
dc.description.abstract: Deep neural networks have been used extensively in machine learning in recent years. To achieve higher prediction accuracy, networks have become increasingly deep; however, the gain in accuracy comes with longer inference time and higher energy consumption. Prior work introduced BranchyNet to address this problem: branches are added to the original network so that an input can exit early with a prediction without executing the entire network. BranchyNet requires manually tuned hyperparameters that strongly affect its effectiveness, and this thesis focuses on where these branches should be placed to obtain the best result. To the best of our knowledge, no previous work has addressed this problem. We first define the branch placement problem and prove that it is NP-complete. We then propose a dynamic programming algorithm that finds the optimal branch locations in a network. We evaluate the algorithm on VGG networks of four different depths, and the experimental results show that it efficiently computes the optimal branch locations and accurately estimates both the prediction accuracy and the actual inference time. (zh_TW)
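The abstract describes a trade-off: each added branch costs extra classifier computation for every input that reaches it, but lets a fraction of inputs exit before the remaining layers run. The thesis formalizes this as a branch placement problem, proves it NP-complete, and solves it with dynamic programming; that algorithm is not reproduced here. The fragment below is only a minimal, hypothetical Python sketch of the cost model behind the trade-off, using brute-force search over candidate positions. All names, latencies, and exit probabilities are illustrative assumptions, not values from the thesis.

# Hypothetical sketch of the early-exit cost trade-off (not the thesis algorithm).
# Assumed model: a chain of layers with known per-layer latencies, a fixed branch
# classifier cost, and an assumed probability that an input exits at each position.
from itertools import combinations

def expected_cost(layer_costs, branch_cost, exit_prob, branches):
    """Expected per-input inference cost when exit branches are placed
    after the layers whose indices appear in `branches`."""
    cost = 0.0
    stay = 1.0                                # fraction of inputs still in the network
    for i, c in enumerate(layer_costs):
        cost += stay * c                      # layer i runs only for inputs still inside
        if i in branches:
            cost += stay * branch_cost        # branch classifier overhead at this position
            stay *= 1.0 - exit_prob[i]        # some inputs exit here and skip the rest
    return cost

def best_branches(layer_costs, branch_cost, exit_prob, k):
    """Brute-force search for the cheapest set of at most k branch locations."""
    n = len(layer_costs)
    best = (expected_cost(layer_costs, branch_cost, exit_prob, ()), ())
    for r in range(1, k + 1):
        for combo in combinations(range(n - 1), r):   # no branch after the final layer
            c = expected_cost(layer_costs, branch_cost, exit_prob, combo)
            best = min(best, (c, combo))
    return best

if __name__ == "__main__":
    layer_costs = [3.0, 5.0, 5.0, 8.0, 8.0]   # assumed per-layer latencies (ms)
    exit_prob   = [0.1, 0.3, 0.5, 0.7, 1.0]   # assumed exit rates at each position
    print(best_branches(layer_costs, branch_cost=1.0, exit_prob=exit_prob, k=2))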
dc.description.provenance: Made available in DSpace on 2022-11-23T09:01:35Z (GMT). No. of bitstreams: 1. U0001-0810202115010300.pdf: 2565231 bytes, checksum: 6385a1f64f0a9881eb2237fc23284baa (MD5). Previous issue date: 2021 (en)
dc.description.tableofcontents:
致謝 (Acknowledgements)
摘要 (Chinese Abstract)
Abstract
Contents
List of Figures
List of Tables
List of Algorithms
Chapter 1 Introduction
Chapter 2 Background
  2.1 Architecture
  2.2 Training Methodologies
    2.2.1 Joint Training
    2.2.2 Separate Training
  2.3 Fast Inference by Early Exiting
Chapter 3 Related Work
  3.1 Model Compression
    3.1.1 Unstructured Pruning
    3.1.2 Structured Pruning
  3.2 Dynamic Inference
    3.2.1 Dynamic Layer Skipping
    3.2.2 Adaptive Channel and Filter Pruning
  3.3 Split Computing
Chapter 4 Methodology
  4.1 Hyperparameters
  4.2 Objective
  4.3 Assumption
  4.4 Notation
  4.5 NP-Completeness
  4.6 Optimal Branch Location Search Algorithm
    4.6.1 Recursion
    4.6.2 Terminal Case
    4.6.3 Time Complexity Analysis
Chapter 5 Experiment
  5.1 VGG11
  5.2 Other VGGs
Chapter 6 Conclusion
References
dc.language.iso: en
dc.title: 分支網路中性價比最高之最佳分支位置演算法 (zh_TW)
dc.title: Optimal Branch Location for Cost-effective Inference on Branchynet (en)
dc.date.schoolyear: 109-2
dc.description.degree: 碩士 (Master's)
dc.contributor.coadvisor: 王大為(Da-Wei Wang)
dc.contributor.oralexamcommittee: 吳真貞(Hsin-Tsai Liu), 洪鼎詠(Chih-Yang Tseng)
dc.subject.keyword: 深度神經網路, 提前退出機制, 分支網路, 動態規劃, NP完備 (zh_TW)
dc.subject.keyword: Deep Neural Network, early exits, branchynet, dynamic programming, NP-Complete (en)
dc.relation.page: 30
dc.identifier.doi: 10.6342/NTU202103620
dc.rights.note: Authorization granted (open access worldwide)
dc.date.accepted: 2021-10-15
dc.contributor.author-college: 電機資訊學院 (zh_TW)
dc.contributor.author-dept: 資料科學學位學程 (zh_TW)
Appears in Collections: 資料科學學位學程

Files in This Item:
File: U0001-0810202115010300.pdf (2.51 MB, Adobe PDF)

