Please use this Handle URI to cite this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/67300
Full metadata record
DC Field | Value | Language
dc.contributor.advisor | 徐慰中
dc.contributor.author | Li Chang | en
dc.contributor.author | 張力 | zh_TW
dc.date.accessioned | 2021-06-17T01:27:03Z | -
dc.date.available | 2020-08-11
dc.date.copyright | 2017-08-11
dc.date.issued | 2017
dc.date.submitted | 2017-08-07
dc.identifier.citation | [1] U. Ramacher, “Software-defined radio prospects for multistandard mobile phones,” Computer, vol. 40, no. 10, 2007.
[2] S. Yang and L. Hanzo, “Fifty years of MIMO detection: The road to large-scale MIMOs,” IEEE Communications Surveys & Tutorials, vol. 17, no. 4, pp. 1941–1988, 2015.
[3] Q. Zheng, Y. Chen, R. Dreslinski, C. Chakrabarti, A. Anastasopoulos, S. Mahlke, and T. Mudge, “Architecting an LTE base station with graphics processing units,” in 2013 IEEE Workshop on Signal Processing Systems, SiPS 2013. IEEE, 2013, pp. 219–224.
[4] J. Berkmann, C. Carbonelli, F. Dietrich, C. Drewes, and W. Xu, “On 3G LTE terminal implementation-standard, algorithms, complexities and challenges,” in Wireless Communications and Mobile Computing Conference, 2008. IWCMC’08. International. IEEE, 2008, pp. 970–975.
[5] J. Ketonen, M. Juntti, and J. R. Cavallaro, “Performance-complexity comparison of receivers for a LTE MIMO-OFDM system,” IEEE Transactions on Signal Processing, vol. 58, no. 6, pp. 3360–3372, 2010.
[6] Q. Zheng, Y. Chen, R. Dreslinski, C. Chakrabarti, A. Anastasopoulos, S. Mahlke, and T. Mudge, “WiBench: An open source kernel suite for benchmarking wireless systems,” in Proceedings of the IEEE International Symposium on Workload Characterization (IISWC), 2013, pp. 123–132.
[7] S. Bang, C. Ahn, Y. Jin, S. Choi, J. Glossner, and S. Ahn, “Implementation of LTE system on an SDR platform using CUDA and UHD,” Analog Integrated Circuits and Signal Processing, vol. 78, no. 3, pp. 599–610, 2014.
[8] J. Kim, S. Hyeon, and S. Choi, “Implementation of an SDR system using graphics processing unit,” IEEE Communications Magazine, vol. 48, no. 3, 2010.
[9] T.-D. Chiueh, P.-Y. Tsai, and I.-W. Lai, Baseband receiver design for wireless MIMO-OFDM communications. John Wiley & Sons, 2012.
[10] B. Hassibi and H. Vikalo, “On the sphere-decoding algorithm I. Expected complexity,” IEEE Transactions on Signal Processing, vol. 53, no. 8, pp. 2806–2818, 2005.
[11] S. Cook, CUDA Programming: A Developer’s Guide to Parallel Computing with GPUs, 1st ed. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2013.
[12] Q. Xu, H. Jeon, and M. Annavaram, “Graph processing on GPUs: Where are the bottlenecks?” in Proceedings of the IEEE International Symposium on Workload Characterization (IISWC), 2014, pp. 140–149.
[13] D. Sui, Y. Li, J. Wang, P. Wang, and B. Zhou, “High throughput MIMO-OFDM detection with graphics processing units,” in 2012 IEEE International Conference on Computer Science and Automation Engineering (CSAE), vol. 2, pp. 176–179.
[14] M. Wu, S. Gupta, Y. Sun, and J. R. Cavallaro, “A GPU implementation of a real-time MIMO detector,” in 2009 IEEE Workshop on Signal Processing Systems, SiPS, 2009, pp. 303–308.
[15] M. S. Khairy, C. Mehlführer, and M. Rupp, “Boosting sphere decoding speed through graphic processing units,” in 2010 European Wireless Conference (EW). IEEE, 2010, pp. 99–104.
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/67300 | -
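dc.description.abstract | As mobile communication standards grow more complex and general-purpose processors grow more powerful, implementing communication algorithms with general-purpose programming languages and processors has become feasible, making software-defined radio an attractive and highly flexible solution. This thesis examines sphere decoding, a multi-input multi-output (MIMO) detection algorithm, among the baseband processing algorithms implemented on a graphics processing unit (GPU) for software-defined radio. Because its run-time behavior is less regular than that of the other stages, sphere decoding may, by Amdahl's law, become the bottleneck of the system. The implementation must therefore optimize the software for the GPU hardware architecture in order to use the performance the GPU offers effectively. Targeting the factors that make sphere decoding execute inefficiently on a GPU, we apply implementation-level optimizations and propose a divergence reduction method. In medium-to-low signal-to-noise ratio (SNR) environments, the divergence reduction method runs on average 1.6 times faster than the unoptimized implementation; combined with the implementation-level optimizations, the overall improvement over the unoptimized implementation is 2 to 3.5 times. | zh_TW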
dc.description.abstract | Modern communication protocols are becoming more complicated. As general-purpose processors grow more powerful, software-defined radio offers an attractive alternative because of its high flexibility. Graphics processing units (GPUs) let software-defined radio exploit a massively parallel computing paradigm. While most baseband processing algorithms are highly parallel, the sphere decoder is less regular than the other components because of its depth-first search nature. According to Amdahl's law, the efficiency of a software-defined LTE system implementation may therefore be limited by the sphere decoding stage. In this thesis, we propose a preprocessing stage that significantly improves the warp execution efficiency of the sphere decoding algorithm on a GPU. With it, the sphere decoder runs 1.6 times faster on average in a medium-to-low SNR environment. Together with memory-hierarchy-related optimizations, the overall performance of the sphere decoder improves by 2 to 3.5 times. | en
dc.description.provenance | Made available in DSpace on 2021-06-17T01:27:03Z (GMT). No. of bitstreams: 1
ntu-106-R04922079-1.pdf: 1631219 bytes, checksum: 0a28920247a0f6c6618da950432dd945 (MD5)
Previous issue date: 2017 | en
dc.description.tableofcontents | Acknowledgements
Abstract (Chinese)
Abstract
Contents
List of Figures
List of Tables
1 Introduction
2 Background and Motivation
2.1 Overview of MIMO Detection Algorithms
2.1.1 Maximum Likelihood Detection
2.2 Graphics Processing Unit
2.2.1 Overview of CUDA
2.2.2 GPU Memory Hierarchy
2.2.3 Irregular Program
2.3 MIMO Detection on GPU
3 Memory-aware Implementation and Divergence Reduction Preprocessing for Sphere Decoding
3.1 Schnorr-Euchner Sphere Decoding Algorithm
3.2 Memory-aware Implementation
3.2.1 Minimize the Main Memory Access
3.3 Divergence Issue in Schnorr-Euchner Sphere Decoding
3.3.1 Observation of Input Data Pattern
3.3.2 Full Frame Data Size
3.3.3 Divergence Reduction Through Reorder
3.4 Proposed Preprocessing Stages
4 Experiments
4.1 Simulation Setup
4.2 Results
4.3 Limitation
5 Conclusions
Bibliography
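dc.language.iso | en
dc.subject | 多輸入多輸出 (Multi-input Multi-output) | zh_TW
dc.subject | 球面解碼 (Sphere decoding) | zh_TW
dc.subject | 圖形處理器 (Graphics processing unit) | zh_TW
dc.subject | 軟體定義無線電 (Software-defined radio) | zh_TW
dc.subject | Software-defined radio | en
dc.subject | Graphics processing unit | en
dc.subject | Sphere decoding | en
dc.subject | Multi-input Multi-output | en
dc.title | 優化於軟體定義無線電中圖形處理器加速球面解碼之效能 (Optimizing the Performance of GPU-Accelerated Sphere Decoding in Software-Defined Radio) | zh_TW
dc.title | GPU-Aware Sphere Decoder Implementation in Software-Defined Radio | en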
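dc.type | Thesis
dc.date.schoolyear | 105-2
dc.description.degree | Master's
dc.contributor.oralexamcommittee | 廖世偉, 吳安宇
dc.subject.keyword | 軟體定義無線電, 圖形處理器, 球面解碼, 多輸入多輸出 (Software-defined radio, Graphics processing unit, Sphere decoding, Multi-input Multi-output) | zh_TW
dc.subject.keyword | Software-defined radio, Graphics processing unit, Sphere decoding, Multi-input Multi-output | en
dc.relation.page | 39
dc.identifier.doi | 10.6342/NTU201702387
dc.rights.note | Paid license
dc.date.accepted | 2017-08-07
dc.contributor.author-college | College of Electrical Engineering and Computer Science | zh_TW
dc.contributor.author-dept | Graduate Institute of Computer Science and Information Engineering | zh_TW
Appears in collections: Department of Computer Science and Information Engineering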

Files in this item:
File | Size | Format
ntu-106-1.pdf (not authorized for public access) | 1.59 MB | Adobe PDF


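Unless their copyright terms are specifically indicated otherwise, all items in the system are protected by copyright, with all rights reserved.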
