可適性移動向量搜尋和編碼演算法

Chun-Hung Lin; 林俊宏

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/58827

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.advisor	丁建均(Jian-Jiun Ding)
dc.contributor.author	Chun-Hung Lin	en
dc.contributor.author	林俊宏	zh_TW
dc.date.accessioned	2021-06-16T08:33:18Z	-
dc.date.available	2020-07-17
dc.date.copyright	2020-07-17
dc.date.issued	2020
dc.date.submitted	2020-07-13
dc.identifier.citation	REFERENCE [1] T. Zhang and S. Mao, 'An Overview of Emerging Video Coding Standards,' GetMobile: Mobile Computing and Communications, vol. 22, no. 4, pp. 13-20, 2019. [2] H. Kalva, 'The H. 264 video coding standard,' IEEE multimedia, vol. 13, no. 4, pp. 86-90, 2006. [3] J. Nilson, 'Inter-Picture Prediction forVideo Compression using Low Passand High Pass Filters,' ed, 2017. [4] G. J. Sullivan, P. N. Topiwala, and A. Luthra, 'The H. 264/AVC advanced video coding standard: Overview and introduction to the fidelity range extensions,' in Applications of Digital Image Processing XXVII, 2004, vol. 5558: International Society for Optics and Photonics, pp. 454-474. [5] T. Wiegand, G. J. Sullivan, G. Bjontegaard, and A. Luthra, 'Overview of the H. 264/AVC video coding standard,' IEEE Transactions on circuits and systems for video technology, vol. 13, no. 7, pp. 560-576, 2003. [6] G. Correa, P. Assuncao, L. Agostini, and L. A. da Silva Cruz, Complexity-Aware High Efficiency Video Coding. Springer, 2016. [7] J.-R. Ohm, G. J. Sullivan, H. Schwarz, T. K. Tan, and T. Wiegand, 'Comparison of the coding efficiency of video coding standards—including high efficiency video coding (HEVC),' IEEE Transactions on circuits and systems for video technology, vol. 22, no. 12, pp. 1669-1684, 2012. [8] B. Li, G. J. Sullivan, and J. Xu, 'Comparison of compression performance of HEVC working draft 5 with AVC high profile,' document JCTVC-H0360, 2012. [9] T. Tan, A. Fujibayashi, Y. Suzuki, and J. Takiue, 'Objective and Subjective Evaluation of HM5. 0, document JCTVC-H0116,' Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG, vol. 16, 2012. [10] T. Laude, Y. G. Adhisantoso, J. Voges, M. Munderloh, and J. Ostermann, 'A Comparison of JEM and AV1 with HEVC: Coding Tools, Coding Efficiency and Complexity,' in 2018 Picture Coding Symposium (PCS), 24-27 June 2018 2018, pp. 36-40, doi: 10.1109/PCS.2018.8456291. [11] Y. Chen et al., 'An overview of core coding tools in the AV1 video codec,' in 2018 Picture Coding Symposium (PCS), 2018: IEEE, pp. 41-45. [12] D. Grois, T. Nguyen, and D. Marpe, 'Coding efficiency comparison of av1/vp9, h. 265/mpeg-hevc, and h. 264/mpeg-avc encoders,' in 2016 Picture Coding Symposium (PCS), 2016: IEEE, pp. 1-5. [13] D. Grois, T. Nguyen, and D. Marpe, 'Performance comparison of AV1, JEM, VP9, and HEVC encoders,' in Applications of Digital Image Processing XL, 2018, vol. 10396: International Society for Optics and Photonics, p. 103960L. [14] G. J. Sullivan, J.-R. Ohm, W.-J. Han, and T. Wiegand, 'Overview of the high efficiency video coding (HEVC) standard,' IEEE Transactions on circuits and systems for video technology, vol. 22, no. 12, pp. 1649-1668, 2012. [15] A. Wieckowski, J. Ma, H. Schwarz, D. Marpe, and T. Wiegand, 'Fast partitioning decision strategies for the upcoming versatile video coding (VVC) standard,' in 2019 IEEE International Conference on Image Processing (ICIP), 2019: IEEE, pp. 4130-4134. [16] D. Marpe et al., 'Video Compression Using Nested Quadtree Structures, Leaf Merging, and Improved Techniques for Motion Representation and Entropy Coding,' IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 12, pp. 1676-1687, 2010, doi: 10.1109/TCSVT.2010.2092615. [17] M. Maazouz, N. Batel, N. Bahri, and N. Masmoudi, 'Homogeneity-based fast CU partitioning algorithm for HEVC intra coding,' Engineering Science and Technology, an International Journal, vol. 22, no. 3, pp. 706-714, 2019, doi: 10.1016/j.jestch.2018.12.016. [18] I.-K. Kim, J. Min, T. Lee, W.-J. Han, and J. Park, 'Block partitioning structure in the HEVC standard,' IEEE transactions on circuits and systems for video technology, vol. 22, no. 12, pp. 1697-1706, 2012. [19] M. Wien, 'High efficiency video coding,' Coding Tools and specification, pp. 133-160, 2015. [20] L. Trudeau, N. Egge, and D. Barr, 'Predicting chroma from luma in AV1,' in 2018 Data Compression Conference, 2018: IEEE, pp. 374-382. [21] J. Tariq and S. Kwong, 'Efficient intra and most probable mode (MPM) selection based on statistical texture features,' in 2015 IEEE International Conference on Systems, Man, and Cybernetics, 2015: IEEE, pp. 1776-1781. [22] J. Lainema, F. Bossen, W. Han, J. Min, and K. Ugur, 'Intra Coding of the HEVC Standard,' IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, no. 12, pp. 1792-1801, 2012, doi: 10.1109/TCSVT.2012.2221525. [23] A. M. Bock, Video Compression Systems: From first principles to concatenated codecs. IET Digital Library, 2009. [24] A. J. Hussain and Z. Ahmed, 'A survey on video compression fast block matching algorithms,' Neurocomputing, vol. 335, pp. 215-237, 2019. [25] B.-G. Kim and K. Goswami, Basic prediction techniques in modern video coding standards. Springer, 2016. [26] K. R. Rao and P. C. Yip, The transform and data compression handbook. CRC press, 2000. [27] C.-M. Fu et al., 'Sample adaptive offset in the HEVC standard,' IEEE Transactions on Circuits and Systems for Video technology, vol. 22, no. 12, pp. 1755-1764, 2012. [28] I. E. Richardson, The H. 264 advanced video compression standard. John Wiley Sons, 2011. [29] V. Sze and M. Budagavi, 'High throughput CABAC entropy coding in HEVC,' IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, no. 12, pp. 1778-1791, 2012. [30] D. Marpe, H. Schwarz, and T. Wiegand, 'Context-based adaptive binary arithmetic coding in the H. 264/AVC video compression standard,' IEEE Transactions on circuits and systems for video technology, vol. 13, no. 7, pp. 620-636, 2003. [31] 'Xiph.org Video Test Media[derf's collection].' [Online]. Available: https://media.xiph.org/video/derf/. [32] P. B. Lee. http://web.engr.oregonstate.edu/~benl/Courses/ECE477_sp20.html (accessed. [33] R. Yaakob, A. Aryanfar, A. A. Halin, and N. Sulaiman, 'A comparison of different block matching algorithms for motion estimation,' Procedia Technology, vol. 11, pp. 199-205, 2013. [34] P. Guillotel and C. Chevance, 'Comparison of motion vector coding techniques,' in Visual Communications and Image Processing'94, 1994, vol. 2308: International Society for Optics and Photonics, pp. 1594-1604. [35] L. Zhang, D. Wang, and D. Zheng, 'Improved adaptive arithmetic coding based on optimal segmentation of code symbols for lossless motion vector coding,' in 2011 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB), 2011: IEEE, pp. 1-5. [36] M. Takahashi and M. Yamaguchi, 'H. 264/AVC improvement based on adaptive motion vector coding,' in 2009 Picture Coding Symposium, 2009: IEEE, pp. 1-4. [37] N. I. Radwan, N. M. Salem, and M. I. El Adawy, 'Histogram correlation for video scene change detection,' in Advances in Computer Science, Engineering Applications: Springer, 2012, pp. 765-773. [38] H. Jiang, A. S. Helal, A. K. Elmagarmid, and A. Joshi, 'Scene change detection techniques for video database systems,' Multimedia systems, vol. 6, no. 3, pp. 186-195, 1998. [39] W. Lin, M.-T. Sun, H. Li, Z. Chen, W. Li, and B. Zhou, 'Macroblock classification method for video applications involving motions,' IEEE transactions on broadcasting, vol. 58, no. 1, pp. 34-46, 2011. [40] A. Varghese, J. Gubbi, A. Ramaswamy, and P. Balamuralidhar, 'ChangeNet: a deep learning architecture for visual change detection,' in Proceedings of the European Conference on Computer Vision (ECCV), 2018, pp. 0-0. [41] E. Guo et al., 'Learning to measure change: Fully convolutional siamese metric networks for scene change detection,' arXiv preprint arXiv:1810.09111, 2018. [42] C.-Y. Wu, M. Zaheer, H. Hu, R. Manmatha, A. J. Smola, and P. Krähenbühl, 'Compressed video action recognition,' in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 6026-6035. [43] C.-Y. Wu, N. Singhal, and P. Krahenbuhl, 'Video compression through image interpolation,' in Proceedings of the European Conference on Computer Vision (ECCV), 2018, pp. 416-431. [44] O. Rippel, S. Nair, C. Lew, S. Branson, A. G. Anderson, and L. Bourdev, 'Learned video compression,' in Proceedings of the IEEE International Conference on Computer Vision, 2019, pp. 3454-3463. [45] S. Ma, X. Zhang, C. Jia, Z. Zhao, S. Wang, and S. Wanga, 'Image and video compression with neural networks: A review,' IEEE Transactions on Circuits and Systems for Video Technology, 2019. [46] D. Liu, Y. Li, J. Lin, H. Li, and F. Wu, 'Deep Learning-Based Video Coding: A Review and A Case Study,' arXiv preprint arXiv:1904.12462, 2019. [47] S. Huo, D. Liu, F. Wu, and H. Li, 'Convolutional neural network-based motion compensation refinement for video coding,' in 2018 IEEE International Symposium on Circuits and Systems (ISCAS), 2018: IEEE, pp. 1-4. [48] Z. Chen, T. He, X. Jin, and F. Wu, 'Learning for Video Compression,' IEEE Transactions on Circuits and Systems for Video Technology, pp. 1-1, 2019, doi: 10.1109/TCSVT.2019.2892608.
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/58827	-
dc.description.abstract	根據Sandvine公司於2020年進行的調查，全球移動下載流量的65％為視訊。隨著物聯網和5G時代的來臨，高效能的視訊編解碼器對於節省存儲空間和頻帶使用而言，扮演越來越重要的角色。對於多媒體資料壓縮而言，視訊資料可以達到最高的壓縮率，主要是利用時間上相鄰幀之間的相關性，該技術稱為「幀間預測」。對於預測圖像而言，僅將運動向量和殘差圖像編碼以降低位元速率。然而，在視訊壓縮中，幀間預測也是高計算複雜度的主要原因。在此論文中，我們提出了透過觀察移動向量的變化來調整搜索區域的演算法，在不影響重建視訊品質和位元速率的前提下，我們試著節省比對次數。由實驗結果發現，我們提出的演算法特別適用於小動作影片或是有特定方向相機運動的影片。此外，我們也研究了將移動向量編碼的方法。與傳統基於查表方式的編碼方法相比，我們提出了自適性算術編碼，進一步減少位元數。總結來說，這篇論文回顧了視訊壓縮技術，並且對於移動向量提出搜索和編碼的相關演算法，做實驗驗證想法並討論其提升效能的可能性。	zh_TW
dc.description.abstract	According to the survey from Sandvine in 2020, 65% of the downloaded data are video files. As 5G networks are deployed, efficient video codecs are essential for saving the requirements of storage and internet bandwidth. Among all kinds of multimedia, video can achieve the highest compression ratio due to the high correlation between adjacent frames. The technique of “Inter prediction” makes good use of interframe redundancy. For a predicted frame, only the motion vector and the residual image are coded to bitstream. However, in video compression, inter prediction usually leads to high computational complexity so we believe that there is still room for improvement. In this thesis, we propose an algorithm for adjusting the search window by observing the change of global motion vectors. We minimize the computation without affecting the quality of the reconstructed video and the bitrate. The proposed algorithm is especially suitable for static and fix camera slow movement. Besides, we investigate techniques to encode motion vector data into bitstream. Compared to the classic table-based coding method, we propose a context-based adaptive arithmetic coding (AAC) to further reduce the bitlength. All in all, we review the techniques in video compression and propose efficient methods for searching and coding motion vector.	en
dc.description.provenance	Made available in DSpace on 2021-06-16T08:33:18Z (GMT). No. of bitstreams: 1 U0001-0907202016240300.pdf: 4208058 bytes, checksum: f420ea102e025d7a3d06f35046d917e7 (MD5) Previous issue date: 2020	en
dc.description.tableofcontents	誌謝 i 中文摘要 ii ABSTRACT iii CONTENTS iv LIST OF FIGURES viii LIST OF TABLES xii Chapter 1 Introduction 1 1.1 Motivation 1 1.2 Fundamental tradeoff in video coding 2 1.3 Contribution of the thesis 3 1.4 Thesis Organization 4 Chapter 2 Video codec overview 5 2.1 H.264/Advanced Video Coding (AVC) 7 2.2 H.265/High Efficiency Video Coding (HEVC) 8 2.3 AV1 10 Chapter 3 Video Compression Technique 12 3.1 Picture Partitioning 12 3.1.1 Coding Tree Unit (CTU) 13 3.1.2 From Coding Tree Unit (CTU) to Coding Unit (CU) 14 3.1.3 From Coding Unit (CU) to Prediction Unit (PU) 16 3.1.4 From Coding Unit (CU) to Transform Unit (TU) 16 3.1.5 Downsampling 4:2:0 17 3.1.6 Shape-adaptive 18 3.1.7 Summary 19 3.2 Intra Coding 19 3.2.1 Multiple directions supported 20 3.2.2 Most Probable Modes (MPMs) in Luma Block 22 3.2.3 Chroma from Luma 23 3.2.4 Summary 23 3.3 Inter Coding 24 3.3.1 Three types of frames: I, B, P 24 3.3.2 Motion Estimation – Motion Vector Representation 25 3.3.3 Motion Estimation – Block Matching Algorithm 27 3.3.4 Motion Compensation 29 3.4 RD cost optimization 30 3.5 Transform and Quantization 31 3.5.1 Karhunen-Loeve Transform (KLT) 32 3.5.2 Discrete Cosine Transform (DCT) 33 3.5.3 Quantization 34 3.6 Loop Filtering 34 3.6.1 Deblocking Filter (DBLK) 36 3.6.2 Sample Adaptive Offset (SAO) 36 3.6.3 Summary 39 3.7 Entropy Coding 40 3.7.1 Shannon’s source coding theorem 41 3.7.2 Context-based Adaptive Variable Length Coding (CAVLC) 42 3.7.3 Context-based Adaptive Binary Arithmetic Coding (CABAC) 44 3.7.4 Summary 45 Chapter 4 Selection of matching function 46 4.1 The problem -- Error matching 46 4.2 Experiment – Matching criterion 49 4.3 Result 51 4.4 Summary 51 Chapter 5 Proposed adaptive search range adjustment algorithm 53 5.1 Review: Block-based motion estimation 53 5.2 Related work 54 5.3 Proposed search range adjustment based on camera movement 55 5.4 Experiment 56 5.4.1 Dataset 56 5.4.2 Tweaking parameters 57 5.4.3 Evaluation 61 5.5 Result 62 5.6 Summary 63 Chapter 6 Proposed context-based adaptive arithmetic coding for motion vector difference 64 6.1 Related work 65 6.2 Proposed adaptive arithmetic coding (AAC) 66 6.3 Experiment 68 6.3.1 Zigzag scanning through video 68 6.3.2 Context modeling (CM) 70 6.3.3 Multiple context modeling by zero percentage 73 6.3.4 Reference distance (current, past1, past2) 77 6.3.5 Initial frequency table 80 6.3.6 Update frequency table 82 6.4 Result 84 6.5 Summary 87 Chapter 7 Conclusion and future work 88 7.1 Conclusion 88 7.2 Future work 89 REFERENCE 90
dc.language.iso	en
dc.title	可適性移動向量搜尋和編碼演算法	zh_TW
dc.title	Adaptive Motion Vector Searching and Encoding Algorithms for Video Compression	en
dc.type	Thesis
dc.date.schoolyear	108-2
dc.description.degree	碩士
dc.contributor.oralexamcommittee	歐陽良昱(Liang-Yu Ou Yang),許文良(Wun-Liang Hsu),劉俊麟(Zun-Lin Liu)
dc.subject.keyword	資料壓縮,視訊壓縮,幀間預測,運動估計,運動補償,搜尋範圍調整,物件追蹤,熵編碼,算數編碼,	zh_TW
dc.subject.keyword	Data compression,Video compression,Inter-frame prediction,Motion estimation,Motion compensation,Search window adjustment,Object tracking,Entropy coding,Arithmetic coding,	en
dc.relation.page	95
dc.identifier.doi	10.6342/NTU202001414
dc.rights.note	有償授權
dc.date.accepted	2020-07-13
dc.contributor.author-college	電機資訊學院	zh_TW
dc.contributor.author-dept	電信工程學研究所	zh_TW
顯示於系所單位：	電信工程學研究所

文件中的檔案：

檔案	大小	格式
U0001-0907202016240300.pdf 目前未授權公開取用	4.11 MB	Adobe PDF

顯示文件簡單紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。