基於距離影像的光達點雲幀間壓縮

劉佳昇; Chia-Sheng Liu

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/91929

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.advisor	李明穗徐宏民	zh_TW
dc.contributor.advisor	Ming-Sui LeeWinston H. Hsu	en
dc.contributor.author	劉佳昇	zh_TW
dc.contributor.author	Chia-Sheng Liu	en
dc.date.accessioned	2024-02-26T16:30:04Z	-
dc.date.available	2024-02-27	-
dc.date.copyright	2024-02-26	-
dc.date.issued	2022	-
dc.date.submitted	2002-01-01	-
dc.identifier.citation	E. Agustsson, D. Minnen, N. Johnston, J. Balle, S. J. Hwang, and G. Toderici. Scale-space flow for end-to-end optimized video compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020. J. Ballé, D. Minnen, S. Singh, S. J. Hwang, and N. Johnston. Variational image compression with a scale hyperprior. In International Conference on Learning Representations (ICLR), 2018. J. Behley, M. Garbade, A. Milioto, J. Quenzel, S. Behnke, C. Stachniss, and J. Gall. SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019. S.Biswas, J.Liu, K.Wong, S.Wang, and R.Urtasun. Muscle: Multi sweep compression of lidar using deep entropy models. Advances in Neural Information Processing Systems, 2020. Z. Cheng, H. Sun, M. Takeuchi, and J. Katto. Learned image compression with discretized gaussian mixture likelihoods and attention modules. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020. C. Fu, G. Li, R. Song, W. Gao, and S. Liu. Octattention: Octree-based largescale contexts model for point cloud compression. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2022. Google. Draco. https://github.com/google/draco. D. Graziosi, O. Nakagami, S. Kuma, A. Zaghetto, T. Suzuki, and A. Tabatabai. An overview of ongoing point cloud compression standardization activities: Video-based (v-pcc) and geometry-based (g-pcc). APSIPA Transactions on Signal and Information Processing, 9, 2020. D. He, Z. Yang, W. Peng, R. Ma, H. Qin, and Y. Wang. Elic: Efficient learned image compression with unevenly grouped space-channel contextual adaptive coding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022. L. Huang, S. Wang, K. Wong, J. Liu, and R. Urtasun. Octsqueeze: Octree-structured entropy model for lidar compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020. H. Jiang, D. Sun, V. Jampani, M.-H. Yang, E. Learned-Miller, and J. Kautz. Super slomo: High quality estimation of multiple intermediate frames for video interpolation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018. G. Lu, W. Ouyang, D. Xu, X. Zhang, C. Cai, and Z. Gao. Dvc: An end-to-end deep video compression framework. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019. B. Mersch, X. Chen, J. Behley, and C. Stachniss. Self-supervised Point Cloud Prediction Using 3D Spatiotemporal Convolutional Networks. In Proceedings of the Conference on Robot Learning (CoRL), 2021. D. Minnen, J. Ballé, and G. D. Toderici. Joint autoregressive and hierarchical priors for learned image compression. In Advances in Neural Information Processing Systems, 2018. J. Seward. bzip2. https://sourceware.org/bzip2. X. Sun, H. Ma, Y. Sun, and M. Liu. A novel point cloud compression algorithm based on clustering. IEEE Robotics and Automation Letters (RAL), 2019. X. Sun, S. Wang, M. Wang, Z. Wang, and M. Liu. A novel coding architecture for lidar point cloud sequence. IEEE Robotics and Automation Letters (RAL), 2020. S. Wang, J. Jiao, P. Cai, and L. Wang. R-PCC: A baseline for range image-based point cloud compression. In International Conference on Robotics and Automation (ICRA). IEEE, 2022. L. Zhao, X. Lin, W. Wang, K.K. Ma, and J. Chen. Rangeinet: Fast lidar point cloud temporal interpolation. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022. X. Zhou, C. R. Qi, Y. Zhou, and D. Anguelov. Riddle: Lidar data compression with range image deep delta encoding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.	-
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/91929	-
dc.description.abstract	光達傳感器所收集的大量資料也隨之產生了光達資料壓縮的問題。而基於距離影像的光達資料壓縮方法是一個有潛力的候選解決方案，但是在幀間編碼方面缺乏充分的探索。在我們的工作中，我們嘗試處理基於距離影像的光達傳感器資料幀間壓縮問題。我們的幀間壓縮框架遵循預測編碼的典範，該典範已在彩色影像編碼領域被採用。我們提出的框架包括一個預測模組和一個殘差編碼模組。對於預測模組，我們將時間序列中的前一幀和後一幀作為參考幀，並對將要編碼的幀進行幀級預測。至於殘差編碼模組，與以往先量化再進行無損編碼的方法不同，我們引入了一種基於學習的方法，可以進一步開發殘差幀內的空間冗餘性。在SemanticKITTI資料集上進行的實驗顯示出，在低碼率的條件之下，我們的方法優於其他基於距離影像的方法。	zh_TW
dc.description.abstract	The large amount of data collected by LiDAR sensors brings the issue of LiDAR data compression. Range image-based method for LiDAR data compression is a potential candidate for solution but lacks full exploration, especially for inter-frame coding. In our work, we address the problem of range image-based LiDAR sensor data inter-frame compression. Our inter-frame compression framework follows the predictive coding paradigm, which is adopted in the field of color video codec. Our proposed framework consists of a prediction module and a residual coding module. For the prediction module, we take the previous and next frame in time steps as reference frames, aiming to perform a frame-level prediction of the to-be-encoded frame. As for the residual coding module, different from previous methods that first apply quantization followed by a lossless coder, we introduce a learning-based approach that can further exploit spatial redundancy within the residual frame. Experiments conducted on the SemanticKITTI dataset demonstrate that our method outperforms other range image-based methods, especially at low bitrate conditions.	en
dc.description.provenance	Submitted by admin ntu (admin@lib.ntu.edu.tw) on 2024-02-26T16:30:04Z No. of bitstreams: 0	en
dc.description.provenance	Made available in DSpace on 2024-02-26T16:30:04Z (GMT). No. of bitstreams: 0	en
dc.description.tableofcontents	Chapter 1 Introduction 1 Chapter 2 Related Work 5 2.1 Treebased LiDAR Point Clouds Compression 5 2.2 NeuralImageCompression 6 Chapter 3 Methodology 7 3.1 Overview 7 3.2 Interframe Prediction 8 3.3 Residual Frame Coding 9 3.3.1 Handcrafted Method 10 3.3.2 Deeplearned Method 10 Chapter 4 Experimental Results 13 4.1 Settings 13 4.1.1 Dataset 13 4.1.2 Evaluation Metrics 13 4.1.3 Baselines 14 4.2 Rate-distortion Results 14 4.3 Decoded Point Clouds Visualization 14 4.4 Ablation Study 15 4.4.1 Choice of Prediction Module 16 4.4.2 ChoiceofResidualCodingModule 16 Chapter 5 Conclusions 19 References 21	-
dc.language.iso	zh_TW	-
dc.title	基於距離影像的光達點雲幀間壓縮	zh_TW
dc.title	Range Image-based Inter-frame Compression for LiDAR Point Clouds	en
dc.type	Thesis	-
dc.date.schoolyear	110-2	-
dc.description.degree	碩士	-
dc.contributor.oralexamcommittee	陳文進;葉梅珍;陳奕廷	zh_TW
dc.contributor.oralexamcommittee	Wen-Chin Chen;Mei-Chen Yeh;Yi-Ting Chen	en
dc.subject.keyword	壓縮,光達,點雲,距離影像,	zh_TW
dc.subject.keyword	Compression,LiDAR,Point Clouds,Range Image,	en
dc.relation.page	30	-
dc.identifier.doi	10.6342/NTU202204094	-
dc.rights.note	同意授權(限校園內公開)	-
dc.date.accepted	2022-09-28	-
dc.contributor.author-college	電機資訊學院	-
dc.contributor.author-dept	資訊工程學系	-
顯示於系所單位：	資訊工程學系

文件中的檔案：

檔案	大小	格式
ntu-110-2.pdf 授權僅限NTU校內IP使用（校園外請利用VPN校外連線服務）	3.85 MB	Adobe PDF	檢視/開啟

顯示文件簡單紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。