Indoor Autonomous Navigation Drone System with Semantic-SLAM based on SD-DETR

CHENG-WEI HUANG; 黃政維

Please use this Handle URI to cite this document: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/86251

Title:	室內自動導航無人機系統之同步定位、地圖構建與影像物件偵測 (Indoor Autonomous Navigation Drone System with Semantic-SLAM based on SD-DETR)
Author:	CHENG-WEI HUANG (黃政維)
Advisor:	Chuin-Shan Chen (陳俊杉)
Keywords:	Autonomous Indoor MAV, Deep Learning, Computer Vision, Sequential Model, Semantic Segmentation, Vision Transformer, Semantic SLAM
Year of publication:	2022
Degree:	Master's
Abstract:	Over the past few years, multirotor drones have come into wide use in both industry and academia. Industrial applications include cargo delivery, agricultural irrigation, and police patrol, while researchers use drones for automated data collection, construction-site map reconstruction, traffic flow monitoring, and disaster rescue. Although these applications demonstrate the mobility of drones, most autonomous flight today relies on the Global Positioning System for accurate positioning. When the GPS signal is poor, or when the drone is indoors, autonomous navigation fails. In addition, flying a drone in an unknown environment usually requires a professional pilot to keep the drone from crashing. In this research, we propose a complete system that lets a drone quickly build an obstacle map with six degrees of freedom. Map construction relies mainly on color images, depth images, and camera odometry. Using the obstacle map and the odometry, our drone can navigate indoors autonomously. After the drone has explored the indoor space, our system uses the collected data to build a 3D map that carries semantic information. This research proposes a new deep model, based on the Transformer architecture, that performs semantic segmentation on a sequence of images. We modified the attention mechanism of the conventional Transformer so that it can handle sequential data, which is more computationally demanding. We then reproject the semantically segmented data to build the 3D map. In our validation comparison, our model improves accuracy over other models on similar tasks.
Micro Aerial Vehicles (MAVs) have come into use across different industries, where companies employ them for merchandise delivery, agricultural spraying, and police patrol. MAVs have also gained academic attention, and researchers have used them to collect data for construction-site map generation, traffic flow monitoring, and disaster rescue. While these applications show that an MAV can travel remotely and unmanned, most of them rely heavily on the Global Positioning System (GPS) for accurate position information, which is usually unavailable in an unstructured, crowded indoor environment. Furthermore, an MAV placed in an unknown environment usually has to be controlled by a well-trained professional, because without an environment map autonomous navigation may fail. In this research, we developed an MAV system that builds a 6-DoF obstacle map by processing sensor data from RGB-D and odometry cameras, enabling the drone to navigate autonomously in an indoor environment. After the MAV has navigated through an unknown environment, it is important for it to generate a semantic 3D map, which not only helps the user investigate the environment but also enables the MAV to carry out high-level navigation tasks. To generate the semantic map, we developed Sequential-DDETR, a new model based on the Transformer architecture that processes sequential data. Sequential-DDETR is an end-to-end model that generates segmentation masks for a sequence of images. We used deformable attention, which reduces computation significantly compared with the traditional Transformer. Sequential-DDETR computes attention features across different frames to enhance semantic segmentation performance on sequential images. We also used the depth image to back-project the sequence of semantic segmentation masks and build a Semantic Simultaneous Localization and Mapping (Semantic-SLAM) map. Our results show that our model performs better at building Semantic-SLAM than the other methods we compared.
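The back-projection step mentioned in the abstract can be illustrated with a minimal sketch. It assumes a standard pinhole camera model; the function name `back_project`, the intrinsics `fx`, `fy`, `cx`, `cy`, and the optional `pose` argument are hypothetical and are not taken from the thesis, whose actual implementation may differ.

```python
from typing import List, Optional, Sequence, Tuple

# One labeled 3D point: (x, y, z, class_id)
Point = Tuple[float, float, float, int]


def back_project(
    depth: Sequence[Sequence[float]],
    labels: Sequence[Sequence[int]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
    pose: Optional[Tuple[Sequence[Sequence[float]], Sequence[float]]] = None,
) -> List[Point]:
    """Lift each labeled pixel to 3D with an assumed pinhole camera model.

    depth  : per-pixel depth along the optical axis (0 or less means no reading)
    labels : per-pixel semantic class ids, same shape as depth
    pose   : optional (R, t), a 3x3 rotation and a translation taken from
             odometry, mapping camera coordinates to map coordinates
    """
    points: List[Point] = []
    for v, (depth_row, label_row) in enumerate(zip(depth, labels)):
        for u, (z, cls) in enumerate(zip(depth_row, label_row)):
            if z <= 0:
                continue  # skip pixels without a valid depth reading
            # Invert the pinhole projection u = fx * x / z + cx (same for v)
            p = ((u - cx) * z / fx, (v - cy) * z / fy, z)
            if pose is not None:
                R, t = pose
                # Rigid transform into the map frame: p_map = R @ p + t
                p = tuple(
                    sum(R[i][j] * p[j] for j in range(3)) + t[i]
                    for i in range(3)
                )
            points.append((p[0], p[1], p[2], cls))
    return points
```

Accumulating the returned points over all frames of a sequence, each with its own odometry pose, would give a labeled point cloud of the kind a semantic 3D map is built from.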
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/86251
DOI:	10.6342/NTU202202930
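Full-text authorization:	Authorized (publicly available worldwide)
Full-text release date:	2022-08-31
Appears in collections:	Department of Civil Engineering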

Files in this document:

File	Size	Format
U0001-2908202214512900.pdf	5.66 MB	Adobe PDF


Unless their copyright terms are specifically indicated, all items in this system are protected by copyright, with all rights reserved.
