請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/40596完整後設資料紀錄
| DC 欄位 | 值 | 語言 |
|---|---|---|
| dc.contributor.advisor | 王傑智(Chieh-Chih Wang) | |
| dc.contributor.author | Chi-Hao Lin | en |
| dc.contributor.author | 林祺豪 | zh_TW |
| dc.date.accessioned | 2021-06-14T16:52:38Z | - |
| dc.date.available | 2008-08-05 | |
| dc.date.copyright | 2008-08-05 | |
| dc.date.issued | 2008 | |
| dc.date.submitted | 2008-07-29 | |
| dc.identifier.citation | Birchfield, S. T. & Gangishetty, R. (2005). Acoustic localization by interaural level difference. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP).
Cui, W., Cao, Z., & Wei, J. (2006). Dual-microphone source location method in 2-d space. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Hu, J.-S., Cheng, C.-C., & Liu,W.-H. (2006). Robust speaker’s location detection in a vehicle environment using gmmmodels. IEEE Transactions on Systems,Man and Cybernetics - PartB: Cybernetics, 36(2), 403–412. Knapp, C. H. & Carter, G. C. (1976). The generalized correlation method for estimation of time delay. IEEE Trans. Acoust., Speech, Signal Processing, 24, 320–327. Nakashima, H. & Mukai, T. (2005). 3d sound source localization system based on learning of binaural hearing. In IEEE International Conference on Systems, Man and Cybernetics. Thrun, S. (2005). Affine structure from sound. In Proceedings of Conference on Neural Information Processing Systems (NIPS), Cambridge, MA. MIT Press. Tomasi, C. & Kanade, T. (1992). Shape andmotion fromimage streams under orthography: a factorization method. International Journal of Computer Vision, 9(2), 137–154. | |
| dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/40596 | - |
| dc.description.abstract | 在機器人學裡,聲音感知是一個非常重要的功能。麥克風陣列在聲音感知的應用中被廣泛的使用,而在這些應用裡,麥克風在空間的座標通常是已知的。聲音估計結構(Structure from Sound)演算法提供了一個富有彈性之方法來校正不同結構的麥克風陣列,它能同時地定位多個麥克風與定位多個聲源。然而,在現存的演算法裡並沒有將量測的不確定性納入考量,也沒有提供音聲估計結構演算法之結果的不確定性估測。在這篇論文裡,我們提出了一個機率型聲音估計結構演算法(Probabilistic Structure from Sound)。此外,我們提出了一個機率型聲源定位演算法(Probabilistic Sound Source Localization),此演算法是使用機率型聲音估計結構演算法的結果來改進聲源定位的準確性。我們使用低成本的麥克風。大量的模擬與實驗結果成功的展示了機率型聲音估計結構演算法與機率型聲源定位演算法之成果。 | zh_TW |
| dc.description.abstract | Auditory perception is one of the most important functions for robotics applications. Microphone arrays are widely used for auditory perception in which the spatial structure of microphones is usually known. The thesis first describes the affine Structure from Sound (SFS) algorithm. The structure from sound is a problem to simultaneously localize microphones and sound sources. However, the existing method does not take measurement uncertainty into account and does not provide uncertainty estimates of the SFS results. In this thesis, we propose a probabilistic structure from sound (PSFS) approach using the unscented transform. The PSFS algorithm not only localizes microphones and sound sources but also estimates the uncertainties of the SFS results. In addition, a probabilistic sound source localization (PSSL) approach using the PSFS results is provided to improve sound source localization accuracy. The ample results of simulation and experiments using low cost, off-the-shell microphones demonstrate the feasibility and performance of the proposed PSFS and PSSL approaches. | en |
| dc.description.provenance | Made available in DSpace on 2021-06-14T16:52:38Z (GMT). No. of bitstreams: 1 ntu-97-R94922124-1.pdf: 4017230 bytes, checksum: 27e56cfbebd70979fa29db7573a14953 (MD5) Previous issue date: 2008 | en |
| dc.description.tableofcontents | ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ii
LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v CHAPTER 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 CHAPTER 2. Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 CHAPTER 3. Sound Source Localization . . . . . . . . . . . . . . . . . . . . . . . . . 5 3.1. TDOA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 3.2. GCC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 3.3. Sound Source Localization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 CHAPTER 4. Affine Structure from Sound . . . . . . . . . . . . . . . . . . . . . . . . 9 4.1. Problem Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 4.2. Data Measurement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 4.3. Least square solution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 4.4. The Far Field Approximation . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 4.5. Affine Solution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 4.6. Recovering the sound source locations . . . . . . . . . . . . . . . . . . . . . . 12 CHAPTER 5. Probabilistic Structure from Sound . . . . . . . . . . . . . . . . . . . . 17 5.1. The Unscented Transform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 5.2. Dealing with Uncertainties . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 5.2.1. Gaussian Mean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 5.2.2. Gaussian Covariance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 5.2.3. Uncertainties Estimation using Unscented Transform . . . . . . . . . . . 19 5.3. Dealing with Axis Inconsistency . . . . . . . . . . . . . . . . . . . . . . . . . 19 CHAPTER 6. Probabilistic Sound Source Localization . . . . . . . . . . . . . . . . . 23 6.1. Gaussian Mean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 6.2. Gaussian Covariance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 6.3. Probabilistic Sound Source Localization . . . . . . . . . . . . . . . . . . . . . 24 CHAPTER 7. Experimental Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 7.1. TDOA Estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 7.2. PSFS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 7.2.1. The Near Field Condition . . . . . . . . . . . . . . . . . . . . . . . . . . . 27 7.2.2. The Far Field Condition . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 7.2.3. The Echoes Condition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 7.2.4. The Non Affine Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 7.3. The PSSL Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 CHAPTER 8. Conclusion and FutureWork . . . . . . . . . . . . . . . . . . . . . . . 34 BIBLIOGRAPHY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 | |
| dc.language.iso | en | |
| dc.subject | 聲源定位 | zh_TW |
| dc.subject | 麥克風陣列 | zh_TW |
| dc.subject | 聲音估計結構 | zh_TW |
| dc.subject | structure from sound | en |
| dc.subject | sound source localization | en |
| dc.subject | microphone arrays | en |
| dc.title | 機率型麥克風陣列校正與聲源定位系統 | zh_TW |
| dc.title | Probabilistic Structure from Sound and Sound Source Localization | en |
| dc.type | Thesis | |
| dc.date.schoolyear | 96-2 | |
| dc.description.degree | 碩士 | |
| dc.contributor.oralexamcommittee | 郭振華(Jen-Hwa Guo),黃寶儀(Polly Huang),胡竹生(Jwu-Sheng Hu) | |
| dc.subject.keyword | 麥克風陣列,聲音估計結構,聲源定位, | zh_TW |
| dc.subject.keyword | microphone arrays,structure from sound,sound source localization, | en |
| dc.relation.page | 33 | |
| dc.rights.note | 有償授權 | |
| dc.date.accepted | 2008-07-31 | |
| dc.contributor.author-college | 電機資訊學院 | zh_TW |
| dc.contributor.author-dept | 資訊工程學研究所 | zh_TW |
| 顯示於系所單位: | 資訊工程學系 | |
文件中的檔案:
| 檔案 | 大小 | 格式 | |
|---|---|---|---|
| ntu-97-1.pdf 未授權公開取用 | 3.92 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。
