Implementation and Improvement of Audio Watermarking Using MCLT

Chih-Kai Yu; 游智凱

Please use this Handle URI to cite this document: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/50765

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	張智星(Jyh-Shing Roger Jang)
dc.contributor.author	Chih-Kai Yu	en
dc.contributor.author	游智凱	zh_TW
dc.date.accessioned	2021-06-15T12:57:04Z	-
dc.date.available	2016-07-26
dc.date.copyright	2016-07-26
dc.date.issued	2016
dc.date.submitted	2016-07-14
dc.identifier.citation	[1] “Acoustic Data Transmission Based on Modulated Complex Lapped Transform”, Hwan Sik Yun, Kiho Cho, and Nam Soo Kim, IEEE Signal Processing Letters, Vol. 17, No. 1, pages 67–70, January 2010. [2] “Imperceptible Data Hiding in MCLT Domain for Acoustic Data Transmission Using Loudspeaker and Microphone”, Kiho Cho, Hwan Sik Yun, Sung Soo Kim and Nam Soo Kim, Proceedings of 2009 APSIPA Annual Summit and Conference, Sapporo, Japan, pages 238–241, October 2009. [3] “Robust Data Hiding for MCLT Based Acoustic Data Transmission”, Kiho Cho, Hwan Sik Yun, and Nam Soo Kim, IEEE Signal Processing Letters, Vol. 17, No. 7, pages 679–682, July 2010. [4] “A Modulated Complex Lapped Transform and its Applications to Audio Processing”, Henrique Malvar, IEEE International Conference on Acoustics, Speech, and Signal Processing, Phoenix, AZ, Vol. 3, pages 1421–1424, March 1999. [5] “Fast Algorithm for the Modulated Complex Lapped Transform”, Henrique S. Malvar, IEEE Signal Processing Letters, Vol. 10, No. 1, pages 8–10, January 2003. [6] “Overview of Audio Watermarking Techniques”, Ms. Komal V. Goenka, Ms. Pallavi K. Patil, International Journal of Emerging Technology and Advanced Engineering, Vol. 2, No. 2, February 2012. [7] “Principles of Audio Watermarking”, M. Hrnčár, J. Krajčovič, Advances in Electrical and Electronic Engineering, Vol. 7, No. 1–2, pages 247–249, March – June 2008. [8] “Spread-Spectrum Watermarking of Audio Signals”, Darko Kirovski and Henrique S. Malvar, IEEE Transactions on Signal Processing, Vol. 51, No. 4, pages 1020–1033, April 2003. [9] “Spectral Magnitude Adjusted Data Hiding in MCLT Domain of Audio Signal for Robust Acoustic Data Transmission”, Kiho Cho, Jae Choi, Hwan Sik Yun and Nam Soo Kim, APSIPA ASC 2011 Xi’an, October 2011. [10] “Techniques for Data Hiding”, W. Bender, D. Gruhl, N. Morimoto, A. Lu, IBM Systems Journal, Vol. 35, No. 3–4, pages 313–336, 1996. [11] “Pushing Information Over Acoustic Channels”, Po-Wei Chen, Chun-Hsiang Huang, Yun-Chung Shen, Ja-Ling Wu, IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1421–1424, April 2009. [12] “Information Delivery Systems for Car Passengers Without Networking Capabilities”, Chun-Hsiang Huang, Po-Wei Chen, Ping-Yen Hsieh, and Ja-Ling Wu, Advances in Multimedia Information Processing – PCM 2007, Vol. 4810, pages 539–548, December 2007. [13] “An Initial Study on Double-DCT-Based Data Hiding in Audio Signals”, Po-Yuan Pai, July 2014. [14] “Machine Learning Toolbox”, Jyh-Shing Roger Jang, available at “http://mirlab.org/jang/matlab/toolbox/machineLearning”, accessed on July 2016. [15] “Utility Toolbox”, Jyh-Shing Roger Jang, available from the link at the author’s homepage at “http://mirlab.org/jang”. [16] “Speech and Audio Processing (SAP) Toolbox”, Jyh-Shing Roger Jang, available at “http://mirlab.org/jang/matlab/toolbox/sap”. [17] “MDCT Matlab code”, available at mdct and imdct. “http://www.ee.columbia.edu/~marios/mdct/mdct_giraffe.html”. [18] “Echo Hiding”, Daniel Gruhl, Anthony Lu, Walter Bender, International Workshop on Information Hiding, pages 295–315, May 1996. [19] “Audio Watermarking of Stereo Signals Based on Echo-Hiding Method”, Foo Say Wei, Dong Qi, Information, Communications and Signal Processing, ICICS 2009, pages 1–4, December 2009.
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/50765	-
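dc.description.abstract	Audio watermarking was first used for copyright management, that is, for identifying the ownership of audio. With the spread of smartphones and tablets and advances in computing performance, audio watermarks can now be used to transmit information. The greatest advantage of this technique is its simple hardware requirement: only a loudspeaker and a microphone are needed. This thesis implements audio watermarking with the MCLT (Modulated Complex Lapped Transform) and hides information by changing the phase of the MCLT coefficients. The principle is that humans are relatively insensitive to phase changes, so the audio carrying hidden information sounds no different from the original. In addition, the MCLT does not produce blocking artifacts, so better audio quality can be obtained. Audio watermarks are very easily disturbed; even the direction of the microphone affects the recognition rate. Using the MCLT for audio watermarking has two main problems. First, some audio has very low energy, and data cannot be extracted from the low-energy parts. Second, under various kinds of interference the phase of the MCLT coefficients rotates, causing errors during data extraction. For the first problem, this thesis proposes mixing the energy of a specific frequency band of white noise into the original audio so that the low-energy parts gain some energy. The second problem is addressed with data clustering: this thesis tries changing the initial centers of K-means clustering to obtain an improvement. In the experiments, the variables are whether white noise is added, distance, angle, microphone direction, music genre, and segment size, in an attempt to simulate practical use. We recorded a large number of music signals and fed them to the proposed system for recognition, and found that audio with added white-noise energy achieves a very large improvement in recognition rate.	zh_TW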
dc.description.abstract	Audio watermarking was originally used for digital rights management (DRM). With the growing popularity and computing power of smartphones and tablets, audio watermarks can now also be used to transmit information. The main advantage of this approach is its minimal hardware requirement: only a loudspeaker and a microphone. In this thesis, we implement an audio watermarking system using the MCLT (Modulated Complex Lapped Transform) and embed data by modifying the phase of the MCLT coefficients. Because human hearing is relatively insensitive to phase changes, the watermarked signal is hard to distinguish from the original. The MCLT also does not produce blocking artifacts, which yields better audio quality. Audio watermarks are highly sensitive to acoustic interference; even the direction of the microphone affects accuracy. Using the MCLT for audio watermarking raises two main problems. First, some audio signals have low-energy portions from which data cannot be extracted. Second, acoustic interference rotates the MCLT coefficients, which causes data extraction errors. For the first problem, we mix a specific frequency band of white noise into the audio signal to raise the energy of the weak parts. For the second problem, we use K-means clustering and also alter its initial centers to improve the result. In our experiments, the independent variables are whether white noise is mixed in, distance, angle, microphone direction, music genre, and segment size, and we test many combinations to simulate practical conditions. We recorded many audio signals and decoded them with the proposed system. The results show that adding white-noise energy to the audio signal greatly improves accuracy.	en
dc.description.provenance	Made available in DSpace on 2021-06-15T12:57:04Z (GMT). No. of bitstreams: 1 ntu-105-R03922128-1.pdf: 5442064 bytes, checksum: 8a25b07b7d964a45f11ba9e531eccb82 (MD5) Previous issue date: 2016	en
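dc.description.tableofcontents	Abstract (Chinese); Abstract; Acknowledgements; Contents; List of Tables; List of Figures; 1 Introduction; 1.1 Motivation; 1.2 Objectives; 1.3 Chapter Overview; 2 Related Work; 2.1 Audio Watermarking Methods; 2.1.1 Echo Hiding; 2.1.2 Phase Coding; 2.1.3 Low-Bit Coding; 2.1.4 Spread Spectrum; 2.2 Commercial Products; 2.2.1 Intrasonics; 2.2.2 Field System – Soundcode; 3 Implementation; 3.1 Overview; 3.2 Modulated Complex Lapped Transform; 3.2.1 Introduction; 3.2.2 Analysis and Synthesis; 3.2.3 Interference; 3.3 Embedding; 3.3.1 Data; 3.3.2 Synchronization; 3.4 Extraction; 3.4.1 Finding Synchronization; 3.4.2 Extracting Data; 4 Improvements; 4.1 Improvement 1: Adding White Noise; 4.1.1 Rationale; 4.1.2 Method; 4.2 Improvement 2: Data Clustering; 4.2.1 Rationale; 4.2.2 Method; 4.3 Analysis of Experimental Results; 5 Conclusions and Future Work; 5.1 Conclusions; 5.1.1 Implemented System; 5.1.2 Improved System; 5.2 Future Work; Appendix A; A.1 Introduction; A.2 Experiments; A.2.1 Parameter Settings; A.2.2 Experimental Data; A.3 Conclusion; References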
dc.language.iso	zh-TW
dc.title	使用MCLT於音訊浮水印之實作與改進	zh_TW
dc.title	Implementation and Improvement of Audio Watermarking Using MCLT	en
dc.type	Thesis
dc.date.schoolyear	104-2
dc.description.degree	Master's
dc.contributor.oralexamcommittee	吳家麟(Ja-Ling Wu),林裕訓
dc.subject.keyword	音訊浮水印,modulated complex lapped transform,資料隱藏,	zh_TW
dc.subject.keyword	audio watermark,modulated complex lapped transform,data hiding,	en
dc.relation.page	65
dc.identifier.doi	10.6342/NTU201600807
dc.rights.note	Licensed for a fee
dc.date.accepted	2016-07-14
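dc.contributor.author-college	College of Electrical Engineering and Computer Science	zh_TW
dc.contributor.author-dept	Graduate Institute of Computer Science and Information Engineering	zh_TW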
Appears in Collections:	Department of Computer Science and Information Engineering
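
The abstract describes hiding bits in the phase of MCLT coefficients and using K-means clustering to cope with phase rotation caused by acoustic interference. A minimal Python sketch of that idea follows. It skips the MCLT itself and works on stand-in complex coefficients. The 0/π phase mapping, the known pilot bits used to seed the initial cluster centers, the channel model (one common rotation plus noise), and the names `embed_bits` and `kmeans_decode` are all illustrative assumptions, not the thesis's actual method.

```python
import numpy as np

def embed_bits(magnitudes, bits):
    """Keep each coefficient's magnitude; set its phase to 0 (bit 0) or pi (bit 1)."""
    phases = np.pi * np.asarray(bits)
    return np.asarray(magnitudes) * np.exp(1j * phases)

def kmeans_decode(coeffs, pilot_bits, iters=20):
    """Two-cluster K-means on unit phasors, seeded from known pilot bits (assumption)."""
    pts = coeffs / np.abs(coeffs)                 # discard magnitude, keep phase
    pilot_bits = np.asarray(pilot_bits)
    pilots = pts[:len(pilot_bits)]
    # Initial centers: mean phasor of the pilots carrying 0 and of those carrying 1.
    centers = np.array([pilots[pilot_bits == b].mean() for b in (0, 1)])
    for _ in range(iters):
        # Assign each point to the nearest center in the complex plane.
        labels = np.argmin(np.abs(pts[:, None] - centers[None, :]), axis=1)
        for b in (0, 1):
            if np.any(labels == b):
                centers[b] = pts[labels == b].mean()
    return labels

rng = np.random.default_rng(0)
pilot_bits = np.array([0, 1, 0, 1, 0, 1, 0, 1])   # hypothetical sync/pilot pattern
payload = rng.integers(0, 2, size=200)
bits = np.concatenate([pilot_bits, payload])
mags = rng.uniform(0.5, 2.0, size=bits.size)      # stand-in coefficient magnitudes
sent = embed_bits(mags, bits)

# Assumed channel: a common phase rotation plus small complex noise.
noise = 0.1 * (rng.standard_normal(bits.size) + 1j * rng.standard_normal(bits.size))
received = sent * np.exp(1j * np.deg2rad(110)) + noise

naive = (np.real(received) < 0).astype(int)       # fixed threshold, ignores rotation
decoded = kmeans_decode(received, pilot_bits)
naive_errors = int(np.sum(naive != bits))
kmeans_errors = int(np.sum(decoded != bits))
print(naive_errors, kmeans_errors)
```

In this sketch the fixed threshold fails because the rotation moves both phase clusters across the decision boundary, while the clustering step follows the clusters wherever they land. Seeding the centers from pilot bits is one way to decide which cluster represents which bit value.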

Files in This Item:

File	Size	Format
ntu-105-1.pdf (not currently authorized for public access)	5.31 MB	Adobe PDF


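Items in this system are protected by copyright, with all rights reserved, unless their copyright terms are specifically indicated otherwise.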