壓縮感測的時頻分析方法

Chun-Kai Wang; 王俊凱

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/723

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.advisor	貝蘇章
dc.contributor.author	Chun-Kai Wang	en
dc.contributor.author	王俊凱	zh_TW
dc.date.accessioned	2021-05-11T05:00:07Z	-
dc.date.available	2019-08-06
dc.date.available	2021-05-11T05:00:07Z	-
dc.date.copyright	2019-08-06
dc.date.issued	2019
dc.date.submitted	2019-07-31
dc.identifier.citation	[1] E. Candès and M. Wakin. “An introduction to compressive sampling.” IEEE Signal Processing Magazine, vol. 25, no. 2, pp. 21-30, Mar. 2008. [2] D.L. Donoho and X. Huo, 'Uncertainty principles and ideal atomic decomposition,' IEEE Transaction on Information Theory, vol. 47, no. 7, pp. 2845-2862, Nov. 2001. [3] R. Coifman, F. Geshwind, and Y. Meyer, “Noiselets,” Applied and Computational Harmonic Analysis, vol. 10, no. 1, pp. 27-44, 2001. [4] J.F. Claerbout and F. Muir, “Robust modeling with erratic data,” Geophysics Magazine, vol. 38, no. 5, pp. 826-844, Oct. 1973. [5] D.L. Donoho, “Compressive sensing,” IEEE Transaction on Information Theory, vol. 52, no. 4, pp. 1289-1306, Apr. 2006. [6] E. Candès and J. Romberg, “Sparsity and incoherence in compressive sampling,” Inverse Problems, vol. 23, no.3, pp. 969-985, 2007. [7] E. Candès and T. Tao, “Decoding by linear programming,” IEEE Transaction on Information Theory, vol. 51, no. 12, pp. 4203-4215, Dec.2005. [8] E. Candès, J. Romberg, and T. Tao, “Stable signal recovery from incomplete and inaccurate measurements,” Communications on pure and applied mathematics, vol. 59, no. 8, pp. 1207-1223, March. 2006. [9] S.G. Mallat and Z. Zhang, “Matching pursuit with time-frequency dictionaries,” IEEE Transaction on Signal Processing, vol. 41, no. 12, pp. 3397-3415, Dec. 1993. [10] S. Qian and D. Chen, “Signal representation using adaptive normalized Gaussian functions,” Signal Processing, vol. 36, no. 1, pp. 1-11, 1994. [11] L.F. Villemoes, “Best approximation with Walsh atoms,” Constructive Approximation, vol. 13, no. 3, pp. 329-355, Sep. 1997. [12] Y.C. Pati, R. Rezaiifar and P.S. Krishnaprasad, “Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition,” Proceedings of 27th Asilomar Conference on Signals, Systems and Computers, IEEE Computer Society Press, CA, USA, Nov. 1993. [13] J.A. Tropp and A.C. Gilbert, “Signal recovery from random measurements via orthogonal matching pursuit,” IEEE Transaction on Information Theory, vol. 53, no. 12, pp. 4655-4666, 2007 [14] S. Kunis and H. Rauhut, “Random sampling of sparse trigonometric polynomials, II - orthogonal matching pursuit versus basis pursuit,” Foundations of Computational Mathematics, vol. 8, no. 6, pp. 737-763, Dec. 2008. [15] T.T. Cai and L. Wang, “Orthogonal matching pursuit for sparse signal recovery with noise,” IEEE Transaction on Information Theory, vol. 57, no. 7, pp. 4680-4688, Jul. 2011. [16] S. Kwon, J. Wang and B. Shim, “Multipath matching pursuit,” IEEE Transaction on Information Theory, vol. 60, no. 5, pp. 2986-3001, Mar. 2014. [17] D. Needell and J. A. Tropp, “CoSaMP: Iterative signal recovery from incomplete and inaccurate samples,” Applied and Computational Harmonic Analysis, vol. 26, no. 3, pp. 301–321, May. 2009. [18] J. Wang, S. Kwon and B. Shim, “Generalized orthogonal matching pursuit,” IEEE Transaction on Signal Processing, vol. 60, no. 12, pp. 6202-6216, Sep. 2012. [19] D.L. Donoho, Y. Tsaig, I. Drori and J.L. Starck, “Sparse solution of underdetermined linear equations by stagewise orthogonal matching pursuit,” IEEE Transaction on Information Theory, vol. 58, no. 5, pp. 1094-1121, Feb. 2012. [20] S.S. Chen, D.L. Donoho and M.A. Saunders, “Atomic decomposition by basis pursuit,” SIAM Journal on Scientific Computing, pp. 33-61, vol. 20, no.1, Aug. 1998. [21] P. Bloomfield and W. Steiger, Least Absolute Deviations: Theory, Applications, and Algorithms, Birkhäuser, Boston, 1983. [22] P.E. Gill, W. Murray and M.H. Wright, Numerical linear algebra optimization, Addison-Wesley, Redwood City, CA, 1991. [23] I. Daubechies, “Time-frequency localization operators: a geometric phase space approach,” IEEE Transaction on Information Theory, vol. 34, no. 4, pp. 605-612, Jul. 1988. [24] R.R. Coifman and M.V. Wickerhauser, “Entropy-based algorithms for best-basis selection,” IEEE Transaction on Information Theory, vol. 38, no. 2, pp. 713-718, Mar. 1992. [25] L.J. Rudin, S. Osher and E. Fatemi, “Nonlinear total-variation-based noise removal algorithms,” Physica D: nonlinear phenomena, vol. 60, pp. 259-268, Nov. 1992. [26] A. Bultan, “A four-parameter atomic decomposition of chirplets,” IEEE Transaction on Signal Processing, vol. 47, no. 3, pp. 731-745, Mar. 1999. [27] H. Zhu, S.N. Zhang and H.C. Zhao, “Single-channel source separation of radar fuze mixed signal using advanced adaptive decomposition,” Acta Physica Sinica, vol. 63, no. 5, 058401, 2014. [28] Y. Zhou, X. Wang, Y. Tian and D. Zhou, “A novel time-frequency atomic dictionary for radar intra-pulse modulation signal sparse representation,” Asia-Pacific Microwave Conference (APMC), Dec. 6-9, 2015. [29] H. Zou, Q. Dai, R. Wang and Y. Li, “Parametric TFR via windowed exponential frequency modulated atoms,” IEEE Transaction on Signal Processing, vol. 8, no. 5, pp. 140-142, May. 2001. [30] A. Haar, “Zur Theorie der orthogonalen Funktionensysteme,” Mathematische Annalen, vol. 69, no. 3, pp. 331-371, Sep. 1910. [31] S.C. Pei and J.J. Ding, “Relations between Gabor transforms and fractional Fourier transforms and their applications for signal processing,” IEEE Transaction on Signal Processing, vol. 55, no. 10, pp. 4839-4850, Oct. 2007. [32] J.J. Ding, S.C. Pei and T.Y. Ko, “Higher order modulation and the efficient sampling algorithm for time variant signal,” Proceedings of the 20th European Signal Processing Conference, EURASIP, Bucharest, Romania, Aug. 2012. [33] D. Ellis (2010). mp3read and mp3write for Matlab. Retrieved Apr. 2019, from Columbia University, Electrical Engineering. Website: http://www.ee.columbia.edu/~dpwe/resources/matlab/mp3read.html [34] D. Ellis (2011). M4A (AAC) Compressed Audio File Reading. Retrieved Apr. 2019, from Columbia University, Electrical Engineering. Website: http://www.ee.columbia.edu/~dpwe/resources/matlab/m4aread/ [35] FindSounds.com. Retrieved Apr. 2019, from http://www.findsounds.com/types.html.
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/handle/123456789/723	-
dc.description.abstract	由於硬體方面的快速發展，運算資源相較於數十年前更加容易取得。近年來，壓縮感測藉由運算速度的提升拓展了我們的視野，與限制於著名的夏農取樣定理(Shannon’s sampling theorem) 的傳統取樣方法有所不同。壓縮感測利用了訊號的稀疏性來達成突破傳統取樣率的限制，讓我們認為也可以利用其他方面的特性來達成同樣的效果。因此，我們嘗試使用在信號處理領域常見的時頻分析工具來做為突破點。眾所周知的是，一個訊號的最低取樣點數限制與時頻分析圖上的面積有著正相關，而這正是我們要用來設計壓縮演算法的關鍵概念。在這篇碩士論文中，我們將會運用時頻分析來實作對於壓縮聲音訊號的應用。不同於廣泛出現在生活中的MP3與M4A壓縮演算法，被捨棄的資料並非由人類的聽覺範圍決定，取而代之的是時頻分析圖上小於臨界值的點或是面積較小的區塊。時頻分析的結果會被分為各個不同的區塊，作為初步的切割結果。接著，我們將切割完的時頻分析做時頻重配(time-frequency reassignment)，運用提出的預切法(pre-cut scheme)、間隙連接法(gap connection scheme)、頭尾法(head and tail scheme)以及頻寬估計(fixed bandwidth estimation)，因而得到更進一步的信號成分分割結果。我們的下一步為近似信號成分的分割結果。對於每個信號成分，我們使用一般化調變(generalized modulation)來進行降頻並且降低單一成分的最大頻寬。接著，我們使用兩種方法來對調變過後的信號成分近似並壓縮，分別為降採樣法(the downsampling method)及勒壤得多項式法(the Legendre polynomial method)。降採樣法由於信號成分較小的頻寬，可以有效降低所需要的採樣點數，進而達到壓縮的效果。勒壤得多項式法則是經由勒壤得多項式來尋找信號成分的稀疏表達方式，轉換成較少係數的結果。壓縮過後的資料與還原資料所需要的參數共同被編碼為一個封包，得到最後的壓縮結果。封包結果容易解碼且只需逆向操作即可進行還原重建。我們所提出的演算法，藉由在時頻分析上切割信號，以降低時頻分析圖上多餘的空白處，因而減少需要儲存的壓縮信號。雖然運算的時間相對較長，但在部分信號相較於常見的壓縮格式，可以同時擁有較高壓縮率以及較低重建誤差率的明顯較佳結果。	zh_TW
dc.description.abstract	Due to the fast developments in hardware, the computation resources are available more easily than decades ago. In recent years, compressive sensing broadens our horizons by the promotion of the computation speed, which is different from the conventional sampling approaches limited to the celebrated Shannon’s theorem. The sparsity properties of signals are utilized by compressive sensing to break thorough the limitation of the traditional sampling rate, which makes us consider that the identical effect can be achieved by the characteristics in other aspects. As a result, we manage to take advantage of the time-frequency analysis tool commonly used in the field of the signal processing as a breakthrough point. It is known that the lower bound of the number of sampling points is positively associated with the area of the time-frequency analysis, which is exactly the key concept of designing our algorithm to compress the target signal. In this master thesis, we use the time-frequency analysis to implement the application of the vocal signal compression. Different from the widespread MP3 and M4A compression algorithms in life, the data discarded is determined by the pixels below the threshold or the blocks with small area instead of the human hearing capability. The consequence of the time-frequency analysis is divided into several blocks as the primary segmentation result. Then, we execute the time-frequency reassignment to the segmentation result with proposed schemes, such as the pre-cut scheme, the gap connection scheme, the head and tail scheme, and the fixed bandwidth estimation, to obtain the further signal components segmentation result. Our next step is to approximate the segmentation result of the signal components. For each component, we utilized the generalized modulation to lower the frequency and decrease the maximum bandwidth of single component. Then, we adopt two methods to approximate and compress the modulated signal components, which are the downsampling method and the Legendre polynomial method. The downsampling method can effectively decrease the number of sampling points to compress the data due to the smaller bandwidths of the signal component, while the Legendre polynomial method manages to find the sparse representations of the signal components by the Legendre polynomials and transforms the signal into less coefficients. The compressed data and the parameters needed for recovering the data are encoded into a package, which is the final compression result. The packages are easily decoded and able to be reconstructed with only reverse operation. Our proposed algorithm divides the target signal with the time-frequency analysis to reduce the redundant space on the figure and hence decreases the compressed signal for storage. In spite of relatively large computation time, the better result of higher compression ratio and lower reconstruction error holds in the meanwhile in some cases, compared to common compression formats.	en
dc.description.provenance	Made available in DSpace on 2021-05-11T05:00:07Z (GMT). No. of bitstreams: 1 ntu-108-R05942069-1.pdf: 3821209 bytes, checksum: 8045d79a1694c645f02fa5c702da33d7 (MD5) Previous issue date: 2019	en
dc.description.tableofcontents	誌謝 i 中文摘要 ii ABSTRACT iv CONTENTS vii LIST OF FIGURES xi LIST OF TABLES xiv Chapter 1 Introduction 1 1.1 Motivation 1 1.2 Primary Contributions 2 Chapter 2 Related Work 4 2.1 Compressive Sensing 4 2.1.1 The sensing problem 5 2.1.2 Sparsity 5 2.1.3 Incoherence 7 2.1.4 Sparse signal recovery 8 2.1.5 Robustness and Restricted Isometry Property (RIP) 9 2.2 Matching Pursuit and Basis Pursuit 11 2.2.1 Matching pursuit (MP) 12 2.2.2 Orthogonal matching pursuit (OMP) 14 2.2.3 Basis pursuit (BP) 16 2.2.4 Basis pursuit denoising (BPDN) 18 2.3 Other Expansion Methods 19 2.3.1 Method of frames (MOF) 19 2.3.2 Best orthogonal basis (BOB) 20 2.3.3 Total variation denoising (TVDN) 21 2.3.4 Comparison examples 23 2.4 Basis Selection 26 2.4.1 Gabor atomic dictionary 26 2.4.2 Chirplet atomic dictionary 26 2.4.3 Advanced chirplet atomic dictionary 27 2.4.4 Sinusoidal chirplet atomic dictionary 27 2.4.5 FMmlet atomic dictionary 28 2.4.6 Wavelet atomic dictionary 29 2.4.7 Dictionary mergers 30 2.5 Summary 31 Chapter 3 Proposed Work 32 3.1 Time-Frequency Analysis 32 3.1.1 Gabor transform 32 3.1.2 Wigner distribution function 34 3.1.3 Gabor-Wigner transform 35 3.1.4 Segmentation 37 3.2 Time-Frequency Reassignment 39 3.2.1 Pre-cut scheme 39 3.2.2 Local maximums and local minimums 40 3.2.3 Gap connection scheme 42 3.2.4 Head and tail scheme 45 3.2.5 Fixed bandwidth estimation 47 3.3 Signal Component Approximation 48 3.3.1 Generalized modulation 49 3.3.2 Downsampling 53 3.3.3 Legendre polynomial basis 55 3.3.4 Encoding 56 3.4 Signal Reconstruction Scheme 58 3.4.1 Decoding 59 3.4.2 Downsampling 60 3.4.3 Legendre polynomial basis 61 3.5 Summary 63 Chapter 4 Simulation Result 64 4.1 Performance 64 4.1.1 Animal signals dataset 65 4.1.2 People dataset 69 4.1.3 Vehicles dataset 74 4.2 Computation time 77 Chapter 5 Discussion 79 Chapter 6 Conclusion and Future Work 82 REFERENCE 84
dc.language.iso	en
dc.subject	時頻分析	zh_TW
dc.subject	壓縮感測	zh_TW
dc.subject	勒壤得多項式法	zh_TW
dc.subject	時頻重配	zh_TW
dc.subject	一般化調變	zh_TW
dc.subject	降採樣法	zh_TW
dc.subject	compressive sensing	en
dc.subject	Legendre polynomial method	en
dc.subject	downsampling method	en
dc.subject	generalized modulation	en
dc.subject	time-frequency reassignment	en
dc.subject	time-frequency analysis	en
dc.title	壓縮感測的時頻分析方法	zh_TW
dc.title	Time-Frequency Methods for Compressive Sensing	en
dc.date.schoolyear	107-2
dc.description.degree	碩士
dc.contributor.oralexamcommittee	丁建均,蘇柏青,簡鳳村
dc.subject.keyword	壓縮感測,時頻分析,時頻重配,一般化調變,降採樣法,勒壤得多項式法,	zh_TW
dc.subject.keyword	compressive sensing,time-frequency analysis,time-frequency reassignment,generalized modulation,downsampling method,Legendre polynomial method,	en
dc.relation.page	88
dc.identifier.doi	10.6342/NTU201902199
dc.rights.note	同意授權(全球公開)
dc.date.accepted	2019-07-31
dc.contributor.author-college	電機資訊學院	zh_TW
dc.contributor.author-dept	電信工程學研究所	zh_TW
顯示於系所單位：	電信工程學研究所

文件中的檔案：

檔案	大小	格式
ntu-108-1.pdf	3.73 MB	Adobe PDF	檢視/開啟

顯示文件簡單紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。