Please use this Handle URI to cite this item:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/52452
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 傅立成(Li-Chen Fu) | |
dc.contributor.author | Shu-Chun Lin | en |
dc.contributor.author | 林叔君 | zh_TW |
dc.date.accessioned | 2021-06-15T16:15:12Z | - |
dc.date.available | 2018-08-25 | |
dc.date.copyright | 2015-08-25 | |
dc.date.issued | 2015 | |
dc.date.submitted | 2015-08-17 | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/52452 | - |
dc.description.abstract | This thesis first proposes two novel features for activity recognition from top-view depth image sequences. Most previous work deals mainly with side-view depth image sequences, which may suffer from occlusion problems; a top-view camera setting is therefore adopted in this thesis. Based on the notion of computed tomography, each top-view depth image is segmented into layers along the z-axis. Representative body points found on each layered image then serve as a meaningful feature, substituting for body parts in describing activity postures. In addition, a discriminative shape descriptor is proposed to describe the human shape of different activity postures: based on the occupancy values of small regions, a cylinder-sector occupancy grid with a saturation function captures the special characteristics of the top-view human shape. To make both features invariant to orientation, the human orientation is estimated by extracting the head and shoulder regions, and the two features are refined accordingly. Finally, the dynamic time warping algorithm is applied to handle sequences of different lengths, and an SVM classifier is trained to classify the activities. To evaluate performance, two new top-view datasets are constructed. Challenging cross-subject tests are conducted, demonstrating the effectiveness of the representative body points and the layered sector-based shape descriptor. The results show that the accuracy reaches up to 96%, which is promising compared with state-of-the-art methods in the literature. | en |
dc.description.provenance | Made available in DSpace on 2021-06-15T16:15:12Z (GMT). No. of bitstreams: 1 ntu-104-R02921013-1.pdf: 2638961 bytes, checksum: 3b75c0b814cf7a226c521a6729facc57 (MD5) Previous issue date: 2015 | en |
dc.description.tableofcontents | 口試委員會審定書 #
中文摘要 iii
ABSTRACT iv
CONTENTS vi
LIST OF FIGURES ix
LIST OF TABLES xiii
Chapter 1 Introduction 1
1.1 Motivation 1
1.2 Problem Formulation 5
1.3 Literature Review 6
1.3.1 Applications based on top-view camera settings 6
1.3.2 Activity recognition using depth image sequences 7
1.4 Contribution 9
1.5 Thesis Organization 10
Chapter 2 Preliminaries and the System Configuration 12
2.1 Dynamic Time Warping (DTW) 13
2.1.1 Principle of DTW 13
2.1.2 Classification by DTW 16
2.1.3 Advantages and Disadvantages of Classification by DTW 16
2.2 Support Vector Machine (SVM) 17
2.2.1 Linear SVM 18
2.2.2 General Form of Linear SVM 20
2.2.3 Soft Margin SVM 22
2.2.4 Nonlinear SVM 23
2.3 System Design and Preprocessing 24
2.3.1 Camera Environment Setting 25
2.3.2 World Coordinate Mapping 26
2.3.3 Background Subtraction 27
Chapter 3 Methodology 30
3.1 Layered Representative Body Points 31
3.2 Layered Sector-based Shape Descriptor 40
3.3 Human Orientation and Orientation Refinement 44
3.3.1 Human Orientation 45
3.3.2 Layered Representative Body Points with Orientation Refinement 50
3.3.3 Layered Sector-based Shape Descriptor with Orientation Refinement 51
3.4 Classification 52
Chapter 4 Experiment 55
4.1 Environmental Description 55
4.2 Datasets Description 56
4.2.1 Top-View 3D Daily Activity Dataset 57
4.2.2 Top-View 3D Daily Activity with Orientation Dataset 59
4.3 Action Recognition Results 61
4.3.1 Top-View 3D Daily Activity Dataset 61
4.3.2 Top-View 3D Daily Activity with Orientation Dataset 64
Chapter 5 Conclusion and Future Work 70
REFERENCE 72 | |
dc.language.iso | en | |
dc.title | 利用分層俯視角深度特徵應用於日常活動辨識 | zh_TW |
dc.title | Daily Activity Recognition Using Features from Layered Top-View Depth Information | en |
dc.type | Thesis | |
dc.date.schoolyear | 103-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 陳永耀(Yung-Yao Chen),洪一平(Yi-Ping Hung),陳祝嵩(Chu-Song Chen),范欽雄(Chin-Shyurng Fahn) | |
dc.subject.keyword | 活動辨識,俯視角,深度,動態時間校正, | zh_TW |
dc.subject.keyword | Top-view,activity recognition,depth,dynamic time warping, | en |
dc.relation.page | 75 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2015-08-18 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 電機工程學研究所 | zh_TW |
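As context for the abstract above, which applies dynamic time warping (DTW) to compare activity feature sequences of different lengths before SVM classification: the sketch below is a minimal, illustrative DTW implementation, not the thesis code. The sequences and the per-frame distance function are hypothetical stand-ins for the thesis's layered top-view depth features.

```python
# Minimal DTW sketch (illustrative; the thesis's actual feature vectors
# and frame-distance function would replace the 1-D values and abs-diff here).

def dtw_distance(seq_a, seq_b, dist=lambda a, b: abs(a - b)):
    """Return the DTW alignment cost between two sequences."""
    n, m = len(seq_a), len(seq_b)
    INF = float("inf")
    # cost[i][j] = cheapest alignment of seq_a[:i] with seq_b[:j]
    cost = [[INF] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = dist(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # repeat a frame of seq_b
                                 cost[i][j - 1],      # repeat a frame of seq_a
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m]

# Two sequences of different lengths tracing the same "shape":
short = [0, 1, 2, 1, 0]
long_ = [0, 0, 1, 1, 2, 2, 1, 1, 0, 0]
print(dtw_distance(short, long_))         # 0.0: DTW aligns them perfectly
print(dtw_distance(short, [2, 2, 2, 2]))  # 6.0: a different shape costs more
```

This length-invariance is what lets DTW compare activity recordings of varying duration; the resulting DTW costs (e.g. against class templates) can then feed a classifier such as an SVM.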
Appears in Collections: | 電機工程學系
Files in This Item:
File | Size | Format | |
---|---|---|---|
ntu-104-1.pdf (currently not authorized for public access) | 2.58 MB | Adobe PDF | |
All items in this repository are protected by copyright, with all rights reserved, unless otherwise indicated in their license terms.