請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/55294
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 連豊力(Feng-Li Lian) | |
dc.contributor.author | Bo-Yu Lin | en |
dc.contributor.author | 林柏宇 | zh_TW |
dc.date.accessioned | 2021-06-16T03:55:16Z | - |
dc.date.issued | 2021 | |
dc.date.submitted | 2021-02-05 | |
dc.identifier.citation | [1: Chalmers et al. 2017] Peter N. Chalmers, Markus A. Wimmer, Nikhil N. Verma, Brian J. Cole and Anthony A. Romeo, “The Relationship Between Pitching Mechanics and Injury: A Review of Current Concepts,” Sports Health, Vol. 9, No. 3, pp. 216-221, 2017. [2: Posner et al. 2011] Matthew Posner, Kenneth L. Cameron, Jennifer Moriatis Wolf, Philip J. Belmont, Jr and Brett D. Owens, “Epidemiology of Major League Baseball Injuries,” The American Journal of Sports Medicine, Vol. 39, No. 8, pp. 1675-1691, June 2011. [3: Hu et al. 2019] Hu Jian-Fang, Wang Xiong-Hui, Zheng Wei-Shi and Lai Jian-Huang, “RGB-D Action Recognition: Recent Advances and Future Perspectives,” Acta Automatica Sinica, Vol. 45, No. 5, pp. 829-840, 2019. [4: Saini et al. 2014] Sanjay Saini, Dayang Rohaya Bt Awang Rambli, M. Nordin B. Zakaria and Suziah Bt Sulaiman, “A Review on Particle Swarm Optimization Algorithm and Its Variants to Human Motion Tracking,” Mathematical Problems in Engineering, Vol. 2014, pp. 1-16, Nov. 2014. [5: Kennedy Eberhart 1995] James Kennedy and Russell Eberhart, “Particle Swarm Optimization,” in the Proceedings of International Conference on Neural Networks, Perth, WA, Australia, Vol. 4, pp. 1942-1948, Nov. 27-Dec. 1, 1995. [6: Liu et al. 2017] Zhenbao Liu, Jinxin Huang, Junwei Han, Shuhui Bu and Jianfeng Lv, “Human Motion Tracking by Multiple RGBD Cameras,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 27, No. 9, pp. 2014-2027, Sept. 2017. [7: Sharifi et al. 2017] Ashraf Sharifi, Ahad Harati and Abedin Vahedian, “Marker-based human pose tracking using adaptive annealed particle swarm optimization with search space partitioning,” Image and Vision Computing, Vol. 62, pp. 28-38, 2017. [8: Liu et al. 2015] Zhao Liu, Jian Zhu, Jiajun Bu and Chun Chen, “A survey of human pose estimation: The body parts parsing based methods,” Journal of Visiual Communication and Image Representation, Vol. 32, pp. 10-19, 2015. [9: Dalal Triggs 2005] Navneet Dalal and Bill Triggs, “Histograms of Oriented Gradients for Human Detection,” in the Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, Vol. 2, 2005. [10: Sapp et al. 2010] Benjamin Sapp, Alexander Toshev and Ben Taskar, “Cascaded Models for Articulated Pose Estimation,” in the Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Heraklion, Greece, pp. 406-420, 2010. [11: Toshev Szegedy 2014] Alexander Toshev and Christian Sze gedy, “DeepPose: Human Pose Estimation via Deep Neural Networks,” in the Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, pp. 1653-1660, 2014. [12: Chen Yuille 2014] Xianjie Chen and Alan Yuille, “Articulated Pose Estimation by a Graphical Model with Image Dependent Pairwise Relations,” in the Proceedings of the Advances in Neural Information Processing Systems, pp. 1736-1744, 2014. [13: Tompson et al. 2014] Jonathan Tompson, Arjun Jain, Yann LeCun and Christoph Bregler, “Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation,” in the Proceedings of the Advances in Neural Information Processing Systems, pp. 1799-1807, 2014. [14: Fang et al. 2017] Hao-Shu Fang, Shuqin Xie, Yu-Wing Tai and Cewu Lu, “RMPE: Regional Multi-person Pose Estimation,” in the Proceedings of IEEE International Conference on Computer Vision, Venice, Italy, pp. 2353-2362, Oct. 22-29, 2017. [15: Bourdev Malik 2009] Lubomir Bourdev and Jitendra Malik, “Poselets: Body part detectors trained using 3D human pose annotations,” in the Proceedings of IEEE International Conference on Computer Vision, Kyoto, Japan, pp. 1365-1372, Sept. 29-Oct. 2, 2009. [16: Wang Li 2013] Fang Wang and Yi Li, “Beyond Physical Connections: Tree Models in Human Pose Estimation,” in the Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, pp. 596-603, June 23-28, 2013. [17: Srinivasan Shi 2007] Praveen Srinivasan and Jianbo Shi, “Bottom-up Recognition and Parsing of the Human Body,” in the Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, MN, USA, pp. 1-8, June 17-22, 2007. [18: Dantone et al. 2013] Matthias Dantone, Juergen Gall, Christian Leistner and Luc Van Gool, “Human Pose Estimation using Body Parts Dependent Joint Regressors,” in the Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, pp. 3041-3048, June 23-28, 2013. [19: Ramakrishna et al. 2014] Varun Ramakrishna, Daniel Munoz, Martial Hebert, James Andrew Bagnell and Yaser Sheikh, “Pose Machines: Articulated Pose Estimation via Inference Machines,” in the Proceedings of European Conference on Computer Vision, 2014. [20: Tran Forsyth 2010] Duan Tran and David Forsyth, “Improved human parsing with a full relational model,” in the Proceedings of European Conference on Computer Vision, pp. 227-240, 2010. [21: Fischler Elschlager 1973] Martin A. Fischler and Robert A. Elschlager, “The Representation and Matching of Pictorial Structures,” IEEE Transactions on Computers, Vol. C-22, No. 1, pp. 67-92, Jan. 1973. [22: Felzenszwalb Huttenlocher 2005] Pedro F. Felzenszwalb and Daniel P. Huttenlocher, “Pictorial Structures for Object Recognition,” International Journal of Computer Vision, Vol. 61, No. 1, pp. 55-79, 2005. [23: Yang Ramanan 2011] Yi Yang and Deva Ramanan, “Articulated pose estimation with flexible mixtures-of-parts,” in the Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, pp. 1385-1392, 2011. [24: Kiefel Gehler 2014] Martin Kiefel and Peter V. Gehler, “Human Pose Estimation with Fields of Parts,” in the Proceedings of European Conference on Computer Vision, Zurich, Switzerland, Vol. 8693, pp. 331-346, Sept. 6-12, 2014. [25: Sarafianos et al. 2016] Nilolaos Sarafianos, Bogdan Boteanu, Bogdan Ionescu and Ioannis A. Kakadiaris, “3D Human pose estimation: A review of the literature and analysis of covariates,” Computer Vision and Image Understanding, Vol. 152, pp. 1-20, Nov. 2016. [26: Daubney et al. 2012] Ben Daubney, David Gibson and Neill Campbell, “Estimating pose of articulated objects using low-level motion,” Computer Vision and Image Understanding, Vol. 116, No. 3, pp. 330-346, March 2012. [27: Ning et al. 2008] Huazhong Ning, Wei Xu, Yihong Gong and Thomas Huang, “Discriminative Learning of Visual Words for 3D Human Pose Estimation,” in the Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA, pp. 1-8, 2008. [28: Burenius et al. 2013] Magnus Burenius, Josephine Sullivan and Stefan Carlsson, “3D Pictorial Structures for Multiple View Articulated Pose Estimation,” in the Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, pp. 3618-3625, 2013. [29: Belagiannis et al. 2014] Vasileios Belagiannis, Sikandar Amin, Mykhaylo Andriluka, Bernt Schiele, Nassir Navab and Slobodan Ilic, “3D Pictorial Structures for Multiple Human Pose Estimation,” in the Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, pp. 1669-1676, 2014. [30: Zuffi et al. 2012] Silvia Zuffi, Oren Freifeld and Michael J. Black, “From Pictorial Structures to Deformable Structures,” in the Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, pp. 3546-3553, 2012. [31: Huang Yang 2009] Jia-Bin Huang and Ming-Hsuan Yang, “Estimating Human Pose from Occluded Images,” in the Proceedings of Asian Conference on Computer Vision, Xian, China, Vol. 5994, pp. 48-60, 2009. [32: Sedai et al. 2010] Suman Sedai, Mohammed Bennamoun and Du Huynh, “Localized fusion of Shape and Appearance features for 3D Human Pose Estimation,” in the Proceedings of British Machine Vision Conference, Aberystwyth, UK, pp. 51.1-51.10, 2010. [33: Grauman et al. 2003] Kristen Grauman, Gregory Shakhnarovich and Trevor Darrell, “Inferring 3D structure with a statistical image-based shape model,” in the Proceedings of IEEE International Conference on Computer Vision, Nice, France, Vol. 1, pp. 641-647, 2003. [34: Sedai et al. 2013] Suman Sedai, Mohammed Bennamoun and Du Q. Huynh, “A Gaussian Process Guided Particle Filter for Tracking 3D Human Pose in Video,” IEEE Transactions on Image Processing, Vol. 22, No. 11, pp. 4286-4300, Nov. 2013. [35: Rosales Sclaroff 2006] RÓMer Rosales and Stan Sclaroff, “Combining Generative and Discriminative Models in a Framework for Articulated Pose Estimation,” International Journal of Computer Vision, Vol. 67, No. 3, pp. 251-276, 2006. [36: Salzmann Urtasun 2010] Mathieu Salzmann and Raquel Urtasun, “Combining discriminative and generative methods for 3D deformable surface and articulated pose reconstruction,” in the Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, pp. 647-654, 2010. [37: Redmon Farhadi 2018] Joseph Redmon and Ali Farhadi, “YOLOv3: An Incremental Improvement,” in arXiv:1804.02767v1, April 2018. [38: Deutscher Reid 2005] Jonathan Deutscher and Ian Reid, “Articulated Body Motion Capture by Stochastic Search,” International Journal of Computer Vision, Vol. 61, pp. 185-205, 2005. [39: Laganière 2011] Robert Laganière, “Estimating Projective Relations in Images,” in OpenCV 2 computer vision application programming cookbook: Over 50 recipes to master this library of programming functions for real-time computer vision. Packt Pub, Birmingham, UK, 2011. [40: Longuet-Higgins 1981] H. Christopher Longuet-Higgins, “A computer algorithm for reconstructing a scene from two projections,” Nature, Vol. 5828, No. 291, pp. 133-135, 1981. [41: Lehment et al. 2010] Nicolas H. Lehment, Dejan Arsić, Moritz Kaiser and Gerhard Rigoll, “Automated pose estimation in 3D point clouds applying annealing particle filters and inverse kinematics on a GPU,” in the Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops, San Francisco, CA, USA, pp. 87-92, 2010. [42: Lu 2020] Yen-Ying Lu, “Implications of the 2D human poses detector to reconstruction of 3D human pose in multi-camera system,” Master thesis, National Taiwan University, College of Electrical Engineering and Computer Science, Graduate Institute of Photonics and Optoelectronics, 2020. [43: Sigal Black 2006] Lenoid Sigal and Michael J. Black, “HumanEva: Synchronized video and motion capture dataset for evaluation of articulated human motion,” Technical Report CS-06-08, Brown University, 2006. Honeywell Research. [44: 中央社記者蘇思云 2020] 中央社記者蘇思云. 影像辨識3D重建運動軌跡 掌握投球打擊姿勢. [Online]. Available: https://www.cna.com.tw/news/ait/202010090048.aspx [45: Sawe 2018] Benjamin Elisha Sawe. (2018, Apr.). The Most Popular Sports in the World. WorldAtlas. [Online]. Available: https://www.worldatlas.com/articles/what-are-the-most-popular-sports-in-the-world.html [46: Vicon, Inc.] Vicon, Inc. Vicon Motion Systems. [Online]. Available: https://www.vicon.com/ [47: Machine Vision and Intelligence Group 2019] Machine Vision and Intelligence Group. AlphaPose. [Online]. Available: https://www.mvig.org/research/alphapose.html [48: Fang et al. 2019] Hao-Shu Fang, Jiefeng Li, Yuliang Xiu, Ruiheng Chang and Cewu Lu (2019, Dec.). AlphaPose. GitHub repository. [Online]. Available: https://github.com/MVIG-SJTU/AlphaPose.git [49: The Imaging Source, Inc.] The Imaging Source, Inc. DMK 33GX287. [Online]. Available: https://www.theimagingsource.tw/%E7%94%A2%E5%93%81/%E5%B7%A5%E6%A5%AD%E7%9B%B8%E6%A9%9F/gige-%E9%BB%91%E7%99%BD/dmk33gx287/ [50: The Imaging Source, Inc.] The Imaging Source, Inc. DMK 33UX287. [Online]. Available: https://www.theimagingsource.tw/%E7%94%A2%E5%93%81/%E5%B7%A5%E6%A5%AD%E7%9B%B8%E6%A9%9F/usb-3.0-%E9%BB%91%E7%99%BD/dmk33ux287/ [51: The Imaging Source, Inc.] The Imaging Source, Inc. The Imaging Source. [Online]. Available: https://www.theimagingsource.com/ | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/55294 | - |
dc.description.abstract | 利用多台攝影機實現運動員的三維動作追蹤一直是運動分析裡一個重要的部分。儘管市面上已經存在許多商業化的動作追蹤系統,但大部分不是費用高昂就是系統的建置十分複雜,這也導致了大部分的商業化動作追蹤系統都應用在實驗室的場景。相反的,大部分的棒球場都已經配備有多台的高速攝影機,如果能利用這些高速攝影機拍攝的影片來做三維動作追蹤與後續的運動分析,那對於運動產業來說會是一個很大的幫助。 本篇論文的目的,是利用從棒球場的高速攝影機拍攝的影片來做棒球投手的動作追蹤。首先,利用YOLO與AlphaPose做二維姿態的估測並得到二維姿態的資料。接著,利用兩兩相機之間的對極幾何關係重建出一個初步的三維骨架。然而這個初步的三維骨架會有許多錯誤的姿態與誤差。為了處理這些錯誤姿態與誤差,本篇論文設計了一系列前處理的步驟。 首先,三維資料會先經過一次過濾以去除一些明顯的誤差。接著,對有缺失的資料進行補值。最後,將整筆資料進行平滑化以便去除初步骨架中的微小抖動問題,此時便可以得到一個初步的結果。然而,平滑化的過程中也會導致在投球瞬間附近的資料被平滑掉,導致這些高速瞬間的預測值失準。因此,本篇論文利用粒子群最佳化演算法將這些被平滑掉的資料調整回到最合理的值。粒子群最佳化演算法非常適合應用在需要考慮多個觀測值的情境,因為粒子群最佳化演算法可以同時計算它的預測值對應到多個觀測值的誤差,並根據這個誤差再去做調整。然而,因為這個特性,粒子群最佳化演算法也可能會過度擬合到資料中的微小誤差。為了避免這個情形,在做粒子群最佳化之前,本篇論文設計了一系列前提來篩選掉不需要做最佳化的資料,並找到需要執行最佳化的投球瞬間。而為了解決遮蔽導致的姿態錯誤,本文設計了一個學習投手運動模型的機制。 最後,本篇論文利用從棒球場實際收到的數據來做實驗與分析,並對於本篇論文提出的方法進行測試與分析。 | zh_TW |
dc.description.abstract | 3D human motion tracking for athletes based on data captured by multiple cameras is an important part of sport analysis. Although there are some commercial human motion tracking systems, most of them are either high-cost or require complex setup, which makes such systems usually be used only in laboratory scenarios. On the contrary, almost every baseball field has high-speed cameras, it would be helpful if the data captured by these high-speed cameras can be utilized to do 3D human motion tracking. In the proposed system, the data captured by high-speed cameras at baseball field are utilized to track the 3D motion of baseball pitcher. For 2D pose estimation, the proposed system utilizes YOLO and AlphaPose to generate the 2D data from videos captured by high-speed cameras. After obtaining the 2D data, epipolar geometry is adopted to generate the initial 3D skeletons. However, there may be false detections and noise in the initial 3D skeletons. To deal with the false detections and noise, 3D data preprocessing is introduced in the proposed system. For 3D data preprocessing, a filtering process is adopted first to filter out obvious false detections. After that, the missing data are padded by interpolation. Finally, a smoothing process is adopted to remove noise. Since the smoothing process may also suppress the data around the pitching moment, particle swarm optimization (PSO) is adopted to deal with the error caused by the smoothing process. PSO is a useful tool to compensate the error, because it can adjust the value according to 2D data. To avoid overfitting, a series of conditions are set to filter out the data which has no need for PSO, and find the data around the pitching moment accurately. Furthermore, to deal with more serious false detections that cannot be filtered out, a motion model is designed and utilized. Experiments and analysis are provided to verify the proposed 3D human motion tracking system. | en |
dc.description.provenance | Made available in DSpace on 2021-06-16T03:55:16Z (GMT). No. of bitstreams: 1 U0001-0402202121213700.pdf: 14612326 bytes, checksum: dc76950fcac1c309038ff257d7e9ba10 (MD5) Previous issue date: 2021 | en |
dc.description.tableofcontents | 摘要 i ABSTRACT iii CONTENTS v LIST OF FIGURES vii LIST OF TABLES xii Chapter 1 Introduction 1 1.1 Motivation 1 1.2 Problem Formulation 3 1.2.1 2D Human Pose Estimation 4 1.2.2 3D Human Motion Tracking 4 1.3 Contributions 5 1.4 Organization of the Thesis 6 Chapter 2 Background and Literature Survey 7 2.1 Human Pose Estimation (HPE) 7 2.2 2D Single Person Parsing 9 2.3 Human Body Model 10 2.4 Human Motion Tracking 11 2.5 Summary 14 Chapter 3 Related Algorithms 16 3.1 YOLO 16 3.2 AlphaPose 18 3.3 Pinhole Camera Model 19 3.4 Epipolar Geometry 21 3.5 Particle Swarm Optimization (PSO) 23 Chapter 4 Background and Fundamental Information 26 4.1 Pitcher Motion Analysis 26 4.2 Marker Tracking with Polynomial Curve Fitting 33 4.3 Convergence Test for PSO 43 Chapter 5 Human Motion Tracking 46 5.1 System Architecture 49 5.1.1 2D Data Acquisition 49 5.1.2 3D Human Motion Tracking 50 5.2 Human Pose Estimation 53 5.2.1 Human Detection from 2D Image 54 5.2.2 2D Pose Estimation 57 5.3 3D Reconstruction based on Epipolar Geometry 59 5.4 Propagation Model 63 5.5 Loss Value Evaluation 65 5.6 Optimization Method 67 5.6.1 Initialization Step of PSO 69 5.6.2 Iterative Steps of PSO 70 5.7 Motion Model Formulation 72 Chapter 6 Experimental Results and Analysis 75 6.1 Experimental Setup 75 6.1.1 Setup for Data Acquisition 76 6.1.2 Platform for 3D Reconstruction 79 6.2 2D Pose Data Acquisition 80 6.2.1 Data Collection and Processing 81 6.2.2 Analysis of 2D pose estimation 84 6.3 3D Human Motion Tracking Test 86 6.3.1 3D Reconstruction Result 86 6.3.2 3D Data Preprocessing 89 6.3.3 PSO and Postprocessing 98 6.3.4 Analysis and Summary 103 6.4 Experiments and Analysis 110 6.4.1 Experiments of Pitcher 11 114 6.4.2 Experiments of Pitcher 37 141 6.4.3 Experiments of Pitcher 51 163 6.5 Summary and Discussions 187 6.5.1 2D Data Acquisition 187 6.5.2 Preprocessing System 190 6.5.3 Particle Swarm Optimization 193 6.5.4 Motion Model 196 Chapter 7 Conclusions and Future Works 201 7.1 Conclusions 201 7.2 Future Works 202 References 204 | |
dc.language.iso | en | |
dc.title | 基於粒子群最佳化演算法之無標記棒球投手三維動作追蹤系統 | zh_TW |
dc.title | A Markerless 3D Human Motion Tracking System for Baseball Pitcher Based on Particle Swarm Optimization | en |
dc.type | Thesis | |
dc.date.schoolyear | 109-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 李後燦(Hou-Tsan Lee),黃正民(Cheng-Ming Huang),許志明(Chih-Ming Hsu) | |
dc.subject.keyword | 三維動作追蹤,姿態估測,三維姿態重建,粒子群最佳化演算法,運動模型,運動分析, | zh_TW |
dc.subject.keyword | Human motion tracking,pose estimation,3D reconstruction,Particle swarm optimization,motion model,sport analysis, | en |
dc.relation.page | 209 | |
dc.identifier.doi | 10.6342/NTU202100539 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2021-02-06 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 電機工程學研究所 | zh_TW |
顯示於系所單位: | 電機工程學系 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
U0001-0402202121213700.pdf 目前未授權公開取用 | 14.27 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。