Please use this Handle URI to cite this item:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/39484
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 洪一平(Yi-Ping Hung) | |
dc.contributor.author | Kuan-Wen Chen | en |
dc.contributor.author | 陳冠文 | zh_TW |
dc.date.accessioned | 2021-06-13T17:29:40Z | - |
dc.date.available | 2011-07-25 | |
dc.date.copyright | 2011-07-25 | |
dc.date.issued | 2011 | |
dc.date.submitted | 2011-07-12 | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/39484 | - |
dc.description.abstract | 攝影機網路已廣泛應用於視訊安全監控系統中,例如:機場安全監控、車站安全監控、交通流量監控等。其主要優點在於可以監控大範圍的區域。可是隨著攝影機的數量愈來愈多,對於使用者而言,要同時觀看如此多的畫面是非常困難的。在這篇論文中,我們將探討攝影機網路安全監控系統中,於中控室監控時所需面對的主要研究課題。首先,對於跨攝影機間的事件連結,我們提出一自動學習演算法,以進行多攝影機間之目標物自動連續追蹤。於追蹤過程中的攝影機畫面切換,我們提出一主觀式平順轉場技術,以幫助使用者於攝影機切換過程中能持續監控目標物。對於大範圍與高解析度監控之應用,有別於傳統的昂貴設置方式,我們提出一多重解析度顯示設計-大小眼觀察家系統。
對於多攝影機之目標物追蹤研究,我們主要探討多攝影機之監看區域彼此間沒有重疊的情形,其困難點在於如何學習攝影機兩兩之間的時空關係與亮度轉換函式。目前該領域現有技術主要透過事先收集訓練資料並藉由人工點對應關係的方式進行學習,不過其只能應用於短時間監控或是該監控環境不會改變時。當監控環境會逐漸改變時,例如:光線變化,則這些方法會無法適應環境改變以導致追蹤錯誤。而在這篇論文中,我們提出一自動且能適應性學習的演算法,因此更能將方法應用於長時間的安全監控。 對於使用者於攝影機網路監控畫面追蹤目標物。傳統監控系統會於主要監控畫面進行直接畫面切換,可是當我們於多攝影機間進行持續追蹤,畫面會不斷切換。對於使用者而言,頻繁的直接畫面切換會造成很大的監控負擔,會很難去聯想目前使用者在環境中是從哪裡走到哪裡。因此,在這篇論文,我們提出一主觀式平順轉場技術,藉由產生攝影機間切換時的虛擬畫面,以幫助使用者更能了解當攝影機切換時的目標物移動情形。而有別於傳統視訊轉場技術,我們的方法可處理多攝影機間的監控區域是比較不同甚至不重疊的情形。 最後,我們提出一個同時具有大範圍與高解析度監控特性的多重解析度顯示系統–大小眼觀察家。該系統可同時達到高解析度顯示、高畫面更新率與低建置成本,其靈感來自於人眼視覺,只於使用者感興趣的區域顯示高解析度畫面。我們也提出一使用者測試實驗。於該實驗中,我們將所提出系統與現有方法進行比較。而實驗結果顯示,使用我們的系統,確實能有效提升使用者的監控效率。 | zh_TW |
dc.description.abstract | Camera networks have been widely used in visual surveillance applications such as airport and railway security and traffic monitoring. The main benefit of a multi-camera system is that it can monitor the activities of targets over a large area. However, for security guards and other users, the difficulty of monitoring such a system grows with the number of cameras, especially when events span multiple cameras. In this dissertation, we investigate two major tasks of monitoring in the command center display. One is to track targets in a camera network with computer automation. The other is to develop display techniques that help users monitor events in a camera network more easily.
First, to track targets across networked cameras, we focus on situations where the fields of view of the cameras do not necessarily overlap. One of the major problems of tracking across non-overlapping cameras is learning the spatio-temporal relationship and the appearance relationship between cameras, where the appearance relationship is usually modeled as a brightness transfer function. Traditional methods learn these relationships from hand-labeled correspondences or through a batch-learning procedure, and are applicable only when the environment remains unchanged. In many situations, however, the environment varies significantly, for example under lighting changes, and these methods fail. In this dissertation, we propose an unsupervised method that learns adaptively and can be applied to long-term monitoring. Second, when monitoring tracking activity in a camera network, traditional surveillance systems switch the main camera view from one camera to another directly, which makes it difficult for users to stay aware of the target's trajectory in the environment as views switch repeatedly. In this dissertation, we propose a novel egocentric view transition approach, which synthesizes virtual views during the switch between cameras and reduces the mental effort required to understand the events. An important property of our system is that it can be applied even when the fields of view of the transition cameras are far apart or mutually exclusive. Finally, for large-scale and high-resolution monitoring, we propose a multi-resolution display with a steerable focus, e-Fovea. Large-scale and high-resolution monitoring systems are ideal for many visual surveillance applications, but existing approaches either provide insufficient resolution and low frame rates, or suffer from high complexity and cost.
We take inspiration from the human visual system and propose a multi-resolution design, e-Fovea, which provides peripheral vision together with a steerable fovea of higher resolution. In this dissertation, we further present two user studies, with a total of 36 participants, comparing e-Fovea to two existing multi-resolution visual monitoring designs. The results show that for visual monitoring tasks, our e-Fovea design with steerable focus is significantly faster than existing approaches and preferred by users. | en |
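The brightness transfer function mentioned in the abstract maps brightness values observed in one camera to the values the same surface would produce in another camera. One standard way such a function is estimated in the literature is by matching cumulative brightness histograms of corresponding observations of the same target. The sketch below illustrates that general idea only; the function name `estimate_btf` and the simulated gamma-darkened camera are illustrative assumptions, not the dissertation's actual method.

```python
import numpy as np

def estimate_btf(values_a, values_b, levels=256):
    """Estimate a brightness transfer function f so that brightness b
    seen in camera A maps to f[b] in camera B, by matching the
    cumulative histograms of corresponding target observations."""
    hist_a, _ = np.histogram(values_a, bins=levels, range=(0, levels))
    hist_b, _ = np.histogram(values_b, bins=levels, range=(0, levels))
    cdf_a = np.cumsum(hist_a) / len(values_a)
    cdf_b = np.cumsum(hist_b) / len(values_b)
    # For each brightness level in A, pick the level in B whose
    # cumulative frequency matches (clip guards the top bin).
    return np.searchsorted(cdf_b, cdf_a, side="left").clip(0, levels - 1)

# Toy example: camera B behaves like a gamma-darkened view of camera A.
rng = np.random.default_rng(0)
a = rng.integers(0, 256, size=10000)
b = (255 * (a / 255.0) ** 2).astype(int)  # simulated appearance change
btf = estimate_btf(a, b)
```

The recovered mapping is monotonically non-decreasing and approximately follows the simulated gamma curve, which is the behavior an appearance model needs before comparing target colors across cameras.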
dc.description.provenance | Made available in DSpace on 2021-06-13T17:29:40Z (GMT). No. of bitstreams: 1 ntu-100-F93922014-1.pdf: 11410051 bytes, checksum: c643106b4cdf4d7150bb7efa90976eb0 (MD5) Previous issue date: 2011 | en |
dc.description.tableofcontents | TABLE OF FIGURES xvii
TABLE OF TABLES xxiii
CHAPTER 1 INTRODUCTION 1
1.1 Background and Motivation 1
1.2 Outline of this Research 3
1.2.1 Adaptive Learning for Target Tracking across Multiple Non-Overlapping Cameras 3
1.2.2 Egocentric View Transition in a Camera Network 5
1.2.3 Multi-Resolution Design for Large-Scale and High-Resolution Monitoring 6
1.3 Organization of the Dissertation 6
CHAPTER 2 RELATED WORK 7
2.1 Tracking across Non-Overlapping Cameras 7
2.2 Monitoring the Tracking Activity among Multiple Cameras 10
2.3 Large-Scale and High-Resolution Monitoring System 11
2.4 Summary 13
CHAPTER 3 ADAPTIVE LEARNING FOR TARGET TRACKING ACROSS MULTIPLE NON-OVERLAPPING CAMERAS 15
3.1 Introduction 15
3.1.1 Characteristics of Our Approach 16
3.2 Problem Formulation 18
3.3 Learning Spatio-Temporal Relationship 19
3.3.1 Batch Learning Phase 20
3.3.2 Incremental Learning Phase 21
3.4 Automatic Discovering and Removing Weak Links 24
3.4.1 Remove Weak Links - Batch Learning Phase 26
3.4.2 Remove Weak Links - Incremental Learning Phase 28
3.5 Learning Brightness Transfer Function 31
3.5.1 Brightness Transfer Functions – A Review 31
3.5.2 Criterion for BTF Estimation 33
3.5.3 Spatio-Temporal Information and MCMC Sampling 34
3.5.4 Adaptively Learning BTF 35
3.5.5 Handling Sudden Illumination Change 36
3.6 Learning Fusion Weights 36
3.6.1 Basic Method 36
3.6.2 Supervised Learning Method 37
3.6.3 Unsupervised Learning Method 38
3.7 Results 39
3.7.1 Experimental Setup 39
3.7.2 Experiment on Learning Spatio-Temporal Relationship 41
3.7.3 Experiment on Learning Brightness Transfer Function 44
3.7.4 Experiment on Learning Fusion Weights 45
3.7.5 Experiment on Tracking Targets across Multiple Cameras 49
3.7.6 Discussion 51
3.8 Summary 55
CHAPTER 4 EGOCENTRIC VIEW TRANSITION IN A CAMERA NETWORK 57
4.1 Introduction 57
4.2 Motivation and Evaluation 60
4.2.1 Psychological Support 60
4.2.2 User Study: Monitoring with View Transition or Not 61
4.3 System Overview 65
4.3.1 Preprocessing 66
4.3.2 Multi-Camera Tracking 67
4.4 View Transition for Overlapping Cameras 67
4.4.1 Foreground Detection 68
4.4.2 Foreground Billboard Construction and Position Estimation 69
4.4.3 Virtual Camera Placement 69
4.5 View Transition for Non-Overlapping Cameras 70
4.5.1 Particle System 71
4.5.2 Foreground Particles Generation 71
4.5.3 Particles Movement Control 71
4.5.4 Virtual Camera Placement 72
4.5.5 Background Texture Adaptation 73
4.6 Results 74
4.7 Summary 76
CHAPTER 5 MULTI-RESOLUTION DESIGN FOR LARGE-SCALE AND HIGH-RESOLUTION MONITORING 79
5.1 Introduction 79
5.2 User Study Evaluation 81
5.2.1 Interfaces and Apparatus 82
5.2.2 User Study 1: Single Moving Target Tracking 84
5.2.3 User Study 2: Multiple Moving Target Identification 87
5.2.4 Summary 91
5.3 Discussion 92
5.3.1 No Switching and Re-orientation Required 92
5.3.2 Providing Global Context 92
5.3.3 Eliminating Clipping 93
5.3.4 Without Feeling Dizzy 93
5.4 Design and Implementation of e-Fovea 93
5.4.1 System Architecture 94
5.4.2 Camera Calibration 95
5.4.3 Projector Calibration 97
5.4.4 Projector-Camera Integration 100
5.5 Results 102
5.5.1 Evaluation of Camera Calibration 102
5.5.2 Evaluation of Projector Calibration 104
5.5.3 Demonstration 105
5.6 Summary 106
CHAPTER 6 CONCLUSION AND FUTURE WORK 109
6.1 Summary of the Dissertation 109
6.2 Future Directions 110
LIST OF REFERENCES 113
PUBLICATIONS 129 | |
dc.language.iso | en | |
dc.title | 攝影機網路之目標物追蹤與視覺化顯示 | zh_TW |
dc.title | Target Tracking and Monitoring in a Camera Network | en |
dc.type | Thesis | |
dc.date.schoolyear | 99-2 | |
dc.description.degree | 博士 | |
dc.contributor.oralexamcommittee | 李錫堅(Hsi-Jian Lee),陳祝嵩(Chu-Song Chen),傅楸善(Chiou-Shann Fuh),王聖智(Sheng-Jyh Wang),王傑智(Chieh-Chih Wang),李忠謀(Chung-Mou Lee),李明穗(Ming-Sui Lee) | |
dc.subject.keyword | 視訊追蹤,攝影機網路,視訊安全監控,無重疊區域之攝影機,時空關係,亮度轉換關係,主觀式平順轉場,影像畫面切換,中控室,多重解析度,可移動式聚焦點,異質型雙攝影機系統,使用者測試, | zh_TW |
dc.subject.keyword | Visual tracking,camera network,visual surveillance,non-overlapping cameras,spatio-temporal relationship,brightness transfer function,egocentric view transition,switching views,command center,multi-resolution,steerable focus,visual monitoring,hybrid dual-camera system,user study, | en |
dc.relation.page | 130 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2011-07-12 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 資訊工程學研究所 | zh_TW |
Appears in Collections: | 資訊工程學系
Files in This Item:
File | Size | Format | |
---|---|---|---|
ntu-100-1.pdf (currently not authorized for public access) | 11.14 MB | Adobe PDF |
All items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.