Knowledge Structure and Similarity Retrieval in Video Databases (視訊資料庫之知識結構與相似度查詢)

Ping Yu; 余平

Please use this Handle URI to cite this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/32728

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	李瑞庭(Anthony J.T. Lee)
dc.contributor.author	Ping Yu	en
dc.contributor.author	余平	zh_TW
dc.date.accessioned	2021-06-13T04:14:18Z	-
dc.date.available	2006-07-27
dc.date.copyright	2006-07-27
dc.date.issued	2006
dc.date.submitted	2006-07-25
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/32728	-
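dc.description.abstract	In recent years, because traditional databases cannot adequately handle video data, how to manage video databases effectively has become a popular research topic. In a video database system, one of the most important ways to distinguish videos is to use the objects in a video and the spatial and temporal relations between them; how to exploit these properties when storing videos in a video database has therefore become an important design issue. In this dissertation, we first propose a new video knowledge structure, 3D C-string, which represents the spatial and temporal relations between objects in a video and keeps track of changes in each object's moving speed and size. We then propose the 3DC similarity retrieval algorithm, which provides multiple types of video similarity and can therefore discriminate between videos under different criteria. Next, we propose another new video knowledge structure, 3D Z-string; because it does not need to cut objects into subobjects, it is more compact and efficient than 3D C-string in both storage requirements and execution time. We then propose the 3DZ similarity retrieval algorithm, which can find partially similar object sets and provides a mechanism for updating query results through feedback, making video retrieval more flexible and better matched to users' needs. Finally, we conduct a series of experiments. The results show that the proposed methods are more effective and useful than previous methods. In addition, we build a prototype video database system to demonstrate the methods proposed in this dissertation.	zh_TW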
dc.description.abstract	In recent years, the efficient processing and management of video databases has attracted growing attention because traditional database systems are not suited to video data. In video database systems, one of the most important ways to discriminate between videos is to use the spatio-temporal relations between the objects they contain. How videos are stored in a database therefore becomes an important design issue for a video database system. In this dissertation, we first propose a new knowledge structure called 3D C-string. The 3D C-string can represent the spatio-temporal relations between objects in a video and keep track of the motions and size changes of the objects. Second, we propose the 3DC similarity retrieval algorithm. By providing various types of similarity between videos, the algorithm can discriminate between videos under different criteria. Third, we propose a new knowledge structure called 3D Z-string. Because objects in the video are not cut into subobjects, the 3D Z-string approach is more compact and efficient than the 3D C-string approach in terms of storage requirement and execution time. Finally, we propose the 3DZ similarity retrieval algorithm. Because it can find partly matched object sets and refine the query results from user feedback, it provides a more flexible way to retrieve similar videos. To show the efficiency and effectiveness of our proposed approaches, we perform a series of experiments comparing them with previously proposed approaches. The experimental results show that our approaches outperform the previous ones. We also develop a prototype video database management system that supports the methods presented in this dissertation.	en
dc.description.provenance	Made available in DSpace on 2021-06-13T04:14:18Z (GMT). No. of bitstreams: 1 ntu-95-D86725001-1.pdf: 3223250 bytes, checksum: 11d7fc4be2ca5f2c4b4d8d9a36af575a (MD5) Previous issue date: 2006	en
dc.description.tableofcontents	Table of Contents; List of Figures; List of Tables. Chapter 1 Introduction: 1.1 Motivation; 1.2 Contributions; 1.3 Dissertation layout. Chapter 2 Background and Literature Survey: 2.1 Spatial knowledge structures for images (2.1.1 2D string; 2.1.2 2D G-string; 2.1.3 2D C-string; 2.1.4 2D C+-string; 2.1.5 Unique-ID-based matrix; 2.1.6 Bit-pattern-based matrix; 2.1.7 2D Z-string); 2.2 Spatio-temporal knowledge structures for videos (2.2.1 2D C-Tree; 2.2.2 9DLT string; 2.2.3 3D string); 2.3 Systems (2.3.1 OVID; 2.3.2 QBIC; 2.3.3 VideoQ; 2.3.4 AVIS; 2.3.5 VideoSTAR); 2.4 Discussion. Chapter 3 3D C-string: 3.1 The 3D C-string representation of a symbolic video; 3.2 3DC string generation algorithm (3.2.1 3DC spatial string generation algorithm; 3.2.2 3DC temporal string generation algorithm); 3.3 3DC video reconstruction algorithm (3.3.1 3DC spatial string reconstruction algorithm; 3.3.2 3DC temporal string reconstruction algorithm). Chapter 4 3DC Similarity Retrieval Algorithm: 4.1 3DC spatial relation inference algorithm; 4.2 3DC similarity retrieval algorithm. Chapter 5 3D Z-string: 5.1 The 3D Z-string representation of a symbolic video; 5.2 3DZ string generation algorithm; 5.3 3DZ video reconstruction algorithm. Chapter 6 3DZ Similarity Retrieval Algorithm: 6.1 3DZ spatio-temporal relation inference; 6.2 Similarity retrieval; 6.3 Relevance feedback; 6.4 Discussion. Chapter 7 Performance Analysis: 7.1 Synthesized videos (7.1.1 Generation of synthetic videos; 7.1.2 3DC string generation and video reconstruction algorithms; 7.1.3 3DZ string generation and video reconstruction algorithms; 7.1.4 3DC similarity retrieval algorithm; 7.1.5 3DZ similarity retrieval algorithm); 7.2 Real videos (7.2.1 3DC string generation and video reconstruction algorithms; 7.2.2 3DZ string generation and video reconstruction algorithms; 7.2.3 3DC similarity retrieval algorithm; 7.2.4 3DZ similarity retrieval algorithm). Chapter 8 Prototype System: 8.1 Video indexing tool; 8.2 Video query tool. Chapter 9 Conclusions and Future Work. References.
dc.language.iso	en
dc.subject	Similarity retrieval	zh_TW
dc.subject	Video databases	zh_TW
dc.subject	Spatio-temporal relation inference	zh_TW
dc.subject	3D C-string	zh_TW
dc.subject	3D Z-string	zh_TW
dc.subject	3D C-string	en
dc.subject	Video databases	en
dc.subject	Similarity retrieval	en
dc.subject	Spatio-temporal inference	en
dc.subject	3D Z-string	en
dc.title	視訊資料庫之知識結構與相似度查詢	zh_TW
dc.title	Knowledge Structure and Similarity Retrieval in Video Databases	en
dc.type	Thesis
dc.date.schoolyear	94-2
dc.description.degree	Doctoral
dc.contributor.oralexamcommittee	陳彥良(Yen-Liang Chen),劉敦仁(Duen-Ren Liu),沈錳坤(Man-Kwan Shan),莊裕澤(Yuh-Jzer Joung)
dc.subject.keyword	Video databases,Spatio-temporal relation inference,3D C-string,3D Z-string,Similarity retrieval	zh_TW
dc.subject.keyword	Video databases,Spatio-temporal inference,3D C-string,3D Z-string,Similarity retrieval	en
dc.relation.page	185
dc.rights.note	Paid license
dc.date.accepted	2006-07-25
dc.contributor.author-college	College of Management	zh_TW
dc.contributor.author-dept	Graduate Institute of Information Management	zh_TW
Appears in Collections:	Department of Information Management

Files in this item:

File	Size	Format
ntu-95-1.pdf (not authorized for public access)	3.15 MB	Adobe PDF


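Unless their copyright terms are specifically indicated otherwise, all items in this system are protected by copyright, with all rights reserved.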