利用語音進行照片中人物影像的自動化標註及檢索

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/52671

標題:	利用語音進行照片中人物影像的自動化標註及檢索 Automatic Facial Image Annotation and Retrieval by Integrating Voice Label and Visual Appearance
作者:	Hong-Wun Jheng 鄭宏文
指導教授:	徐宏民
關鍵字:	照片標註,語音檢索, Photo Annotation,Speech Retrieval,
出版年 :	2015
學位:	碩士
摘要:	Annotation is important for managing and retrieving a large amount of photos, but it is generally labor-intensive and time-consuming. However, speaking while taking photos is straightforward and effortless, and using voice for annotation is faster than typing words. To best reduce the manual cost of annotating photos, we propose a novel framework which utilizes the scarce spoken annotations recorded while capturing as voice labels and automatically label every facial image in the photo collection. To accomplish this goal, we employ a probabilistic graphical model which integrates voice labels and visual appearances for inference. Combined with group prior estimation and gender attribute association, we can achieve an outstanding performance on the proposed synthesized group photo collections.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/52671
全文授權:	有償授權
顯示於系所單位：	資訊工程學系

文件中的檔案：

檔案	大小	格式
ntu-104-1.pdf 未授權公開取用	1.57 MB	Adobe PDF

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace