Snap2Read/Comp2Watch: Enhancing the Multimedia Browsing Experience on Mobile Devices

Yu-Ming Hsu; 許裕明

Please use this Handle URI to cite this document: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/9236

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	徐宏民(Winston H. Hsu)
dc.contributor.author	Yu-Ming Hsu	en
dc.contributor.author	許裕明	zh_TW
dc.date.accessioned	2021-05-20T20:14:03Z	-
dc.date.available	2013-08-18
dc.date.available	2021-05-20T20:14:03Z	-
dc.date.copyright	2011-08-18
dc.date.issued	2011
dc.date.submitted	2011-08-13
dc.identifier.citation	[1] B. Erol, E. Antunez and J. J. Hull, “HOTPAPER: Multimedia Interaction with Paper using Mobile Phones”, ACM Conference, 2008. [2] C. Liao and Q. Liu, “PACER: Toward A Cameraphone-based Paper Interface for Fine-grained and Flexible Interaction with Documents”, ACM MM, 2009. [3] X. Xie, G. Miao, R. Song, Ji-Rong Wen and Wei-Ying Ma, “Efficient Browsing of Web Search Results on Mobile Devices Based on Block Importance Model,” Proc. Pervasive Computing and Communications, IEEE, 2005. [4] G. Hattori, K. Hoashi, K. Matsumoto, F. Sugaya, “Robust Web Page Segmentation for Mobile Terminal Using Content-Distances and Page Layout Information”, ACM WWW, 2007. [5] O. Okun, D. Doermann and M. Pietikainen, “Page Segmentation and Zone Classification: The State of the Art,” in UMD, 1999. [6] Chih-Chung Chang and Chih-Jen Lin, “LIBSVM: a library for support vector machines”, 2001. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm [7] A. Bosch, A. Zisserman and X. Munoz, “Representing shape with a spatial pyramid kernel”, CIVR, 2007. [8] GEDI: Groundtruthing Editor http://gedigroundtruth.sourceforge.net/ [9] A. Antonacopoulos, B. Gatos and D. Bridson, “ICDAR2005 Page Segmentation Competition”, ICDAR, 2005. [10] S. Uchihashi, J. Foote, A. Girgensohn, and J. Boreczky. Video Manga: generating semantically meaningful video summaries. In Proc. ACM Multimedia (MM), 1999. DOI= http://dx.doi.org/10.1145/319463.319654 [11] C. Rother, L. Bordeaux, Y. Hamadi, and A. Blake. AutoCollage. In Proc. ACM SIGGRAPH 2006. DOI= http://dx.doi.org/10.1145/1179352.1141965 [12] M. H. Lee, N. Singhal, S. Cho, and In Kyu Park. Mobile Photo Collage. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2010. DOI= http://dx.doi.org/10.1109/CVPRW.2010.5543752 [13] S. Goferman, A. Tal, and L. Zelnik-Manor. Puzzle-like Collage. In EUROGRAPHICS, 2010. [14] J. Harel, C. Koch, and P. Perona. Graph-Based Visual Saliency. In Proc. Neural Information Processing Systems (NIPS), 2006. [15] S.C. Lee and S. Zhai. The Performance of Touch Screen Soft Buttons. In Proc. ACM Conference on Human Factors in Computing Systems (CHI), pages 309–318, 2009. DOI= http://dx.doi.org/10.1145/1518701.1518750 [16] A. Bosch, A. Zisserman, and X. Munoz. Representing shape with a spatial pyramid kernel. In Proc. ACM international conference on image and video retrieval (CIVR), 2007. DOI= http://dx.doi.org/10.1145/1282280.1282340 [17] Nielsen Company. State of the Media - Mobile Usage Trends: Q3 and Q4 2010. http://blog.nielsen.com/nielsenwire/online_mobile/number-of-americans-watching-mobile-video-grows-more-than-40-in-last-year/
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/9236	-
dc.description.abstract	手持裝置服務的興起（例如：電子書、影片串流）以及手持裝置的盛行率不但顯露出在手機上閱讀/觀看的需求，也顯示了在手機服務開發上被看好的商業機會。然而，在不同的服務上都有其所面臨的挑戰。以在手機上閱讀電子書來說，不相容的閱讀器、不一致的電子書格式，甚至是有限的螢幕大小，都會導致在手持裝置上閱讀文件的不便。同時，要在手機上閱讀那些沒有數位副本的紙本雜誌是很困難的。因此，我們提出一個系統「Snap2Read」可以自動地切割手機上拍下的文件圖片（從紙本雜誌）並將它們轉成可閱讀的片段（patches）（像是文字、標題、圖片等等）然後將他們縮放、裁切成適合的大小以便讓使用者可以透過手機上的點擊就能夠很簡單地瀏覽數位化的雜誌頁面。另一個在手機上熱門的活動則是觀看影片，但是極小的手機螢幕尺寸、有限的頻寬，以及零碎的使用時間仍然阻礙了使用者的體驗：它們要不就是中斷了使用者的觀看過程，抑或是讓使用者無法一次瀏覽多樣的內容。而傳統的影片摘要技術並不能應用在有限的螢幕上，因此我們提出了「Comp2Watch」這個系統，發音近似於「come to watch」。這個名字也有著「將影片畫面組合成美術拼貼」以及「壓縮觀看時間」的意義。它考慮了感興趣的區域（ROI）因素讓使用者能夠快速地瞥過影片，並且我們也修改了價值函數（cost function）用來整合不同長寬比的樣板，我們也處理了因為有限空間而導致的單調排版（monotone layout）問題。實驗結果顯示使用者可以在沒有遺失太多週邊資訊的情況下獲得更清楚的畫面主體。	zh_TW
dc.description.abstract	The rise of mobile services (e.g., e-books, video streaming) and the prevalence of mobile devices reveal both the need for mobile reading and watching and the growing business opportunities in mobile service development. However, each service faces its own challenges. For reading on mobile devices, incompatible e-book readers, non-uniform e-book formats, and limited screen size all make reading documents on handheld devices inconvenient. Meanwhile, it is difficult to read physical magazines that have no corresponding digital copy. Therefore, we propose a system, Snap2Read, that automatically segments document images captured by a mobile phone (i.e., from physical magazines) into readable patches (e.g., text, title, image) and then scales them to a suitable size, so that users can easily browse the digitized magazine pages on the phone with simple clicks. Another popular mobile activity is watching videos, but small screen size, low bandwidth, and fragmented watching time hinder the user experience: they either interrupt the watching process or prevent users from browsing a variety of content at once. Traditional video summarization techniques do not work well on small screens. Therefore, we propose a system, Comp2Watch, pronounced like “come to watch”. The name also conveys “composing the frames into a collage” and “compressing the watching time”. It takes region-of-interest (ROI) factors into consideration to help users take a quick glance at videos. We also modify the cost function to incorporate templates with variable aspect ratios, and we address the monotone layout problem caused by the limited space. The experimental results show that users can see the main subject more clearly without losing much of the surrounding context.	en
dc.description.provenance	Made available in DSpace on 2021-05-20T20:14:03Z (GMT). No. of bitstreams: 1 ntu-100-R98922030-1.pdf: 8218422 bytes, checksum: 2ca4f9ea614499327f35b789127bf3cd (MD5) Previous issue date: 2011	en
dc.description.tableofcontents	Abstract (Chinese) i Abstract ii Chapter 1 Introduction 1 1.1 Snap2Read 1 1.2 Comp2Watch 5 Chapter 2 Mobile Magazine Reading Enhancement – Snap2Read 11 2.1 Page Segmentation 11 2.2 Zone Classification 15 2.3 Mobile Adaptation 17 2.4 Experimental Results 18 Chapter 3 Mobile Video Watching Enhancement – Comp2Watch 27 3.1 Related Work 27 3.2 Keyframe Selection 29 3.3 ROI Packing 32 3.4 Experiments 36 Chapter 4 Conclusions and Future Work 45 Bibliography 47
dc.language.iso	zh-TW
dc.title	Snap2Read/Comp2Watch: 增進手持裝置平台上的多媒體瀏覽體驗	zh_TW
dc.title	Snap2Read/Comp2Watch: Enhancing the Multimedia Browsing Experience on Mobile Devices	en
dc.type	Thesis
dc.date.schoolyear	99-2
dc.description.degree	Master's
dc.contributor.oralexamcommittee	黃俊翔(Chun-Hsiang Huang),林軒田(Hsuan-Tien Lin)
dc.subject.keyword	手持裝置,雜誌,影片,多媒體內容分析,改寫	zh_TW
dc.subject.keyword	mobile device,magazine,video,multimedia content analysis,adaptation	en
dc.relation.page	48
dc.rights.note	Authorization granted (open access worldwide)
dc.date.accepted	2011-08-14
dc.contributor.author-college	電機資訊學院	zh_TW
dc.contributor.author-dept	資訊工程學研究所	zh_TW
Appears in Collections:	Department of Computer Science and Information Engineering

Files in This Item:

File	Size	Format
ntu-100-1.pdf	8.03 MB	Adobe PDF	View/Open

Items in this system are protected by copyright, with all rights reserved, unless their copyright terms are otherwise indicated.