Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊工程學系
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/9236
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor徐宏民(Winston H. Hsu)
dc.contributor.authorYu-Ming Hsuen
dc.contributor.author許裕明zh_TW
dc.date.accessioned2021-05-20T20:14:03Z-
dc.date.available2013-08-18
dc.date.available2021-05-20T20:14:03Z-
dc.date.copyright2011-08-18
dc.date.issued2011
dc.date.submitted2011-08-13
dc.identifier.citation[1] B. Erol, E. Antunez and J. J. Hull, “HOTPAPER: Multimedia Interaction with Paper using Mobile Phones”, ACM Conference, 2008.
[2] C. Liao and Q. Liu, “PACER: Toward A Cameraphone-based Paper Interface for Fine-grained and Flexible Interaction with Documents”, ACM MM, 2009.
[3] X. Xie, G. Miao, R. Song, Ji-Rong Wen and Wei-Ying Ma, “Efficient Browsing of Web Search Results on Mobile Devices Based on Block Importance Model,” Proc. Pervasive Computing and Communications, IEEE, 2005.
[4] G. Hattori, K. Hoashi, K. Matsumoto, F. Sugaya, “Robust Web Page Segmentation for Mobile Terminal Using Content-Distances and Page Layout Information”, ACM WWW, 2007.
[5] O. Okun, D. Doermann and M. Pietikainen, “Page Segmentation and Zone Classification: The State of the Art,” in UMD, 1999.
[6] Chih-Chung Chang and Chih-Jen Lin, “LIBSVM: a library for support vector machines”, 2001. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
[7] A. Bosch, A. Zisserman and X. Munoz, “Representing shape with a spatial pyramid kernel”, CIVR, 2007.
[8] GEDI: Groundtruthing Editor http://gedigroundtruth.sourceforge.net/
[9] A. Antonacopoulos, B. Gatos and D. Bridson, “ICDAR2005 Page Segmentation Competition”, ICDAR, 2005.
[10] S. Uchihashi, J. Foote, A. Girgensohn, and J. Boreczky. Video Manga: generating semantically meaningful video summaries. In Proc. ACM Multimedia (MM), 1999. DOI= http://dx.doi.org/10.1145/319463.319654
[11] C. Rother, L. Bordeaux, Y. Hamadi, and A. Blake. AutoCollage. In Proc. ACM SIGGRAPH 2006. DOI= http://dx.doi.org/10.1145/1179352.1141965
[12] M. H. Lee, N. Singhal, S. Cho, and In Kyu Park. Mobile Photo Collage. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2010. DOI= http://dx.doi.org/10.1109/CVPRW.2010.5543752
[13] S. Goferman, A. Tal, and L. Zelnik-Manor. Puzzle-like Collage. In EUROGRAPHICS, 2010.
[14] J. Harel, C. Koch, and P. Perona. Graph-Based Visual Saliency. In Proc. Neural Information Processing Systems (NIPS), 2006.
[15] S.C. Lee and S. Zhai. The Performance of Touch Screen Soft Buttons. In Proc. ACM Conference on Human Factors in Computing Systems (CHI), pages 309–318, 2009. DOI= http://dx.doi.org/10.1145/1518701.1518750
[16] A. Bosch, A. Zisserman, and X. Munoz. Representing shape with a spatial pyramid kernel. In Proc. ACM international conference on image and video retrieval (CIVR), 2007. DOI= http://dx.doi.org/10.1145/1282280.1282340
[17] Nielsen Company. State of the Media - Mobile Usage Trends: Q3 and Q4 2010. http://blog.nielsen.com/nielsenwire/online_mobile/number-of-americans-watching-mobile-video-grows-more-than-40-in-last-year/
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/9236-
dc.description.abstract手持裝置服務的興起(例如:電子書、影片串流)以及手持裝置的盛行率不但顯露出在手機上閱讀�觀看的需求,也顯示了在手機服務開發上被看好的商業機會。然而,在不同的服務上都有其所面臨的挑戰。以在手機上閱讀電子書來說,不相容的閱讀器、不一致的電子書格式,甚至是有限的螢幕大小,都會導致在手持裝置上閱讀文件的不便。同時,要在手機上閱讀那些沒有數位副本的紙本雜誌是很困難的。因此,我們提出一個系統「Snap2Read」可以自動地切割手機上拍下的文件圖片(從紙本雜誌)並將它們轉成可閱讀的片段(patches)(像是文字、標題、圖片等等)然後將他們縮放、裁切成適合的大小以便讓使用者可以透過手機上的點擊就能夠很簡單地瀏覽數位化的雜誌頁面。另一個在手機上熱門的活動則是觀看影片,但是極小的手機螢幕尺寸、有限的頻寬,以及零碎的使用時間仍然阻礙了使用者的體驗:它們要不就是中斷了使用者的觀看過程,抑或是讓使用者無法一次瀏覽多樣的內容。而傳統的影片摘要技術並不能應用在有限的螢幕上,因此我們提出了「Comp2Watch」這個系統,發音近似於「come to watch」。這個名字也有著「將影片畫面組合成美術拼貼」以及「壓縮觀看時間」的意義。它考慮了感興趣的區域(ROI)因素讓使用者能夠快速地瞥過影片,並且我們也修改了價值函數(cost function)用來整合不同長寬比的樣板,我們也處理了因為有限空間而導致的單調排版(monotone layout)問題。實驗結果顯示使用者可以在沒有遺失太多週邊資訊的情況下獲得更清楚的畫面主體。zh_TW
dc.description.abstractThe rise of mobile services (e.g., electronic book, video streaming) and the prevalence of mobile devices reveal the needs for mobile reading/watching and the booming business opportunities in mobile service developments. However, there are different challenges among those services. For reading books on mobile devices, incompatible e-book readers, non-uniformed e-book formats, or even limited screen size causes the inconvenience of reading documents on handheld devices. Meanwhile, it is difficult to read physical magazines that do not have the corresponding digital copies. Therefore, we propose a system, Snap2Read, that can automatically segment the captured document images (i.e., from the physical magazines) in mobile phones into readable patches (e.g. text, title, image), and then scale them into suitable size so that users can easily browse the digitalized magazine pages via the mobile phone with simple clicks. Another popular activity on mobile is watching videos, but the small mobile screen size, low bandwidth, and fragmented watching time also hinder the user experiences: they either interrupt the watching process or limit users to browse many contents at the same time. Traditional video summarization techniques are suffering the small screen issue. Therefore, we propose a system, Comp2Watch, which is pronounced like “come to watch”. It implies the meaning of “composing the frames into a collage” and “compressing the watching time”. It puts ROI factors into consideration in order to help users take a quick glance at videos. Also, we modify the cost function to incorporate the templates with variable aspect ratios. We also address the monotone layout problem caused by the limited space. The experimental results show that users can obtain clearer subject without losing many contexts.en
dc.description.provenanceMade available in DSpace on 2021-05-20T20:14:03Z (GMT). No. of bitstreams: 1
ntu-100-R98922030-1.pdf: 8218422 bytes, checksum: 2ca4f9ea614499327f35b789127bf3cd (MD5)
Previous issue date: 2011
en
dc.description.tableofcontents摘要 i
Abstract ii
Chapter 1 Introduction 1
1.1 Snap2Read 1
1.2 Comp2Watch 5
Chapter 2 Mobile Magazine Reading Enhancement – Snap2Read 11
2.1 Page Segmentation 11
2.2 Zone Classification 15
2.3 Mobile Adaptation 17
2.4 Experimental Results 18
Chapter 3 Mobile Video Watching Enhancement – Comp2Watch 27
3.1 Related Work 27
3.2 Keyframe Selection 29
3.3 ROI Packing 32
3.4 Experiments 36
Chapter 4 Conclusions and Future Work 45
Bibliography 47
dc.language.isozh-TW
dc.titleSnap2Read/Comp2Watch: 增進手持裝置平台上的多媒體瀏覽體驗zh_TW
dc.titleSnap2Read/Comp2Watch: Enhancing the Multimedia Browsing Experience on Mobile Devicesen
dc.typeThesis
dc.date.schoolyear99-2
dc.description.degree碩士
dc.contributor.oralexamcommittee黃俊翔(Chun-Hsiang Huang),林軒田(Hsuan-Tien Lin)
dc.subject.keyword手持裝置,雜誌,影片,多媒體內容分析,改寫,zh_TW
dc.subject.keywordmobile device,magazine,video,multimedia content analysis,adaptation,en
dc.relation.page48
dc.rights.note同意授權(全球公開)
dc.date.accepted2011-08-14
dc.contributor.author-college電機資訊學院zh_TW
dc.contributor.author-dept資訊工程學研究所zh_TW
顯示於系所單位:資訊工程學系

文件中的檔案:
檔案 大小格式 
ntu-100-1.pdf8.03 MBAdobe PDF檢視/開啟
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved