請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/37656
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 李秀惠(Hsiu-Hui Lee) | |
dc.contributor.author | Yu-Shan Lin | en |
dc.contributor.author | 林于聖 | zh_TW |
dc.date.accessioned | 2021-06-13T15:37:10Z | - |
dc.date.available | 2008-07-11 | |
dc.date.copyright | 2008-07-11 | |
dc.date.issued | 2008 | |
dc.date.submitted | 2008-07-10 | |
dc.identifier.citation | [GKNG 2003] Suhit Gupta, Gail Kaiser, David Neistadt and Peter Grimm. “DOM-based Content Extraction of HTML Documents,” International WWW Conference, 2003.
[HMMS 1999] Masahiro Hori, Rakesh Mohan, Hiroshi Maruyama and Sandeep Singhal. “Annotation of Web Content for Transcoding,” W3C Note, July, 1999. http://www.w3.org/TR/annot/ [MBR 2007] J. Mahmud, Y. Borodin, and I. V. Ramakrishnan. “CSurf: A context-driven non-visual web browser,” International WWW Conference, 2007. [NCG] National Taiwan Univ. CSIE search engine powered by Google, URL= http://www.csie.ntu.edu.tw/ [TAFM 2002] H. Takagi, C. Asakawa, K. Fukuda, and J. Maeda. “Site-wide annotation: Reconstructing existing pages to be accessible,” ASSETS, 2002. [TEXTDIFF] URL= http://www.angusj.com/delphi/textdiff.html [WA] Wretch Album, URL= http://www.wretch.cc/album/ [WBI] Web Intermediaries, IBM Research, URL= http://www.almaden.ibm.com/cs/wbi/ [XMLUNIT] URL= http://xmlunit.sourceforge.net/ [XPATH] XML Path Language (XPath) 2.0, URL= http://www.w3.org/TR/xpath20/ [YAHOOBID] Yahoo Bid, URL= http://tw.bid.yahoo.com/ [YDICT] Yahoo Dictionary, URL= http://tw.dictionary.yahoo.com/ [YZ 2001] Y. Yang and H. Zhang. “HTML page analysis based on visual cues,” on Document Analysis and Recognition (ICDAR), 2001. | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/37656 | - |
dc.description.abstract | 隨著資訊時代的洪流,網際網路已經廣為眾人所接受,而網頁服務更是成為現代人生活中不可缺少的工具。各大公司企業紛紛提供相關網頁服務,為人們帶來便利,人們只須要坐在電腦前連上網路就可以查尋天氣、查尋公車捷運路線、查尋字典、查尋電影時刻表、金融轉帳等等。越來越多的網頁服務供人們選擇;但也因網頁繁多,每次使用一項服務就需要連結到一個新的網頁,並且連帶了很多不必要的資訊、廣告,使得使用者在使用服務時需要使用多餘的時間和精力來找出自己要的資訊,完成他的目的。
因此,許多業者如Yahoo、Google等也紛紛提供一些「小工具」(以下簡稱Gadget),如Yahoo Widget、Google Gadget等等,目的是要將人們所需要的服務萃取出其精華,然後以小視窗的方式呈現出來,幫助使用者節省時間、螢幕空間。但雖如此,每一個Gadget還是需要由程式設計師撰寫出來;以網頁服務來說,程式設計師需要先看過網頁原始碼,再由其中的超連結等資訊重新編寫出Gadget,而此步驟也是製作Gadget之過程中最耗費時間和人力的,況且對於一般沒有相關資訊背景的使用者來說,是個很高的門檻。 為了達成自動轉換小工具的目的,我們在Web Content Analysis和Web Page Partitioning這兩個領域中參考了一些相關研究,而本篇論文提出一些方法以及系統,此系統可透過Proxy、Web Server的方式,利用程式設計師在撰寫網頁和程式時的Control、View、Model分離的特性,將一般網頁自動轉換成Gadget給一般使用者使用,不需耗費太多人力,也不需要高深的網頁程式設計相關技術,即使一般沒有相關資訊背景的使用者也可以輕鬆操作此系統來產生出Gadget。 | zh_TW |
dc.description.abstract | Gadgets are more and more popular, especially the web gadgets, users could simply open the gadget platforms on internet. However, to make a gadget sometimes cost time on transforming a general website to a gadget. In this paper, we provide some mechanisms to help the transformation automatically, and simplify the steps on making new gadgets.
Before transforming a website, we need to analysis the HTML documents of it; there are already some forerunners’ researches on “Web Content and Contextual Analysis”, we can consult these researches when we do analyzing. The architectures must be also considered, “Web Page Partitioning” is also a popular area. There are some researches on proxy server for mobile devices and web transcoding mechanism. We consider the relevant contents and page themes can be separated. When users send different inputs to a search engine, different result pages would be generated. By comparing these result pages, we can extract the relevant contents that users need, and make new gadgets with these contents. Our system is based on proxy architecture, and we experiment on some real search engines. Fortunately, the results are excellent and satisfying. | en |
dc.description.provenance | Made available in DSpace on 2021-06-13T15:37:10Z (GMT). No. of bitstreams: 1 ntu-97-R95922082-1.pdf: 1870157 bytes, checksum: fac1617dc8213156d58b1ef7cc7abe11 (MD5) Previous issue date: 2008 | en |
dc.description.tableofcontents | 中文摘要 1
Abstract 2 Chapter 1 Introduction 3 Chapter 2 Related Works 5 Chapter 3 System Guide 8 3.1 Make a New Widget Page 8 3.2 Generate the XML documents for widget platforms 10 Chapter 4 System Architecture 13 4.1 System Overview 13 4.2 Mode Dispatcher 14 Chapter 5 Extraction Mechanism 18 5.1 Mechanism Overview 18 5.2 XML Processing utilities 20 5.3 HTML Document Partitioning 22 5.4 Partitioning Algorithm 23 5.5 Advertisement Filters 30 5.5.1 Overview 30 5.5.2 Basic Definition of Action and Element 30 5.5.3 Three Types of Advertisement 31 Chapter 6 Experiments 34 6.1 Experimented Web Sites 34 6.2 Experiment Method 34 6.2.1 Statistics of differences 34 6.2.2 Verification 36 Chapter 7 Conclusions and Future Works 40 7.1 Conclusions 40 7.2 Future Works 40 References 41 | |
dc.language.iso | en | |
dc.title | 自動化之網頁轉換小工具系統 | zh_TW |
dc.title | Automated Web to Gadget Transcoding System | en |
dc.type | Thesis | |
dc.date.schoolyear | 96-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 陳俊良(Chuen-Liang Chen),廖純中(Churn-Jung Liau) | |
dc.subject.keyword | 小工具,網頁自動轉換,Gadget自動轉換,小視窗,Widget自動轉換, | zh_TW |
dc.subject.keyword | gadget,widget,web-based gadget transcoding,automated transcoding system, | en |
dc.relation.page | 42 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2008-07-10 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 資訊工程學研究所 | zh_TW |
顯示於系所單位: | 資訊工程學系 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-97-1.pdf 目前未授權公開取用 | 1.83 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。