請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/5281
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 于天立(Tian-Li Yu) | |
dc.contributor.author | Jie-Wei Wu | en |
dc.contributor.author | 吳潔薇 | zh_TW |
dc.date.accessioned | 2021-05-15T17:54:59Z | - |
dc.date.available | 2014-07-29 | |
dc.date.available | 2021-05-15T17:54:59Z | - |
dc.date.copyright | 2014-07-29 | |
dc.date.issued | 2014 | |
dc.date.submitted | 2014-07-21 | |
dc.identifier.citation | [1] C. Basu, H. Hirsh, and W. Cohen. Recommendation as classification: Using social and content-based information in recommendation. In In Proceedings of the Fifteenth National Conference on Artificial Intelligence, pages 714–720. AAAI Press, 1998.
[2] J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive al- gorithms for collaborative filtering. In Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, UAI’98, pages 43–52, San Francisco, CA, USA, 1998. Morgan Kaufmann Publishers Inc. [3] M. Claypool, A. Gokhale, T. Miranda, P. Murnikov, D. Netes, and M. Sartin. Combining content-based and collaborative filters in an online newspaper. In Proceedings of ACM SIGIR workshop on recommender systems, volume 60. Citeseer, 1999. [4] M. Connor and J. Herlocker. Clustering items for collaborative filtering, 2001. [5] J. Davidson, B. Liebald, J. Liu, P. Nandy, T. Van Vleet, U. Gargi, S. Gupta, Y. He, M. Lambert, B. Livingston, and D. Sampath. The youtube video rec- ommendation system. In Proceedings of the Fourth ACM Conference on Rec- ommender Systems, RecSys ’10, pages 293–296, New York, NY, USA, 2010. ACM. [6] J. Davis and M. Goadrich. The relationship between precision-recall and roc curves. In Proceedings of the 23rd international conference on Machine learning, pages 233–240. ACM, 2006. [7] S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. JASIS, 41(6):391–407, 1990. [8] J. A. Hartigan. Clustering. Annual review of biophysics and bioengineering, 2(1):81–102, 1973. 44 [9] J. A. Hartigan and M. A. Wong. Algorithm as 136: A k-means clustering algorithm. Applied statistics, pages 100–108, 1979. [10] J. L. Herlocker, J. A. Konstan, A. Borchers, and J. Riedl. An algorithmic framework for performing collaborative filtering. In Proceedings of the 22Nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’99, pages 230–237, New York, NY, USA, 1999. ACM. [11] J. L. Herlocker, J. A. Konstan, A. Borchers, and J. Riedl. An algorithmic framework for performing collaborative filtering. In Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, pages 230–237. ACM, 1999. [12] F. Jelinek. Statistical methods for speech recognition. 1997. [13] J. A. Konstan, B. N. Miller, D. Maltz, J. L. Herlocker, L. R. Gordon, and J. Riedl. Grouplens: applying collaborative filtering to usenet news. Commu- nications of the ACM, 40(3):77–87, 1997. [14] G. Linden, J. Jacobi, and E. Benson. Collaborative recommendations using item-to-item similarity mappings, July 24 2001. US Patent 6,266,649. [15] G. Linden, B. Smith, and J. York. Amazon.com recommendations: item-to- item collaborative filtering. Internet Computing, IEEE, 7(1):76–80, Jan 2003. [16] H. T. Ng, W. B. Goh, and K. L. Low. Feature selection, perceptron learning, and a usability case study for text categorization. SIGIR Forum, 31(SI):67–73, July 1997. [17] G. Salton, A. Wong, and C. S. Yang. A vector space model for automatic indexing. Commun. ACM, 18(11):613–620, Nov. 1975. [18] B. Sarwar, G. Karypis, J. Konstan, and J. Riedl. Analysis of recommendation algorithms for e-commerce. In Proceedings of the 2Nd ACM Conference on Electronic Commerce, EC ’00, pages 158–167, New York, NY, USA, 2000. ACM. [19] B. Sarwar, G. Karypis, J. Konstan, and J. Riedl. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th International Conference on World Wide Web, WWW ’01, pages 285–295, New York, NY, USA, 2001. ACM. 45 [20] F. Sebastiani. Machine learning in automated text categorization. ACM Com- put. Surv., 34(1):1–47, Mar. 2002. [21] G. Shani and A. Gunawardana. Evaluating recommendation systems. In Rec- ommender systems handbook, pages 257–297. Springer, 2011. [22] A. Singhal. Modern information retrieval: A brief overview. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 24(4):35– 43, 2001. [23] X. Su and T. M. Khoshgoftaar. A survey of collaborative filtering techniques. Advances in artificial intelligence, 2009:4, 2009. [24] X. Su and T. M. Khoshgoftaar. A survey of collaborative filtering techniques. Adv. in Artif. Intell., 2009:4:2–4:2, Jan. 2009. [25] L. Terveen, W. Hill, B. Amento, D. McDonald, and J. Creter. Phoaks: A system for sharing recommendations. Communications of the ACM, 40(3):59–62, 1997. [26] H. Tseng. A conditional random field word segmenter. In In Fourth SIGHAN Workshop on Chinese Language Processing, 2005. [27] L. Ungar, D. Foster, E. Andre, S. Wars, F. S. Wars, D. S. Wars, and J. H. Whispers. Clustering methods for collaborative filtering. AAAI Press, 1998. [28] R. Van Meteren and M. Van Someren. Using content-based filtering for rec- ommendation. In Proceedings of the Machine Learning in the New Information Age: MLnet/ECML2000 Workshop, 2000. [29] C. Wang and D. M. Blei. Collaborative topic modeling for recommending scientific articles. In Proceedings of the 17th ACM SIGKDD International Con- ference on Knowledge Discovery and Data Mining, KDD ’11, pages 448–456, New York, NY, USA, 2011. ACM. | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/5281 | - |
dc.description.abstract | 此篇研究提出了一種推薦系統設計,其中利用了以使用者為基礎的協同過濾、以項目為基礎的協同過濾和基於內容過濾的優點。
不同的是,上述兩種以使用者或項目為基礎的協同過濾,其在項目或使用者 空間中是高維度的。 而基於內容過濾的方式,雖然可以處理協同過濾的冷啟動問題,但對於發覺使用者可能的潛在項目卻較為無效。 此篇提出的推薦系統採用了「基於使用者和基於項目的概念空間」,維度的大小,或概念空間的數量,在有必要時才增加,另外該系統能利用產生另一種在文章上的向量維度,以處理冷啟動問題。 更多的是,推薦能隨著時間更改演化,在快速增加的訊息下,避免重複的過程是必要的。 概念空間的為度是依照項目的特徵來設立,另外「概念」 是項目概念空間分群的結果,也就是文章分群的結果。 而使用者空間概念是項目概念空間根據使用者的行為調整後的。 譬如說使用者看了某兩個項目,我們便假設這兩個項目之間有關係。 這兩個概念空間互相演化,接著系統利用演化最後的「概念」來作推薦。 這樣的系統在實驗上實作在文章推薦上面。 在我們的例子中,項目是文章,項目特徵是文章的詞,而使用者行為就是讀者閱讀文章的紀錄。 在實驗中,以使用者為基礎的協同過濾和以項目為基礎的協同過濾的為度分別約是三萬和三千,這是根據實驗中的使用者和文章數量決定的。而我們提出的系統,由於使用了「概念」來作,以概念的個數視為維度,其大小從五開始,每個迭代都可能增加,最後在第十二次的迭代收斂,此時維度為八十七。 除此之外,所提出的系統能動態調整代表文章的向量維度, 文章的維度是用於將文章分群時的依據,文章會依照向量維度來計算相似度, 最後向量的長度是四十四, 新的文章則可以根據這個維度來加入分群以及被推薦。 精確度-召回率虛線顯示,我們所提出的推薦系統,使用者真正點擊推薦文章的比例,相較於以使用者為基礎的協同過濾、以項目為基礎的協同過濾和基於內容過濾,有更多的點擊比例。 另外在平均精確度也可,系統的平均隨著迭代次數增長並超過其他推薦系統。 我們希望概念空間的這個想法可以擴展到有可被提取的特徵與用戶之間交互關係的項目。 | zh_TW |
dc.description.abstract | This thesis proposes a recommendation system (RS) which incorporates the advantages of the user/item-based collaborative filtering (CF) and the content-based filtering.
Unlike the user/item-based CF where the user/item spaces are of high dimension, the proposed RS utilizes the user-based and item-based concept spaces where dimension, or the number of concepts, is increased only necessary. In addition, the proposed system can deal with the cold start problem with producing another kind dimension of items. With modifying clustering results, it can be used to create recommendation in the rapid increasing information. The dimension of the item-based concepts is defined by the features of the items, and concepts are the clustering result of the item-based concept space. The user-based concepts are the result of clustering adjustment from the item-based concepts with the information of users' behaviors, such as whether or not a user is interested in both items in a concept. The user-base and item-based concepts co-evolve iteratively in the above manner. At the end, the proposed RS utilizes the learned concepts combined with the reading dependence to perform recommendation. The proposed techniques are demonstrated on the article recommendation. In this case, the features of an item correspond to the segmented contents of an article, and users' behaviors correspond to users' reading preferences. In the experiment, the item-based/user-based CF dimension is about $30,000$ and $3,000$ while the concept space in proposed RS articles starts from $5$ and ended up merely $87$ after $12$ iterations. The proposed RS dynamically adjust the dimension of articles. The dimensions of articles is $44$ in the end and used for clustering articles. New articles then can be clustered and recommended as well. The precision-recall curves indicates that the proposed RS achieves more hits than user-based/item-based CF and content-based filtering. The average precision-recall curves and mean average precision of proposed system grows and exceeds others. This idea of two concept spaces can be extended to the situation with items with extractable features as dimension and the interaction between items and users. | en |
dc.description.provenance | Made available in DSpace on 2021-05-15T17:54:59Z (GMT). No. of bitstreams: 1 ntu-103-R01921056-1.pdf: 836094 bytes, checksum: 5543e444d4c086875cbe6e076ea219ae (MD5) Previous issue date: 2014 | en |
dc.description.tableofcontents | 致謝 i
中文摘要 ii Abstract iv 1 Introduction (p1) 2 Background (p6) 2.1 Collaborative Filtering Based Recommendation Systems . . . . . . . (p6) 2.1.1 Overview of CFProcess ..................... (p7) 2.1.2 Memory-based CF ........................ (p7) 2.1.3 Model-based CF ......................... (p11) 2.1.4 Challenges of CF ......................... (p12) 2.2 Content-basedFiltering.......................... (p13) 2.3 Hybrid Approaches Combined with CF and Content-based Filtering . (p15) 3 System Design (p18) 3.1 Symbols.................................. (p18) 3.2 Framework................................. (p21) 3.3 Concepts.................................. (p23) 3.3.1 Item-based Concepts ....................... (p24) 3.3.2 User-based Concepts ....................... (p24) 3.4 Stages of Operation............................ (p24) 3.4.1 Keyword Generation Stage.................... (p25) 3.4.2 Concept Reinforcement Stage .................. (p26) 3.5 Recommendation ............................. (p28) 3.5.1 Generating Reading Dependence ................ (p29) 3.5.2 Ranking of Articles........................ (p29) 4 Experiments and Results (p32) 4.1 DataSet and Configuration ....................... (p32) 4.2 Evaluation Metrics ............................ (p33) 4.3 Evaluation Process ............................ (p36) 4.3.1 Testing Data ........................... (p36) 4.3.2 Evaluation of Proposed System ................. (p36) 4.3.3 Evaluation of CF ......................... (p36) 4.3.4 Evaluation of Content-based Filtering. . . . . . . . . . . . . . (p37) 4.4 Results and Discussion .......................... (p37) 5 Conclusion (p42) Bibliography (p44) | |
dc.language.iso | en | |
dc.title | 基於使用者偏好可調概念空間的推薦系統設計 | zh_TW |
dc.title | User Preference Based Recommendation System Design with Adaptive Concept Space | en |
dc.type | Thesis | |
dc.date.schoolyear | 102-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 鄭卜壬(Pu-Jen Cheng),張時中(Shi-Chung Chang),陳穎平(Ying-ping Chen) | |
dc.subject.keyword | 推薦系統,協同式過濾,內容式資訊過濾,使用者閱讀行為, | zh_TW |
dc.subject.keyword | Recommendation system,collaborative filtering,content-based filtering,users’ reading behaviors, | en |
dc.relation.page | 46 | |
dc.rights.note | 同意授權(全球公開) | |
dc.date.accepted | 2014-07-21 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 電機工程學研究所 | zh_TW |
顯示於系所單位: | 電機工程學系 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-103-1.pdf | 816.5 kB | Adobe PDF | 檢視/開啟 |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。