Please use this Handle URI to cite this document:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/26326
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 陳達仁(Dar-Zen Chen) | |
dc.contributor.author | Yee-Ting Cho | en |
dc.contributor.author | 邱宜霆 | zh_TW |
dc.date.accessioned | 2021-06-08T07:06:26Z | - |
dc.date.copyright | 2008-09-02 | |
dc.date.issued | 2008 | |
dc.date.submitted | 2008-08-29 | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/26326 | - |
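dc.description.abstract | Each patent classification code in a patent document denotes a well-defined technical field. This study takes U.S. patent documents as its research subject and uses U.S. patent classification codes as the representative features of each patent document. Based on these features, patents with high technical similarity are grouped to build patent technology clusters. Because each U.S. patent document carries one or more U.S. patent classification codes, a patent may reflect several technical viewpoints and can be represented by a feature list.
The U.S. patent classification system consists of multiple hierarchical classification schedules, each representing the collection of one major technical field. The clustering rule computes the technical similarity between patent documents from their classification-code feature lists. The similarity can be derived solely from the conceptual distance between classification codes within the same hierarchical schedule, or it can additionally take into account how often codes from different schedules co-occur in the same patents. Clustering follows a two-stage procedure: the first stage uses hierarchical clustering to obtain a suitable number of clusters, and the second stage uses non-hierarchical clustering to obtain the patents belonging to each cluster. Finally, case studies examine whether different similarity factors lead to different clustering results for patent sets with different technical feature compositions. | zh_TW |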
dc.description.abstract | Each classification code in a patent document represents a definite technical domain. This thesis takes U.S. patents as its research source and selects the U.S. patent classification (USPC) code as the feature that represents a patent. Based on these features, technology clusters are formed from highly similar patents. Because a U.S. patent carries one or more USPC codes, it may have one or several technical viewpoints and can be represented by a feature list. The U.S. classification system is composed of many USPC schedules, and each USPC schedule represents a set of technical domains.
The technology clusters are formed from the similarities between the patents' feature lists. The similarity can be measured in two ways: one considers only the conceptual distance between two USPC codes under the same USPC schedule; the other additionally considers the pair coupled rate of two USPC codes under different USPC schedules. This thesis uses a two-stage clustering algorithm to obtain the technology clusters: the first stage uses a hierarchical clustering algorithm to determine the number of clusters, and the second stage uses a non-hierarchical clustering algorithm to obtain the members of each cluster. Finally, case studies discuss whether different patent similarity measures result in different technology clusters. | en |
dc.description.provenance | Made available in DSpace on 2021-06-08T07:06:26Z (GMT). No. of bitstreams: 1 ntu-97-R95546028-1.pdf: 846231 bytes, checksum: 1ee1af58b086c6808c74b8759cdce63b (MD5) Previous issue date: 2008 | en |
dc.description.tableofcontents | Chapter 1 Introduction: 1.1 Background; 1.2 Related works and Literature Review; 1.3 Characteristic of Patent Document; 1.4 Motivation; 1.5 Research Objective. Chapter 2 Patent Characteristics By USPC: 2.1 Feature Representation for a Patent; 2.2 Nomenclature of USPC Schedule; 2.3 Hierarchical Relationship of USPC Schedule; 2.4 The Feature Composition Index for Patents. Chapter 3 Association between USPC under the Same Main Class: 3.1 Research Procedure; 3.2 USPC Distance under the Same Classification Schedule; 3.3 Distance between Classifications under the Same Main Class; 3.4 Patent Similarity. Chapter 4 Association between USPC under Different Main Classes: 4.1 Research Procedure; 4.2 Pair Coupled Rate between USPC under Different Main classes; 4.3 USPC Distance from the Pair Coupled Rate. Chapter 5 Patent Clustering: 5.1 Clustering Method; 5.3 The Measure of Patent Clustering Result. Chapter 6 Case Study: OLED: 6.1 Clustering Analysis without Similarity between Different USPC Schedules; 6.2 Clustering Analysis with Similarity between Different USPC Schedules; 6.3 Discussion the Difference of Result between Two Analyses. Chapter 7 Case Study: Dexterous Hand of Robot: 7.1 Clustering Analysis without Similarity between Different USPC Schedules; 7.2 Clustering Analysis with Similarity between Different USPC Schedules; 7.3 Discussion the Difference of Result between Two Analyses. Chapter 8 Conclusions. References | |
dc.language.iso | en | |
dc.title | 應用專利所屬分類號及分類階層架構之技術叢集研究 | zh_TW |
dc.title | Patent Technology Clustering Based on Their Classification and the Classification Schedule | en |
dc.type | Thesis | |
dc.date.schoolyear | 96-2 | |
dc.description.degree | Master's | |
dc.contributor.oralexamcommittee | 黃慕萱(Mu-Hsuan Huang),林正平(Chang-Pin Lin),謝文賓(Win-Bin Hsieh) | |
dc.subject.keyword | patent clustering, patent classification code, U.S. patent classification schedule, patent similarity, technology cluster | zh_TW |
dc.subject.keyword | patent cluster, patent classification, USPC schedule, patent similarity, technology clustering | en |
dc.relation.page | 56 | |
dc.rights.note | Not authorized | |
dc.date.accepted | 2008-08-29 | |
dc.contributor.author-college | College of Engineering | zh_TW |
dc.contributor.author-dept | Institute of Industrial Engineering | zh_TW |
Appears in Collections: | Institute of Industrial Engineering |
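
The abstract describes two similarity factors and a two-stage clustering procedure without giving formulas. The Python sketch below is only an illustration of that idea under stated assumptions, not the thesis's actual method: the toy hierarchy `PARENT`, the patents in `PATENTS`, the `1 / (1 + distance)` conversion, the Jaccard-style `coupled_rate`, the average-linkage stopping threshold `min_sim`, and the medoid-based second stage are all hypothetical choices made for this example.

```python
from itertools import combinations

# Hypothetical toy schedules: code -> parent code (None marks a schedule root).
PARENT = {
    "313": None, "313/498": "313", "313/504": "313/498", "313/506": "313/498",
    "428": None, "428/690": "428", "428/917": "428",
}
# Hypothetical patents, each represented by its feature list of classification codes.
PATENTS = {
    "P1": ["313/504", "428/690"],
    "P2": ["313/506", "428/690"],
    "P3": ["313/504"],
    "P4": ["428/917"],
    "P5": ["428/917", "428/690"],
}

def path_to_root(code):
    """Return [code, parent, ..., root] by following PARENT links."""
    path = [code]
    while PARENT[path[-1]] is not None:
        path.append(PARENT[path[-1]])
    return path

def concept_distance(a, b):
    """Edges between a and b via their lowest common ancestor; None across schedules."""
    pa, pb = path_to_root(a), path_to_root(b)
    if pa[-1] != pb[-1]:
        return None
    depth_in_b = {code: i for i, code in enumerate(pb)}
    for i, code in enumerate(pa):
        if code in depth_in_b:
            return i + depth_in_b[code]

def coupled_rate(a, b, patents):
    """Assumed Jaccard-style rate: patents with both codes over patents with either."""
    both = sum(1 for codes in patents.values() if a in codes and b in codes)
    either = sum(1 for codes in patents.values() if a in codes or b in codes)
    return both / either if either else 0.0

def code_similarity(a, b, patents, use_coupling):
    """Same schedule: decay with distance. Different schedules: coupling or zero."""
    d = concept_distance(a, b)
    if d is not None:
        return 1.0 / (1.0 + d)
    return coupled_rate(a, b, patents) if use_coupling else 0.0

def patent_similarity(p, q, patents, use_coupling):
    """Symmetric average of each code's best match in the other feature list."""
    def one_way(xs, ys):
        best = [max(code_similarity(x, y, patents, use_coupling) for y in ys) for x in xs]
        return sum(best) / len(xs)
    return 0.5 * (one_way(patents[p], patents[q]) + one_way(patents[q], patents[p]))

def average_link(c1, c2, sim):
    """Mean pairwise similarity between two clusters."""
    return sum(sim[i][j] for i in c1 for j in c2) / (len(c1) * len(c2))

def choose_seeds(ids, sim, min_sim):
    """Stage 1: agglomerate until the best merge falls below min_sim."""
    clusters = [[i] for i in ids]
    while len(clusters) > 1:
        x, y = max(combinations(range(len(clusters)), 2),
                   key=lambda xy: average_link(clusters[xy[0]], clusters[xy[1]], sim))
        if average_link(clusters[x], clusters[y], sim) < min_sim:
            break
        merged = clusters.pop(y)  # y > x, so index x stays valid
        clusters[x].extend(merged)
    return clusters

def assign_members(ids, sim, seeds, rounds=10):
    """Stage 2: reassign each patent to its most similar cluster medoid."""
    clusters = seeds
    for _ in range(rounds):
        medoids = [max(c, key=lambda m: sum(sim[m][j] for j in c)) for c in clusters]
        new = [[] for _ in medoids]
        for i in ids:
            new[max(range(len(medoids)), key=lambda k: sim[i][medoids[k]])].append(i)
        new = [c for c in new if c]
        if new == clusters:
            break
        clusters = new
    return clusters

def two_stage(patents, use_coupling, min_sim=0.5):
    """Build the similarity matrix, then run stage 1 and stage 2."""
    ids = sorted(patents)
    sim = {p: {q: patent_similarity(p, q, patents, use_coupling) for q in ids} for p in ids}
    return assign_members(ids, sim, choose_seeds(ids, sim, min_sim))

if __name__ == "__main__":
    for flag in (False, True):
        print("use_coupling =", flag, two_stage(PATENTS, use_coupling=flag))
```

Running `two_stage` once with `use_coupling=False` and once with `use_coupling=True` mirrors the comparison the case studies make: the only difference between the two runs is whether codes from different schedules contribute to the patent similarity.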
Files in This Item:
File | Size | Format | |
---|---|---|---|
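ntu-97-1.pdf (not currently authorized for public access) | 826.4 kB | Adobe PDF |
Items in this system are protected by copyright, with all rights reserved, unless their copyright terms are otherwise indicated.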