Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 共同教育中心
  3. 統計碩士學位學程
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/101758
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor陳彥賓zh_TW
dc.contributor.advisorYan-Bin Chenen
dc.contributor.author郭軒丞zh_TW
dc.contributor.authorShiuan-Cheng Kuoen
dc.date.accessioned2026-03-04T16:20:08Z-
dc.date.available2026-03-28-
dc.date.copyright2026-03-04-
dc.date.issued2026-
dc.date.submitted2026-02-11-
dc.identifier.citation[1] J. Albert-Smet, A. Torrente, and J. Romo. Band depth based initialization of K-means for functional data clustering. Advances in Data Analysis and Classification, 17:463–484, 2023.
[2] J. D. Banfield and A. E. Raftery. Model-based Gaussian and non-Gaussian clustering. Biometrics, 49:803–821, 1993.
[3] G. Brys, M. Hubert, and P. J. Rousseeuw. A robustification of independent component analysis. Journal of Chemometrics, 19:364–375, 2005.
[4] G. Brys, M. Hubert, and A. Struyf. A robust measure of skewness. Journal of Computational and Graphical Statistics, 13(4):996–1017, 2004.
[5] G. Celeux and G. Govaert. Gaussian parsimonious clustering models. Pattern Recognition, 28(5):781–793, 1995.
[6] S. Chakraborty and S. Das. Detecting meaningful clusters from highdimensional data: A strongly consistent sparse center-based clustering approach. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(6):2894–2908, 2020.
[7] P. Chaudhuri. On a geometric notion of quantiles for multivariate data. Journal of the American Statistical Association, 91(434):862–872, 1996.
[8] D. L. Donoho. Breakdown properties of multivariate location estimators. Ph.d. qualifying paper, Harvard University, Boston, MA, 1982.
[9] E. Elhamifar and R. Vidal. Sparse subspace clustering: Algorithm, theory, and applications. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(11):2765–2781, 2013.
[10] M. Ester, H.-P. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD-96), volume 96, pages 226–231, 1996.
[11] J. Guan, S. Li, J. Zhu, X. He, and J. Chen. Fast main density peak clustering within relevant regions via a robust decision graph. Pattern Recognition, 152:110458, 2024.
[12] L. Hubert and P. Arabie. Comparing partitions. Journal of Classification, 2:193–218, 1985.
[13] M. Hubert, P. J. Rousseeuw, and P. Segaert. Multivariate functional outlier detection. Statistical Methods & Applications, 24(2):177–202, 2015.
[14] M. Hubert, P. J. Rousseeuw, and P. Segaert. Multivariate and functional classification using depth and distance. Advances in Data Analysis and Classification, 11:445–466, 2017.
[15] M. Hubert, P. J. Rousseeuw, and K. Vanden Branden. ROBPCA: A new approach to robust principal component analysis. Technometrics, 47(1):64–79, 2005.
[16] M. Hubert and S. Van der Veeken. Outlier detection for skewed data. Journal of Chemometrics, 22(3-4):235–246, 2008.
[17] M.-H. Jeong, Y. Cai, C. J. Sullivan, and S. Wang. Data depth based clustering analysis. In Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pages 1–10, Burlingame, California, 2016. ACM.
[18] R. Jörnsten. Clustering and classification based on the L1 data depth. Journal of Multivariate Analysis, 90(1):67–89, 2004.
[19] J. MacQueen. Some methods for classification and analysis of multivariate observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1:281–297, 1967.
[20] P. C. Mahalanobis. On the generalized distance in statistics. Proceedings of the National Institute of Sciences of India, 2(1):49–55, 1936.
[21] P. D. McNicholas. Mixture model-based classification. Chapman and Hall/ CRC, Boca Raton, 2016.
[22] D. Paindaveine and G. Van Bever. From depth to local depth: A focus on centrality. Journal of the American Statistical Association, 108(503):1105–1119, 2013.
[23] K. Pearson. LIII. On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, 2(11):559–572, 1901.
[24] P. J. Rousseeuw. Least median of squares regression. Journal of the American Statistical Association, 79(388):871–880, 1984.
[25] P. J. Rousseeuw, I. Ruts, and J. W. Tukey. The bagplot: A bivariate boxplot. The American Statistician, 53(4):382–387, 1999.
[26] P. J. Rousseeuw and K. Van Driessen. A fast algorithm for the minimum covariance determinant estimator. Technometrics, 41(3):212–223, 1999.
[27] A. Torrente and J. Romo. Initializing k-means clustering by bootstrap and data depth. Journal of Classification, 38(2):232–256, 2021.
[28] J. W. Tukey. Mathematics and the picturing of data. In Proceedings of the International Congress of Mathematicians, volume 2, pages 523–531, Vancouver, 1974. Canadian Mathematical Congress.
[29] S. Wang, A. Leblanc, and P. D. McNicholas. Depth-Based Local Center Clustering: A framework for handling different clustering scenarios. arXiv preprint arXiv:2505.09516v1, May 2025. Version 1.
[30] S. Wang, A. Leblanc, and P. D. McNicholas. Depth-Based Local Center Clustering: A framework for handling different clustering scenarios. arXiv preprint arXiv:2505.09516v2, Jan. 2026. Version 2.
[31] Y. Zuo. Projection-based depth functions and associated medians. The Annals of Statistics, 31(5):1460–1490, 2003.
[32] Y. Zuo and R. Serfling. General notions of statistical depth function. The Annals of Statistics, 28(2):461–482, 2000.
-
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/101758-
dc.description.abstract在群集分析中,對具備非凸幾何、嚴重密度異質性與非對稱結構的資料進行無監督劃分,始終是一項根本性的挑戰。本研究擴展深度局部中心分群(DLCC)演算法,以處理其在資料分派上的限制。為緩解由嚴重密度異質性引發的結構失衡,本方法基於嚴格的鄰域重疊準則保留稀疏中心,並限制過渡群集的初始擴張,從而防止稀疏觀測值遭到系統性誤派。針對未分派的觀測值,本程序相對於結構核心,評估連續的深度適應性袋形距離。此幾何適應性距離能有效捕捉局部非等向性特徵與非對稱結構。經數值實驗證實,本框架能維持非凸幾何中的結構連通性,並防止在嚴重密度異質性與結構非對稱下發生系統性分類錯誤。zh_TW
dc.description.abstractThe unsupervised partitioning of data characterized by non-convex geometries, severe density heterogeneity, and asymmetric structures remains a fundamental challenge in cluster analysis. This study extends the Depth-Based Local Center Clustering (DLCC) algorithm to address its allocation limitations. To mitigate the structural imbalance induced by severe density heterogeneity, the methodology retains sparse centers based on a strict neighborhood overlap criterion and constrains the initial expansion of interim clusters, thereby preventing the systematic misallocation of sparse observations. For unallocated observations, the procedure evaluates continuous depth-adapted bagdistance relative to the structural cores. This geometry-adaptive distance captures local anisotropic features and asymmetric structures. Numerical experiments demonstrate that the framework maintains structural connectivity in non-convex geometries and prevents systematic misclassifications under severe density heterogeneity and structural asymmetry.en
dc.description.provenanceSubmitted by admin ntu (admin@lib.ntu.edu.tw) on 2026-03-04T16:20:08Z
No. of bitstreams: 0
en
dc.description.provenanceMade available in DSpace on 2026-03-04T16:20:08Z (GMT). No. of bitstreams: 0en
dc.description.tableofcontents口試委員會審定書i
摘要ii
Abstract iii
Contents iv
List of Figures vi
List of Tables vii
Chapter 1 Introduction 1
1.1 Background and Motivation . . . . . . . . . . . . . . . . . . . . . 1
1.2 Objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Contributions and Organization . . . . . . . . . . . . . . . . . . . 3
Chapter 2 Preliminaries 5
2.1 Statistical Depth Functions . . . . . . . . . . . . . . . . . . . . . 5
2.1.1 Mahalanobis Depth . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.1.2 Spatial Depth . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.1.3 Skew-Adjusted Projection Depth . . . . . . . . . . . . . . . . . . 7
2.2 Bagdistance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.3 DLCC Framework . . . . . . . . . . . . . . . . . . . . . . . . . . 11
Chapter 3 Methodology 13
3.1 Framework Overview . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.2 Phase I: Local Structure Identification . . . . . . . . . . . . . . . 14
3.2.1 Similarity Matrix and Local Centers . . . . . . . . . . . . . . . . 16
3.2.2 Frequency-Based Selection and Retention of Sparse Centers . . . 19
3.3 Phase II: Center Grouping and Initialization . . . . . . . . . . . . 21
3.3.1 Construction of Structural Cores . . . . . . . . . . . . . . . . . 21
3.3.2 Adjusting for Allocation Bias under Density Heterogeneity . . . 22
3.4 Phase III: Depth-Adapted Bagdistance Assignment . . . . . . . . 24
3.4.1 Definition of Depth-Adapted Bagdistance . . . . . . . . . . . . . 26
3.4.2 Depth Function Selection and Local Subspace Projection . . . . 27
3.5 Phase IV: Global Refinement . . . . . . . . . . . . . . . . . . . . 29
3.5.1 Minimizing Depth-Adapted Bagdistance . . . . . . . . . . . . . 29
3.5.2 Fixing Local Reference Regions . . . . . . . . . . . . . . . . . . 30
Chapter 4 Simulation Study 31
4.1 Simulation Settings and Scenarios . . . . . . . . . . . . . . . . . . 31
4.2 Competitors and Implementation Details . . . . . . . . . . . . . . 33
4.3 Performance Metrics . . . . . . . . . . . . . . . . . . . . . . . . . 35
4.4 Simulation Results and Discussion . . . . . . . . . . . . . . . . . 36
4.4.1 Discussion of Algorithmic Properties . . . . . . . . . . . . . . . 41
Chapter 5 Conclusion 43
References 46
-
dc.language.isoen-
dc.subject分群-
dc.subject深度函數-
dc.subject袋形距離-
dc.subject密度異質性-
dc.subjectClustering-
dc.subjectDepth Functions-
dc.subjectBagdistance-
dc.subjectDensity Heterogeneity-
dc.title應用一般化袋形距離於深度局部中心分群之框架研究zh_TW
dc.titleA Depth-Based Local Center Clustering Framework with Generalized Bagdistanceen
dc.typeThesis-
dc.date.schoolyear114-1-
dc.description.degree碩士-
dc.contributor.oralexamcommittee黃世豪;張馨文;藍俊宏zh_TW
dc.contributor.oralexamcommitteeShih-Hao Huang;Hsin-Wen Chang;Jakey Blueen
dc.subject.keyword分群,深度函數袋形距離密度異質性zh_TW
dc.subject.keywordClustering,Depth FunctionsBagdistanceDensity Heterogeneityen
dc.relation.page49-
dc.identifier.doi10.6342/NTU202600688-
dc.rights.note未授權-
dc.date.accepted2026-02-11-
dc.contributor.author-college共同教育中心-
dc.contributor.author-dept統計碩士學位學程-
dc.date.embargo-liftN/A-
顯示於系所單位:統計碩士學位學程

文件中的檔案:
檔案 大小格式 
ntu-114-1.pdf
  未授權公開取用
4.36 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved