Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 公共衛生學院
  3. 健康數據拓析統計研究所
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/102156
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor王彥雯zh_TW
dc.contributor.advisorCharlotte Wangen
dc.contributor.author蘇心俞zh_TW
dc.contributor.authorSin-Yu Suen
dc.date.accessioned2026-03-13T16:52:32Z-
dc.date.available2026-03-14-
dc.date.copyright2026-03-13-
dc.date.issued2025-
dc.date.submitted2026-01-15-
dc.identifier.citationAadland, E., Kvalheim, O. M., Anderssen, S. A., Resaland, G. K., and Andersen, L. B. (2019). Multicollinear physical activity accelerometry data and associations tocardiometabolic health: Challenges, pitfalls, and potential solutions. International Journal of Behavioral Nutrition and Physical Activity, 16(1), 74.
Adametz, D., and Roth, V. (2011). Bayesian Partitioning of Large-Scale Distance Data. Advances in Neural Information Processing Systems, 24.
Adams, J., Berman, B., Michalenko, J., and Tucker, J. D. (2025). Conformal Anomaly Detection for Functional Data with Elastic Distance Metrics. arXiv preprint, arXiv: 2504.01172.
Agbaje, A. O. (2023). Associations of accelerometer-based sedentary time, light physical activity and moderate-to-vigorous physical activity with resting cardiac structure and function in adolescents according to sex, fat mass, lean mass, BMI, and hypertensive status. Scandinavian Journal of Medicine & Science in Sports, 33(8), 1399–1411.
Ahmadi, M. N., Pavey, T. G., and Trost, S. G. (2020). Machine learning models for classifying physical activity in free-living preschool children. Sensors, 20(16), 4364.
Bai, J., Di, C., Xiao, L., Evenson, K. R., LaCroix, A. Z., Crainiceanu, C. M., and Buchner, D. M. (2016). An activity index for raw accelerometry data and its comparison with other activity metrics. PLOS ONE, 11(8), e0160644.
Bai, J., Goldsmith, J., Caffo, B., Glass, T. A., and Crainiceanu, C. M. (2012). Movelets: A dictionary of movement. Electronic Journal of Statistics, 6, 559–578.
Bertsekas, D. P. (2012). Dynamic Programming and Optimal Control: Volume I (Vol. 4). Athena Scientific.
Cornet, V. P., and Holden, R. J. (2018). Systematic review of smartphone-based passive sensing for health and wellbeing. Journal of Biomedical Informatics, 77, 120–132.
da Silva, I. C., van Hees, V. T., Ramires, V. V., Knuth, A. G., Bielemann, R. M., Ekelund, U., Brage, S., and Hallal, P. C. (2014). Physical activity levels in three brazilian birth cohorts as assessed with raw triaxial wrist accelerometry. International Journal of Epidemiology, 43(6), 1959–1968.
Doherty, A., Jackson, D., Hammerla, N., Plötz, T., Olivier, P., Granat, M. H., White, T., Hees, V. T. v., Trenell, M. I., Owen, C. G., Preece, S. J., Gillions, R., Sheard, S., Peakman, T., Brage, S., and Wareham, N. J. (2017). Large Scale Population Assessment of Physical Activity Using Wrist Worn Accelerometers: The UK Biobank Study. PLOS ONE, 12(2), 1–14.
Ferraty, F., and Vieu, P. (2006). Nonparametric Functional Data Analysis: Theory and Practice. New York: Springer.
Fraley, C., and Raftery, A. E. (2002). Model-Based Clustering, Discriminant Analysis, and Density Estimation. Journal of the American Statistical Association, 97(458), 611–631.
Garcia-Ceja, E., Riegler, M., Jakobsen, P., Tørresen, J., Nordgreen, T., Oedegaard, K. J., and Fasmer, O. B. (2018). Depresjon: A motor activity database of depression episodes in unipolar and bipolar patients. Proceedings of the 9th ACM Multimedia Systems Conference, 472–477.
Hastie, T., Tibshirani, R., and Friedman, J. (2009). The Elements of Statistical Learning. Springer.
Jacques, J., and Preda, C. (2014). Functional data clustering: A survey. Advances in Data Analysis and Classification, 8(3), 231–255.
Jadhav, S., Tekwe, C. D., and Luan, Y. (2022). A function-based approach to model the measurement error in wearable devices. Statistics in Medicine, 41(24), 4886–4902.
Jain, A. K. (2010). Data clustering: 50 years beyond K-means. Pattern Recognition Letters, 31(8), 651–666.
Jakobsen, P., Garcia-Ceja, E., Stabell, L. A., Oedegaard, K. J., Berle, J. O., Thambawita, V., Hicks, S. A., Halvorsen, P., Fasmer, O. B., and Riegler, M. A. (2020). PSYKOSE: A Motor Activity Database of Patients with Schizophrenia. 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS), 303–308.
James, G. M., and Sugar, C. A. (2003). Clustering for Sparsely Sampled Functional Data. Journal of the American Statistical Association, 98(462), 397–408.
Jin, X., Chen, Y., Feng, H., Zhou, M., Chan, J. W. Y., Liu, Y., Kong, A. P. S., Tan, X., Wing, Y.-K., Liang, Y. Y., and Zhang, J. (2024). Association of accelerometermeasured sleep duration and different intensities of physical activity with incident type 2 diabetes in a population-based cohort study. Journal of Sport and Health Science, 13(2), 222–232.
Jo, A., Coronel, B. D., Coakes, C. E., and Mainous, A. G. (2019). Is There a Benefit to Patients Using Wearable Devices Such as Fitbit or Health Apps on Mobiles? a Systematic Review. The American Journal of Medicine, 132(12), 1394–1400.e1.
Lee, K., and Kwan, M.-P. (2018). Physical activity classification in free-living conditions using smartphone accelerometer data and exploration of predicted results. Computers, Environment and Urban Systems, 67, 124–131.
Li, L., and Nakamura, T. (2019). An epidemiological sleep study based on a large-scale physical activity database. 2019 IEEE 1st Global Conference on Life Sciences and Technologies (LifeTech), 292–293.
Liang, Y.-T., and Wang, C. (2025). Motif clustering and digital biomarker extraction for free-living physical activity analysis. BioData Mining, 18(1), 8.
Liang, Y.-T., Wang, C., and Hsiao, C. K. (2024). Data analytics in physical activity studies with accelerometers: Scoping review. Journal of Medical Internet Research, 26(1), e59497.
Lu, L., Zhang, J., Xie, Y., Gao, F., Xu, S., Wu, X., and Ye, Z. (2020). Wearable health devices in health care: Narrative systematic review. JMIR mHealth and uHealth, 8(11), e18907.
Marron, J. S., Ramsay, J. O., Sangalli, L. M., and Srivastava, A. (2015). Functional data analysis of amplitude and phase variation. Statistical Science, 30(4), 468–484.
Migueles, J. H., Cadenas-Sanchez, C., Ekelund, U., Delisle Nyström, C., Mora-Gonzalez, J., Löf, M., Labayen, I., Ruiz, J. R., and Ortega, F. B. (2017). Accelerometer Data Collection and Processing Criteria to Assess Physical Activity and Other Outcomes: A Systematic Review and Practical Considerations. Sports Medicine (Auckland, N.Z.), 47(9), 1821–1845.
Müller, M. (2007). Dynamic time warping. In Information retrieval for music and motion (pp. 69–84). Springer Berlin Heidelberg.
National Center for Health Statistics. (2025). National health and nutrition examination survey [Accessed: 2025-07-18]. https://wwwn.cdc.gov/nchs/nhanes/
Pitman, J. (2006). Combinatorial Stochastic Processes (J. Picard, Ed.; Vol. 1875). Springer-Verlag.
Ramsay, J., Hooker, G., and Graves, S. (2009). Functional data analysis with r and MATLAB. Springer.
Ramsay, J., and Silverman, B. (2005). Functional Data Analysis. Springer.
Srivastava, A., and Klassen, E. P. (2016). Functional and Shape Data Analysis. Springer.
Srivastava, A., Wu, W., Kurtek, S., Klassen, E., and Marron, J. S. (2011). Registration of Functional Data Using Fisher-Rao Metric.
Srivastava, M. S. (2003). Singular wishart and multivariate beta distributions. The Annals of Statistics, 31(5), 1537–1560.
Staudenmayer, J., Pober, D., Crouter, S., Bassett, D., and Freedson, P. (2009). An artificial neural network to estimate physical activity energy expenditure and identify physical activity type from an accelerometer. Journal of Applied Physiology, 107(4), 1300–1307.
Stewart, T., Narayanan, A., Hedayatrad, L., Neville, J., Mackay, L., and Duncan, S. (2018). A dual-accelerometer system for classifying physical activity in children and adults. Medicine & Science in Sports & Exercise, 50(12), 2595.
Teh, Y. W., Jordan, M. I., Beal, M. J., and Blei, D. M. (2006). Hierarchical Dirichlet Processes. Journal of the American Statistical Association, 101(476), 1566–1581.
Troiano, R. P., McClain, J. J., Brychta, R. J., and Chen, K. Y. (2014). Evolution of accelerometer methods for physical activity research. British Journal of Sports Medicine, 48(13), 1019–1023.
Tucker, J. D. (2014). Functional component analysis and regression using elastic methods (order no. 3638089) [Doctoral dissertation, The Florida State University]. ProQuest Dissertations; Theses 3638089.
Tucker, J. D., and Stamm, A. (2025). Fdasrvf: Elastic Functional Data Analysis.
Tucker, J. D., Wu, W., and Srivastava, A. (2013). Generative models for functional data using phase and amplitude separation. Computational Statistics & Data Analysis, 61, 50–66.
van Hees, V. T., Sabia, S., Jones, S. E., Wood, A. R., Anderson, K. N., Kivimäki, M., Frayling, T. M., Pack, A. I., Bucan, M., Trenell, M. I., Mazzotti, D. R., Gehrman, P. R., Singh-Manoux, B. A., and Weedon, M. N. (2018). Estimating sleep parameters using an accelerometer without sleep diary. Scientific Reports, 8(1), 12975.
Vogt, J. E., Prabhakaran, S., Fuchs, T. J., and Roth, V. (2010). The translation-invariant wishart-dirichlet process for clustering distance data. Proceedings of the 27th International Conference on International Conference on Machine Learning, 1111–1118.
Wang, J.-L., Chiou, J.-M., and Müller, H.-G. (2016). Functional data analysis. Annual Review of Statistics and Its Application, 3, 257–295.
Xu, D., and Tian, Y. (2015). A Comprehensive Survey of Clustering Algorithms. Annals of Data Science, 2(2), 165–193.
Zhang, Z., Pati, D., and Srivastava, A. (2015). Bayesian clustering of shapes of curves. Journal of Statistical Planning and Inference, 166, 171–186.
Zhuang, K.-H. (2021, August 23). Analysis of variance for functional data via random projection, with application to wearable device data [Master's thesis]. National Central University.
-
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/102156-
dc.description.abstract透過自由生活身體活動的分析讓研究者有機會探索身體活動與疾病或健康事件間的關係,然而透過穿戴式裝置所蒐集的的活動資料並無活動類型或標籤的相關訊息。應用非監督分群分析可以讓我們有機會探索可能的活動類型或標籤。因此,基於曲線形狀的非監督分群分析在這類型的科學應用中至關重要。然而,現有函數資料分群方法常側重於振幅(amplitude)差異,卻忽略了對身體活動(physical activity)資料分析影響顯著的相位(phase)變化。識別活動曲線之間的相似性需要同時考量相位與振幅的差異變化。儘管分群分析在發現活動模式方面具有潛力,但相關研究仍然有限。為彌補這些不足,本研究提出一種貝氏分群方法,從穿戴式裝置採集的自由生活身體活動資料識別出不同的活動模式(motif)。
我們將24小時活動曲線分割成固定時間段。接著,應用彈性函數資料分析,透過彈性距離(elastic distance)矩陣量化曲線之間的差異性,並將相位與振幅距離的權重總和定義為曲線間的差異距離,再分別使用Von Mises distribution和Gamma distribution建模。應用Dirichlet process貝氏無母數分群方法(Bayesian nonparametric cluster method)架構,得到分群結果與分群數量的自動推斷。最後識別出的群集,可以為進一步分析身體活動資料時,用於定義新數位生物標記的選擇。
我們透過實際資料的應用,驗證所提出方法的表現。各群集皆展現出不同的特徵,其中部分群集較依賴振幅距離作為群集特徵,而另一些則較依賴相位距離。本研究提出的貝氏函數資料分群方法,透過後驗分析確定最佳群集數量,不需要預先指定群集數量。此方法將透過應用實際資料來進行驗證。此架構為函數資料分析提供了一個穩健的data-driven分群方法,有助於關聯性研究,並透過揭示有意義的活動模式來加強健康事件研究。
zh_TW
dc.description.abstractAnalyzing free-living physical activity (PA) offers researchers a valuable opportunity to explore the relationship between PA and various health outcomes. However, a significant challenge arises from the free-living PA data collected by wearable devices: it often lacks information regarding activity type or labels. To address this, applying unsupervised cluster analysis becomes crucial, allowing researchers to identify potential activity types or labels within the unlabeled PA data. Hence, unsupervised cluster analysis on curve shapes is a significant problem for this scientific application. However, many functional clustering methods focus on amplitude differences, neglecting the considerable impact of phase variations, particularly in physical activity data. Analyzing the similarity between two activity curves necessitates considering phase and amplitude variations for meaningful insights. Moreover, research discussing or proposing methods for studying PA data through cluster analysis to identify activity patterns remains limited. Hence, this study aims to develop a novel Bayesian motif-based clustering method to uncover distinct activity patterns (motifs) within free-living PA data collected from wearable devices.
Initially, we segment the 24-hour activity curve into fixed-time intervals, using the fundamental time unit of an activity as the basis for segmentation. Subsequently, elastic shape analysis is employed for these activity segments, and elastic distance is utilized to quantify curve dissimilarity. This curve dissimilarity is decomposed into phase and amplitude distance components, which are modeled using the Von Mises and Gamma distributions, respectively. A Bayesian nonparametric clustering framework with a Dirichlet process was proposed. We derive cluster results and the posterior distribution, thereby inferring the posterior distribution of the number of clusters. Finally, these identified activity clusters could be used to define new digital biomarkers as PA features for further analysis.
This study developed a novel Bayesian functional clustering methodology for activity pattern discovery, leveraging elastic functional data analysis to compare activity curves and eliminating the necessity for pre-specifying the number of clusters. By employing a flexible prior on the space of data partitions and analyzing the resulting posterior distribution, our approach will effectively determine the optimal clustering configuration. Each cluster exhibits distinct characteristics, with some relying more on the amplitude distance component, while others are more dependent on the phase distance component. We validated the performance of our proposed method through real-world applications.
We hope this framework will provide a practical solution for real-world datasets in prospective applications. This method will facilitate data-driven partitioning within functional data analysis, thereby making a significant contribution to ongoing research, such as association studies, by enabling the discovery of meaningful patterns and relationships within health events. Moreover, it is hypothesized that the motifs identified through this method can serve as a foundational basis for establishing digital biomarkers, subsequently advancing research in physical activity analysis.
en
dc.description.provenanceSubmitted by admin ntu (admin@lib.ntu.edu.tw) on 2026-03-13T16:52:32Z
No. of bitstreams: 0
en
dc.description.provenanceMade available in DSpace on 2026-03-13T16:52:32Z (GMT). No. of bitstreams: 0en
dc.description.tableofcontents口試委員會審定書i
Acknowledgements ii
Chinese Abstract iii
English Abstract v
Contents viii
List of Figures x
List of Tables xii
Chapter 1 Introduction 1
1.1 Physical Activity . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Brief Introduction to Functional Data . . . . . . . . . . . . . . . . . 2
1.3 Cluster Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
Chapter 2 Methodology Background 13
2.1 Functional Data Analysis . . . . . . . . . . . . . . . . . . . . . . . . 13
2.2 Square-Root Velocity Function . . . . . . . . . . . . . . . . . . . . . 15
2.3 Bayesian Clustering Models . . . . . . . . . . . . . . . . . . . . . . 20
Chapter 3 Proposed Method 26
3.1 Proposed Bayesian Motif-clustering Method . . . . . . . . . . . . . 26
Chapter 4 Results 35
4.1 Depresjon & PSYKOSE Datasets . . . . . . . . . . . . . . . . . . . 36
4.1.1 Datasets Description . . . . . . . . . . . . . . . . . . . . . . . . . 36
4.1.2 Preprocessing Pipeline for Depresjon & PSYKOSE Datasets . . . . 37
4.1.3 Results of Depresjon & PSYKOSE Datasets . . . . . . . . . . . . . 41
4.2 NHANES surveys in 2011–2012 . . . . . . . . . . . . . . . . . . . . 60
4.2.1 Dataset Description . . . . . . . . . . . . . . . . . . . . . . . . . . 60
4.2.2 Preprocessing Pipeline for NHANES Dataset . . . . . . . . . . . . 62
4.2.3 Results of NHANES Dataset . . . . . . . . . . . . . . . . . . . . . 66
Chapter 5 Discussion and Conclusion 74
5.1 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
5.2 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
References 81
Appendix A — Others Model Results 87
A.1 Depresjon & PSYKOSE Datasets . . . . . . . . . . . . . . . . . . . 87
A.2 NHANES - Random Select 1 day per Child . . . . . . . . . . . . . . 90
Appendix B — fdasrvf package 96
B.1 User Guide – fdasrvf package . . . . . . . . . . . . . . . . . . . . 96
B.1.1 Package Information . . . . . . . . . . . . . . . . . . . . . . . . . 96
B.1.2 Function Naming Conventions . . . . . . . . . . . . . . . . . . . . 96
B.1.3 Functions for Conversion (Original Space→ SRSF/SRVF Space) . 97
B.1.4 Functions for Alignment . . . . . . . . . . . . . . . . . . . . . . . 97
B.1.5 Functions for Distance Calculation . . . . . . . . . . . . . . . . . . 99
B.1.6 Functions for Smoothing . . . . . . . . . . . . . . . . . . . . . . . 100
-
dc.language.isoen-
dc.subject活動模式-
dc.subject貝氏聚類分析-
dc.subject彈性距離-
dc.subject彈性形狀分析-
dc.subject函數資料分析-
dc.subject身體活動-
dc.subject穿戴式裝置資料-
dc.subjectactivity pattern-
dc.subjectBayesian clustering method-
dc.subjectelastic distance-
dc.subjectelastic shape analysis-
dc.subjectfunctional data analysis-
dc.subjectphysical activity-
dc.subjectwearable device data-
dc.title利用貝氏主題分群方法探索來自穿戴式裝置之自由生活身體活動資料中的活動模式zh_TW
dc.titleUncovering Activity Patterns in Free-Living Physical Activity Data from Wearable Devices via Bayesian Motif-Based Clustering Methoden
dc.typeThesis-
dc.date.schoolyear114-1-
dc.description.degree碩士-
dc.contributor.oralexamcommittee林煜軒;李百靈;蕭朱杏zh_TW
dc.contributor.oralexamcommitteeYu-Hsuan Lin;Pai-Ling Li;Chuhsing Kate Hsiaoen
dc.subject.keyword活動模式,貝氏聚類分析彈性距離彈性形狀分析函數資料分析身體活動穿戴式裝置資料zh_TW
dc.subject.keywordactivity pattern,Bayesian clustering methodelastic distanceelastic shape analysisfunctional data analysisphysical activitywearable device dataen
dc.relation.page100-
dc.identifier.doi10.6342/NTU202502593-
dc.rights.note同意授權(限校園內公開)-
dc.date.accepted2026-01-15-
dc.contributor.author-college公共衛生學院-
dc.contributor.author-dept健康數據拓析統計研究所-
dc.date.embargo-lift2030-07-26-
顯示於系所單位:健康數據拓析統計研究所

文件中的檔案:
檔案 大小格式 
ntu-114-1.pdf
  未授權公開取用
17.42 MBAdobe PDF檢視/開啟
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved