Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 文學院
  3. 語言學研究所
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/96504
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor謝舒凱zh_TW
dc.contributor.advisorShu-Kai Hsiehen
dc.contributor.author葉凱晴zh_TW
dc.contributor.authorKai-Ching Yehen
dc.date.accessioned2025-02-19T16:16:11Z-
dc.date.available2025-02-20-
dc.date.copyright2025-02-19-
dc.date.issued2025-
dc.date.submitted2025-02-04-
dc.identifier.citationAI, E. (2024). Spacy: industrial-strength natural language processing in python [Version 3.6.0]. https://spacy.io
Argyle, M., Cook, M., & Cramer, D. (1994). Gaze and mutual gaze. The British Journal of Psychiatry, 165(6), 848–850.
Banitalebi, Z., Jabbari, A. A., & Razmi, M. H. (2020). Cross-linguistic gender differences in EFL learners' pause frequency and duration. Iranian Journal of Learning and Memory, 3(9), 19–27.
Barbur, V. A. (1994). Introduction to linear regression analysis.
Bargiela-Chiappini, F., & Haugh, M. (2009). Face, communication and social interaction. Equinox Publishing.
Bennett, P. R. (1986). The role of pause in discourse and its place in linguistics: Some evidence from Eastern Bantu. Language Sciences, 8(1), 63–79.
Boersma, P., & Weenink, D. (2024). Praat: Doing phonetics by computer [Version 6.3.0]. Retrieved December 30, 2024, from https://www.fon.hum.uva.nl/praat/
Busso, C., Deng, Z., Yildirim, S., Bulut, M., Lee, C. M., Kazemzadeh, A., Lee, S., Neumann, U., & Narayanan, S. (2004). Analysis of emotion recognition using facial expressions, speech and multimodal information. Proceedings of the 6th International Conference on Multimodal Interfaces, 205–211.
Butterworth, B., & Hadar, U. (1989). Gesture, speech, and computational stages: A reply to McNeill.
Campione, E., & Véronis, J. (2002). A large-scale multilingual study of silent pause duration. Speech Prosody 2002, International Conference.
Charlet, D., Barras, C., & Liénard, J.-S. (2013). Impact of overlapping speech detection on speaker diarization for broadcast news and debates. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 7707–7711.
Chen, A. C.-H., & Tseng, S.-C. (2019). Prosodic encoding in Mandarin spontaneous speech: Evidence for clause-based advanced planning in language production. Journal of Phonetics, 76, 100912.
Cohen, J. (2016). A power primer.
Corley, M., MacGregor, L. J., & Donaldson, D. I. (2007). It's the way that you, er, say it: Hesitations in speech affect language comprehension. Cognition, 105(3), 658–668.
Daly, J. A., Bell, R. A., Glenn, P. J., & Lawrence, S. (1985). Conceptualizing conversational complexity. Human Communication Research, 12(1), 30–53.
Developers, P. (2024). Pytube (Version 15.0.0). Retrieved December 30, 2024, from https://github.com/pytube/pytube
Finlayson, I. R., & Corley, M. (2012). Disfluency in dialogue: An intentional signal from the speaker? Psychonomic Bulletin & Review, 19, 921–928.
Goldman-Eisler, F. (1958). Speech production and the predictability of words in context. Quarterly Journal of Experimental Psychology, 10(2), 96–106.
Goldman-Eisler, F. (1961). The distribution of pause durations in speech. Language and Speech, 4(4), 232–237.
Goyal, S., Ji, Z., Rawat, A. S., Menon, A. K., Kumar, S., & Nagarajan, V. (2023). Think before you speak: Training language models with pause tokens. arXiv preprint arXiv:2310.02226.
Graziano, M., & Gullberg, M. (2018). When speech stops, gesture stops: Evidence from developmental and crosslinguistic comparisons. Frontiers in Psychology, 9, 879.
Hadar, U., Wenkert-Olenik, D., Krauss, R., & Soroker, N. (1998). Gesture and the processing of speech: Neuropsychological evidence. Brain and Language, 62(1), 107–126.
Heldner, M., & Edlund, J. (2010). Pauses, gaps and overlaps in conversations. Journal of Phonetics, 38(4), 555–568.
Hirvenkari, L., Ruusuvuori, J., Saarinen, V.-M., Kivioja, M., Peräkylä, A., & Hari, R. (2013). Influence of turn-taking in a two-person conversation on the gaze of a viewer. PLoS One, 8(8), e71569.
Jehoul, A. (2019). A multimodal study of filled pauses [Doctoral dissertation, PhD thesis, University of Leuven].
Jiang, B., Ekstedt, E., & Skantze, G. (2023). What makes a good pause? Investigating the turn-holding effects of fillers. arXiv preprint arXiv:2305.02101.
Kendon, A. (1967). Some functions of gaze-direction in social interaction. Acta Psychologica, 26, 22–63.
Levinson, S. C., & Torreira, F. (2015). Timing in turn-taking and its implications for processing models of language. Frontiers in Psychology, 6, 731.
Newman, H. M. (1982). The sounds of silence in communicative encounters. Communication Quarterly, 30(2), 142–149.
O'Connell, D. C., & Kowal, S. (2005). Uh and um revisited: Are they interjections for signaling delay? Journal of Psycholinguistic Research, 34, 555–576.
OpenAI. (2024). Whisper: Automatic speech recognition by OpenAI [Version 1.0]. Retrieved December 30, 2024, from https://openai.com/research/whisper
Price, J. M. (2021). The perceived effect of pause length and location on speaker likability and communicative effectiveness [Master’s thesis, Brigham Young University].
Templeton, E. M., Chang, L. J., Reynolds, E. A., Cone LeBeaumont, M. D., & Wheatley, T. (2022). Fast response times signal social connection in conversation. Proceedings of the National Academy of Sciences, 119(4), e2116915119.
Yuan, J., Xu, X., Lai, W., & Liberman, M. (2016). Pauses and pause fillers in Mandarin monologue speech: The effects of sex and proficiency. Proceedings of Speech Prosody, 2016, 1167–1170.
Zellner, B. (1994). Pauses and the temporal structure of speech. In E. Keller (Ed.), Fundamentals of speech synthesis and speech recognition (pp. 41–62). John Wiley.
Zimmermann, D. H., & West, C. (1996). Sex roles, interruptions and silences in conversation. In Amsterdam studies in the theory and history of linguistic science series 4 (pp. 211–236). John Benjamins BV.
重光由加. (2005). Different interpretations of pauses in natural conversation–Japanese, Chinese and Americans. 東京工芸大学工学部紀要. 人文・社会編, 28(2), 8–14.
-
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/96504-
dc.description.abstract本研究聚焦於中文談話中的停頓現象,試圖了解停頓在談話中的角色和意義。本研究對停頓進行分類和分析,結果發現,不同類型的停頓在使用頻率、持續時間,和語言結構上,都有明顯的差異。本研究還發現,停頓前後的詞性和詞彙搭配有一定的規律,這些規律反映了對話者如何利用停頓來傳遞信息或表達重點。這項研究不僅加深了我們對中文談話中停頓的理解,也為自然語言處理的應用提供了新的參考方向。zh_TW
dc.description.abstractThis study focuses on the phenomenon of pauses in Mandarin podcasts, aiming to understand their roles and significance in podcasts. By classifying and analyzing pauses, the results reveal significant differences in their frequency, duration, and linguistic structures. The study also identifies patterns in the parts of speech (POS) and lexical collocations before and after pauses, which reflect how speakers use pauses to convey information or emphasize key points. This research not only deepens our understanding of pauses in Mandarin discourse but also provides new insights for applications in natural language processing (NLP).en
dc.description.provenanceSubmitted by admin ntu (admin@lib.ntu.edu.tw) on 2025-02-19T16:16:11Z
No. of bitstreams: 0
en
dc.description.provenanceMade available in DSpace on 2025-02-19T16:16:11Z (GMT). No. of bitstreams: 0en
dc.description.tableofcontents致謝 iii
摘要 v
Abstract vii
List of Figures ix
List of Tables xi
1 Introduction 1
1.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.3 Purposes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2 Literature Review 5
2.1 Linguistic Approaches on Pauses in Discourse . . . . . . . . . . . . . 6
2.1.1 Turn-taking mechanisms . . . . . . . . . . . . . . . . . . . . . 6
2.1.2 Pause Length . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.1.3 Pause Placement . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.1.4 Cultural Variations . . . . . . . . . . . . . . . . . . . . . . . . 9
2.2 Machine Learning Approaches on Pauses . . . . . . . . . . . . . . . . 10
2.2.1 Human Interactions vs. Human-robot interactions . . . . . . . 10
2.2.2 Pauses in Speech Synthesis . . . . . . . . . . . . . . . . . . . 11
2.2.3 Machine Learning and Pause Tokens . . . . . . . . . . . . . . 11
2.3 Multimodal Approaches on Pauses . . . . . . . . . . . . . . . . . . . 11
2.3.1 Multimodal in Conversations . . . . . . . . . . . . . . . . . . 12
3 Data Collection 15
3.1 Video Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
3.2 Corpus Compilation . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
4 Data Annotation 19
4.1 Processing and Transcription . . . . . . . . . . . . . . . . . . . . . . 20
4.1.1 Audio Conversion . . . . . . . . . . . . . . . . . . . . . . . . . 21
4.1.2 Audio Transcription . . . . . . . . . . . . . . . . . . . . . . . 21
4.2 Pause Detection and Manual Adjustment . . . . . . . . . . . . . . . . 22
4.2.1 Pause Detection . . . . . . . . . . . . . . . . . . . . . . . . . 22
4.2.2 Manual Verification . . . . . . . . . . . . . . . . . . . . . . . . 23
4.3 Pause Classification . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
4.3.1 Classification Categories . . . . . . . . . . . . . . . . . . . . . 24
4.3.2 Manual Annotation Check . . . . . . . . . . . . . . . . . . . . 25
4.3.3 Inter-Annotator Reliability Check . . . . . . . . . . . . . . . . 25
4.3.4 Explanation of the Terms . . . . . . . . . . . . . . . . . . . . 26
4.3.5 Interpretation of Cohen's Kappa . . . . . . . . . . . . . . . 26
5 Data Analysis 29
5.1 Pause Frequency and Length Analysis . . . . . . . . . . . . . . . . . 29
5.1.1 Proportion of Different Pause Types . . . . . . . . . . . . . . 31
5.1.2 Length of Different Pause Types . . . . . . . . . . . . . . . . 35
5.1.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
5.2 Analysis of Syntactic Correlations with Pauses . . . . . . . . . . . . . 38
5.2.1 Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
5.2.2 Regression Analysis . . . . . . . . . . . . . . . . . . . . . . . 41
5.2.3 Analysis of Pauses Types Respectively . . . . . . . . . . . . . 43
5.2.4 Comprehensive Analysis of Pauses Types . . . . . . . . . . . . 47
5.3 Analysis of Collocations Across Different Pause Types . . . . . . . . . 49
5.3.1 Distribution Analysis of High, Medium, and Low Frequency Collocations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
5.3.2 Chi-square Statistical Testing . . . . . . . . . . . . . . . . . . 50
5.3.3 Frequency Distribution . . . . . . . . . . . . . . . . . . . . . . 51
5.3.4 Examples and Analysis of Collocation Frequencies Across Pause Types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
6 Conclusion 57
6.1 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
6.2 Research Limitations . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
6.3 Future Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
References 61
-
dc.language.isoen-
dc.subject詞彙搭配zh_TW
dc.subject自然語言處理zh_TW
dc.subject語法分析zh_TW
dc.subject中文談話zh_TW
dc.subject停頓zh_TW
dc.subjectnatural language processingen
dc.subjectpauseen
dc.subjectMandarin podcastsen
dc.subjectsyntactic analysisen
dc.subjectcollocationsen
dc.title中文談話中的停頓類型之語言分析zh_TW
dc.titleA Linguistic Analysis of Pause Types in Mandarin Podcasten
dc.typeThesis-
dc.date.schoolyear113-1-
dc.description.degree碩士-
dc.contributor.oralexamcommittee陳正賢;張瑜芸zh_TW
dc.contributor.oralexamcommitteeCheng-Hsien Chen;Yu-Yun Changen
dc.subject.keyword停頓,中文談話,語法分析,詞彙搭配,自然語言處理,zh_TW
dc.subject.keywordpause,Mandarin podcasts,syntactic analysis,collocations,natural language processing,en
dc.relation.page67-
dc.identifier.doi10.6342/NTU202500343-
dc.rights.note未授權-
dc.date.accepted2025-02-04-
dc.contributor.author-college文學院-
dc.contributor.author-dept語言學研究所-
dc.date.embargo-liftN/A-
顯示於系所單位:語言學研究所

文件中的檔案:
檔案 大小格式 
ntu-113-1.pdf
  未授權公開取用
5.38 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved