NTU Theses and Dissertations Repository
Please use this Handle URI to cite this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/92889

Full metadata record

DC Field | Value | Language
dc.contributor.advisor | 許永真 | zh_TW
dc.contributor.advisor | Yung-jen Hsu | en
dc.contributor.author | 林珮盈 | zh_TW
dc.contributor.author | Pei-Ying Lin | en
dc.date.accessioned | 2024-07-03T16:08:05Z | -
dc.date.available | 2024-07-04 | -
dc.date.copyright | 2024-07-03 | -
dc.date.issued | 2024 | -
dc.date.submitted | 2024-06-28 | -
dc.identifier.citation[1] D. H. Ackley, G. E. Hinton, and T. J. Sejnowski. A learning algorithm for boltzmann machines. Cognitive science, 9(1):147–169, 1985.
[2] S. An, B. Zhou, Z. Lin, Q. Fu, B. Chen, N. Zheng, W. Chen, and J.-G. Lou. Skill-based few-shot selection for in-context learning, 2023.
[3] T. B. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Nee-lakantan, P. Shyam, G. Sastry, A. Askell, S. Agarwal, A. Herbert-Voss, G. Krueger, T. Henighan, R. Child, A. Ramesh, D. M. Ziegler, J. Wu, C. Winter, C. Hesse, M. Chen, E. Sigler, M. Litwin, S. Gray, B. Chess, J. Clark, C. Berner, S. McCandlish, A. Radford, I. Sutskever, and D. Amodei. Language models are few-shot learners, 2020.
[4] M. Chen, J. Tworek, H. Jun, Q. Yuan, H. P. d. O. Pinto, J. Kaplan, H. Edwards,Y. Burda, N. Joseph, G. Brockman, et al. Evaluating large language models trained on code. arXiv preprint arXiv:2107.03374, 2021.
[5] J. J. Y. Chung, E. Kamar, and S. Amershi. Increasing diversity while maintaining accuracy: Text data generation with large language models and human interventions. arXiv preprint arXiv:2306.04140, 2023.
[6] P. Clark, I. Cowhey, O. Etzioni, T. Khot, A. Sabharwal, C. Schoenick, and O. Tafjord. Think you have solved question answering? try arc, the ai2 reasoning challenge, 2018.
[7] K. Cobbe, V. Kosaraju, M. Bavarian, M. Chen, H. Jun, L. Kaiser, M. Plappert, J. Tworek, J. Hilton, R. Nakano, C. Hesse, and J. Schulman. Training verifiers to solve math word problems, 2021.
[8] Y. Fei, Y. Hou, Z. Chen, and A. Bosselut. Mitigating label biases for in-context learning, 2023.
[9] J. Ficler and Y. Goldberg. Controlling linguistic style aspects in neural language generation. arXiv preprint arXiv:1707.02633, 2017.
[10] T. Gao, X. Yao, and D. Chen. Simcse: Simple contrastive learning of sentence embeddings, 2022.
[11] M. Geva, D. Khashabi, E. Segal, T. Khot, D. Roth, and J. Berant. Did aristotle use a laptop? a question answering benchmark with implicit reasoning strategies, 2021.
[12] Z. Han, Y. Hao, L. Dong, Y. Sun, and F. Wei. Prototypical calibration for few-shot learning of language models, 2022.
[13] T. Hartvigsen, S. Gabriel, H. Palangi, M. Sap, D. Ray, and E. Kamar. Toxigen: A large-scale machine-generated dataset for adversarial and implicit hate speech detection, 2022.
[14] A. Holtzman, P. West, V. Shwartz, Y. Choi, and L. Zettlemoyer. Surface form competition: Why the highest probability answer isn’t always right, 2022.
[15] A. Q. Jiang, A. Sablayrolles, A. Mensch, C. Bamford, D. S. Chaplot, D. de las Casas, F. Bressand, G. Lengyel, G. Lample, L. Saulnier, L. R. Lavaud, M.-A. Lachaux, P. Stock, T. L. Scao, T. Lavril, T. Wang, T. Lacroix, and W. E. Sayed. Mistral 7b, 2023.
[16] Z. Jiang, F. F. Xu, L. Gao, Z. Sun, Q. Liu, J. Dwivedi-Yu, Y. Yang, J. Callan, and G. Neubig. Active retrieval augmented generation, 2023.
[17] E. Kamalloo, N. Dziri, C. L. A. Clarke, and D. Rafiei. Evaluating open-domain question answering in the era of large language models, 2023.
[18] I. Levy, B. Bogin, and J. Berant. Diverse demonstrations improve in-context compositional generalization, 2023.
[19] J. Liu, J. Jin, Z. Wang, J. Cheng, Z. Dou, and J.-R. Wen. Reta-llm: A retrieval-augmented large language model toolkit, 2023.
[20] J. Liu, D. Shen, Y. Zhang, B. Dolan, L. Carin, and W. Chen. What makes good in-context examples for gpt-3?, 2021.
[21] P. Liu, W. Yuan, J. Fu, Z. Jiang, H. Hayashi, and G. Neubig. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing, 2021.
[22] Y. Liu, K. Shi, K. S. He, L. Ye, A. R. Fabbri, P. Liu, D. Radev, and A. Cohan. On learning to summarize with large language models as references, 2023.
[23] V. Liévin, C. E. Hother, A. G. Motzfeldt, and O. Winther. Can large language models reason about medical questions?, 2023.
[24] S. Min, X. Lyu, A. Holtzman, M. Artetxe, M. Lewis, H. Hajishirzi, and L. Zettlemoyer. Rethinking the role of demonstrations: What makes in-context learningwork?, 2022.
[25] R. Naik, V. Chandrasekaran, M. Yuksekgonul, H. Palangi, and B. Nushi. Diversity of thought improves reasoning abilities of llms, 2024.
[26] OpenAI, J. Achiam, S. Adler, S. Agarwal, L. Ahmad, I. Akkaya, F. L. Aleman, D. Almeida, J. Altenschmidt, S. Altman, S. Anadkat, R. Avila, I. Babuschkin, S. Balaji, V. Balcom, P. Baltescu, H. Bao, M. Bavarian, J. Belgum, I. Bello, et al. Gpt-4 technical report, 2024.
[27] A. Patel, S. Bhattamishra, and N. Goyal. Are nlp models really able to solve simple math word problems?, 2021.
[28] J. W. Rae, S. Borgeaud, T. Cai, K. Millican, J. Hoffmann, F. Song, J. Aslanides, S. Henderson, R. Ring, S. Young, E. Rutherford, T. Hennigan, J. Menick, A. Cassirer, R. Powell, G. van den Driessche, L. A. Hendricks, M. Rauh, P.-S. Huang, A. Glaese, J. Welbl, S. Dathathri, S. Huang, J. Uesato, J. Mellor, I. Higgins, A. Creswell, N. McAleese, A. Wu, E. Elsen, S. Jayakumar, E. Buchatskaya, D. Budden, E. Sutherland, K. Simonyan, M. Paganini, L. Sifre, L. Martens, X. L. Li, A. Kuncoro, A. Nematzadeh, E. Gribovskaya, D. Donato, A. Lazaridou, A. Mensch, J.-B. Lespiau, M. Tsimpoukelli, N. Grigorev, D. Fritz, T. Sottiaux, M. Pajarskas, T. Pohlen, Z. Gong, D. Toyama, C. de Masson d’Autume, Y. Li, T. Terzi, V. Mikulik, I. Babuschkin, A. Clark, D. de Las Casas, A. Guy, C. Jones, J. Bradbury, M. Johnson, B. Hechtman, L. Weidinger, I. Gabriel, W. Isaac, E. Lockhart, S. Osindero, L. Rimell, C. Dyer, O. Vinyals, K. Ayoub, J. Stanway, L. Bennett, D. Hassabis, K. Kavukcuoglu, and G. Irving. Scaling language models: Methods, analysis & insights from training gopher, 2022.
[29] J. Robinson, C. M. Rytting, and D. Wingate. Leveraging large language models for multiple choice question answering, 2023.
[30] G. Sahu, P. Rodriguez, I. H. Laradji, P. Atighehchian, D. Vazquez, and D. Bahdanau. Data augmentation for intent classification with off-the-shelf large language models, 2022.
[31] M. Sap, H. Rashkin, D. Chen, R. LeBras, and Y. Choi. Socialiqa: Commonsense reasoning about social interactions, 2019.
[32] Y. Shen, Q. Liu, Z. Mao, Z. Wan, F. Cheng, and S. Kurohashi. Seeking diverse reasoning logic: Controlled equation expression generation for solving math word problems, 2022.
[33] T. Shin, Y. Razeghi, R. L. L. I. au2, E. Wallace, and S. Singh. Autoprompt: Eliciting knowledge from language models with automatically generated prompts, 2020.
[34] S. Smith, M. Patwary, B. Norick, P. LeGresley, S. Rajbhandari, J. Casper, Z. Liu, S. Prabhumoye, G. Zerveas, V. Korthikanti, E. Zhang, R. Child, R. Y. Aminabadi, J. Bernauer, X. Song, M. Shoeybi, Y. He, M. Houston, S. Tiwary, and B. Catanzaro. Using deepspeed and megatron to train megatron-turing nlg 530b, a large-scale generative language model, 2022.
[35] A. Srivastava, A. Rastogi, A. Rao, A. A. M. Shoeb, A. Abid, A. Fisch, A. R. Brown, A. Santoro, A. Gupta, A. Garriga-Alonso, A. Kluska, A. Lewkowycz, A. Agarwal, A. Power, A. Ray, et al. Beyond the imitation game: Quantifying and extrapolating the capabilities of language models, 2023.
[36] A. Talmor, J. Herzig, N. Lourie, and J. Berant. Commonsenseqa: A question answering challenge targeting commonsense knowledge, 2019.
[37] H. Touvron, L. Martin, K. Stone, P. Albert, A. Almahairi, Y. Babaei, N. Bashlykov, S. Batra, P. Bhargava, S. Bhosale, D. Bikel, L. Blecher, C. C. Ferrer, M. Chen, G. Cucurull, D. Esiobu, J. Fernandes, J. Fu, W. Fu, B. Fuller, C. Gao, V. Goswami, N. Goyal, A. Hartshorn, S. Hosseini, R. Hou, H. Inan, M. Kardas, V. Kerkez, M. Khabsa, I. Kloumann, A. Korenev, P. S. Koura, M.-A. Lachaux, T. Lavril, J. Lee, D. Liskovich, Y. Lu, Y. Mao, X. Martinet, T. Mihaylov, P. Mishra, I. Molybog, Y. Nie, A. Poulton, J. Reizenstein, R. Rungta, K. Saladi, A. Schelten, R. Silva, E. M. Smith, R. Subramanian, X. E. Tan, B. Tang, R. Taylor, A. Williams, J. X. Kuan, P. Xu, Z. Yan, I. Zarov, Y. Zhang, A. Fan, M. Kambadur, S. Narang, A. Rodriguez, R. Stojnic, S. Edunov, and T. Scialom. Llama 2: Open foundation and fine-tuned chat models, 2023.
[38] X. Wang, J. Wei, D. Schuurmans, Q. Le, E. Chi, S. Narang, A. Chowdhery, and D. Zhou. Self-consistency improves chain of thought reasoning in language models, 2023.
[39] Y. Wang, Z. Zhang, and R. Wang. Element-aware summarization with large language models: Expert-aligned evaluation and chain-of-thought method, 2023.
[40] J. Wei, M. Bosma, V. Zhao, K. Guu, A. W. Yu, B. Lester, N. Du, A. M. Dai, and Q. V. Le. Finetuned language models are zero-shot learners. In International Conference on Learning Representations, 2022.
[41] J. Wei, Y. Tay, R. Bommasani, C. Raffel, B. Zoph, S. Borgeaud, D. Yogatama, M. Bosma, D. Zhou, D. Metzler, E. H. Chi, T. Hashimoto, O. Vinyals, P. Liang, J. Dean, and W. Fedus. Emergent abilities of large language models. Transactions on Machine Learning Research, 2022. Survey Certification.
[42] J. Wei, X. Wang, D. Schuurmans, M. Bosma, B. Ichter, F. Xia, E. Chi, Q. Le, and D. Zhou. Chain-of-thought prompting elicits reasoning in large language models, 2023.
[43] K. M. Yoo, J. Kim, H. J. Kim, H. Cho, H. Jo, S.-W. Lee, S. goo Lee, and T. Kim. Ground-truth labels matter: A deeper look into input-label demonstrations, 2022.
[44] B. Zhang, B. Haddow, and A. Birch. Prompting large language model for machine translation: A case study, 2023.
[45] Y. Zhang, S. Feng, and C. Tan. Active example selection for in-context learning, 2022.
[46] Z. Zhang, A. Zhang, M. Li, and A. Smola. Automatic chain of thought prompting in large language models, 2022.
[47] T. Z. Zhao, E. Wallace, S. Feng, D. Klein, and S. Singh. Calibrate before use: Improving few-shot performance of language models, 2021.
[48] Y. Zhou, A. I. Muresanu, Z. Han, K. Paster, S. Pitis, H. Chan, and J. Ba. Large language models are human-level prompt engineers, 2023.
[49] W. Zhu, H. Liu, Q. Dong, J. Xu, S. Huang, L. Kong, J. Chen, and L. Li. Multilingual machine translation with large language models: Empirical results and analysis, 2023.
[50] W. Zhu, H. Liu, Q. Dong, J. Xu, L. Kong, J. Chen, L. Li, and S. Huang. Multilingual machine translation with large language models: Empirical results and analysis. arXiv preprint arXiv:2304.04675, 2023.
[51] Y. Zhu, H. Yuan, S. Wang, J. Liu, W. Liu, C. Deng, Z. Dou, and J.-R. Wen. Large language models for information retrieval: A survey. arXiv preprint arXiv:2308.07107, 2023.
-
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/92889 | -
dc.description.abstract | Although large language models (LLMs) have demonstrated excellent state-of-the-art (SOTA) performance on many natural language processing tasks, multiple-choice question answering remains a challenge for them. LLMs do achieve respectable results on multiple-choice question answering through few-shot in-context learning, but generating or selecting demonstration samples is not a simple task: many factors must be considered in their design, including their ordering, the majority label, and the recency of labels. Several recent studies have focused on demonstration design in pursuit of better performance. We therefore ask: without considering demonstration samples at all, how can we improve the performance of LLMs on multiple-choice reasoning tasks through zero-shot learning?
When people face a multiple-choice reasoning task, we usually rely on prior knowledge and common sense to form a preliminary answer, then compare that preliminary answer against the provided options and select the most likely option as the final answer. We therefore propose Aggregated Semantic Matching Retrieval (ASMR) as a solution for multiple-choice reasoning tasks. To mimic how humans solve multiple-choice reasoning tasks, we leverage the abilities of LLMs to first generate preliminary candidate answers through an open-ended question; comparing these preliminary answers with the provided options helps retrieve the more likely answer option from the given choices. Our experiments show that ASMR performs well on popular commonsense reasoning datasets as well as on BIG-BENCH datasets. | zh_TW
dc.description.abstract | While Large Language Models (LLMs) have showcased remarkable state-of-the-art (SOTA) performance across numerous natural language processing tasks, their effectiveness on multiple-choice question answering continues to pose a challenge. Although LLMs have achieved impressive results on multiple-choice question answering by utilizing few-shot in-context learning, generating or selecting demonstration samples is not a simple task. Several factors must be taken into account when designing demonstration samples, including their ordering, the majority label, and the recency of labels. Several recent studies focus on the design of demonstration samples to achieve better performance. We therefore ask: without considering demonstration samples, how can we enhance the performance of Large Language Models on multiple-choice reasoning tasks through zero-shot learning?
When confronted with multiple-choice reasoning tasks, humans typically rely on their prior knowledge and commonsense to formulate a preliminary answer in mind. Subsequently, they compare this preliminary answer to the provided choices and select the most likely choice as the final answer. We introduce Aggregated Semantic Matching Retrieval (ASMR) as a solution for multiple-choice reasoning tasks. To mimic how humans solve reasoning tasks with multiple choices, we leverage the capabilities of LLMs to first generate preliminary possible answers through an open-ended question, which aids in retrieving the most relevant answer to the question from the given choices (a minimal sketch of this matching step appears after the metadata record below). Our experiments demonstrate the effectiveness of ASMR on popular commonsense reasoning benchmarks and BIG-BENCH datasets. | en
dc.description.provenance | Submitted by admin ntu (admin@lib.ntu.edu.tw) on 2024-07-03T16:08:05Z. No. of bitstreams: 0 | en
dc.description.provenance | Made available in DSpace on 2024-07-03T16:08:05Z (GMT). No. of bitstreams: 0 | en
dc.description.tableofcontents |
Oral Defense Committee Certification
Acknowledgements
Abstract
Abstract (Chinese)
List of Figures
List of Tables
Chapter 1 Introduction
  1.1 Background and Motivation
  1.2 Research Objective
  1.3 Thesis Organization
Chapter 2 Related Work
  2.1 Large Language Model
    2.1.1 Prompting
    2.1.2 In-context Learning
    2.1.3 Chain-of-Thought Prompting
    2.1.4 Self-Consistency Prompting
  2.2 Multiple Choice Question Answering
    2.2.1 LLMs on MCQA tasks
      2.2.1.1 Cloze Prompting
      2.2.1.2 Multiple Choice Prompting
Chapter 3 Problem Definition
  3.1 Multiple-Choice Question Answering Task
  3.2 Enhancing Multiple Choice Question Answering Tasks with Large Language Models
    3.2.1 Definition
    3.2.2 Notations
Chapter 4 Methodology
  4.1 The Main Idea
    4.1.1 Tips for Multiple Choice Exams
    4.1.2 Diverse Reasoning Path
  4.2 ASMR
    4.2.1 Semantic Aggregation
      4.2.1.1 Concatenation (ASMR-C)
      4.2.1.2 Aggregated Sum (ASMR-A)
    4.2.2 Process of ASMR
Chapter 5 Experiments and Analysis
  5.1 Dataset
    5.1.1 Commonsense Reasoning Dataset
      5.1.1.1 CommonsenseQA
      5.1.1.2 SocialIQA
      5.1.1.3 ARC
    5.1.2 BIG-BENCH Dataset
      5.1.2.1 arithmetic
      5.1.2.2 elementary_math_qa
      5.1.2.3 penguins_in_a_table
      5.1.2.4 physical_intuition
      5.1.2.5 riddle_sense
      5.1.2.6 date_understanding
      5.1.2.7 similarities_abstraction
      5.1.2.8 odd_one_out
      5.1.2.9 understanding_fables
  5.2 Metric of Evaluation
  5.3 Models
  5.4 Implementation Details
    5.4.1 Baseline Methods
  5.5 Main Results
    5.5.1 Commonsense Reasoning Dataset
      5.5.1.1 Response Analysis
      5.5.1.2 Case Study
    5.5.2 BIG-BENCH Dataset
      5.5.2.1 Response Analysis
      5.5.2.2 Case Study
    5.5.3 Model-Agnostic Property
  5.6 Ablation Study
    5.6.1 Effect of Different Decoding Strategies
    5.6.2 Effect of the Number of Selected Answer Choices
    5.6.3 Effect of ASMR with Self-Consistency
  5.7 Comparative Analysis of Different Size LMs
  5.8 Diversity of Responses
    5.8.1 Prompting Methods
    5.8.2 Results
  5.9 Weight Assignment to Different Responses
    5.9.1 Weighted Sum Methods
    5.9.2 Results
Chapter 6 Conclusions
  6.1 Contributions
  6.2 Limitations and Future Work
Bibliography
Appendix A — Prompt Examples for all methods
Appendix B — ASMR Success Case Study for Each Dataset
  B.1 Commonsense Reasoning Dataset
  B.2 BIG-BENCH Dataset
Appendix C — ASMR Failure Case Study for Each Dataset
  C.1 Commonsense Reasoning Dataset
  C.2 BIG-BENCH Dataset
| -
dc.language.iso | en | -
dc.subject | 多選問答 (Multiple Choice Question Answering) | zh_TW
dc.subject | 提示工程 (Prompt Engineering) | zh_TW
dc.subject | 大型語言模型 (Large Language Model) | zh_TW
dc.subject | 推理 (Reasoning) | zh_TW
dc.subject | 語義匹配 (Semantic Matching) | zh_TW
dc.subject | 零樣本情境學習 (Zero-shot In-context Learning) | zh_TW
dc.subject | Large Language Model | en
dc.subject | Reasoning | en
dc.subject | Semantic Matching | en
dc.subject | Prompt Engineering | en
dc.subject | Zero-shot In-context learning | en
dc.subject | Multiple Choice Question Answering | en
dc.title | 匯總語義匹配檢索:透過開放式問題回答來釋放大型語言模型的能力 (Aggregated Semantic Matching Retrieval: Unleashing the Ability of Large Language Models through Open-Ended Question Answering) | zh_TW
dc.title | ASMR: Aggregated Semantic Matching Retrieval Unleashing the Ability of LLM through Open-Ended Question Answering | en
dc.type | Thesis | -
dc.date.schoolyear | 112-2 | -
dc.description.degree | 碩士 (Master) | -
dc.contributor.coadvisor | 項潔 | zh_TW
dc.contributor.coadvisor | Jieh Hsiang | en
dc.contributor.oralexamcommittee | 郭彥伶;蔡芸琤;蔡宗翰 | zh_TW
dc.contributor.oralexamcommittee | Yen-Ling Kuo;Yun-Cheng Tsai;Tzong-Han Tsai | en
dc.subject.keyword | 大型語言模型,多選問答,語義匹配,推理,零樣本情境學習,提示工程 | zh_TW
dc.subject.keyword | Large Language Model,Multiple Choice Question Answering,Semantic Matching,Reasoning,Zero-shot In-context learning,Prompt Engineering | en
dc.relation.page | 88 | -
dc.identifier.doi | 10.6342/NTU202401216 | -
dc.rights.note | 未授權 (not authorized for public access) | -
dc.date.accepted | 2024-06-29 | -
dc.contributor.author-college | 電機資訊學院 (College of Electrical Engineering and Computer Science) | -
dc.contributor.author-dept | 資訊工程學系 (Department of Computer Science and Information Engineering) | -
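The abstract above describes ASMR's core loop: sample open-ended answers from an LLM, embed them alongside the candidate choices, and return the choice with the highest aggregate semantic similarity. Below is a minimal illustrative sketch of that loop, not the thesis's actual implementation: generate_open_ended is a hypothetical placeholder for any LLM sampling call, and the sentence-transformers encoder is an assumed stand-in for whatever embedding model the thesis uses.

```python
# Illustrative sketch of the ASMR matching step described in the abstract.
# Assumptions (not from the thesis record): sentence-transformers as the
# embedding model, and `generate_open_ended` as a placeholder LLM call.
from typing import Callable, List

import numpy as np
from sentence_transformers import SentenceTransformer

encoder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model


def asmr_pick(
    question: str,
    choices: List[str],
    generate_open_ended: Callable[[str, int], List[str]],  # hypothetical LLM call
    n_samples: int = 5,
) -> str:
    # 1. Ask the LLM the question open-ended (choices withheld), sampling
    #    several times to obtain diverse preliminary answers.
    answers = generate_open_ended(question, n_samples)

    # 2. Embed the preliminary answers and the choices; L2-normalization
    #    makes the dot products below cosine similarities.
    ans_emb = encoder.encode(answers, normalize_embeddings=True)
    cho_emb = encoder.encode(choices, normalize_embeddings=True)

    # 3. Aggregate each choice's similarity over all sampled answers
    #    (a sum-style aggregation in the spirit of the ASMR-A variant named
    #    in the table of contents) and return the best-matching choice.
    scores = (ans_emb @ cho_emb.T).sum(axis=0)  # shape: (len(choices),)
    return choices[int(np.argmax(scores))]
```

Embedding the concatenation of the sampled answers as a single string, instead of summing per-answer similarities, would roughly mirror the concatenation variant (ASMR-C) listed in the table of contents; the two coincide when only one answer is sampled.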
Appears in collections: 資訊工程學系 (Department of Computer Science and Information Engineering)

Files in this item:

File | Size | Format
ntu-112-2.pdf (restricted access) | 1.24 MB | Adobe PDF


Except where otherwise noted, items in this system are protected by copyright, with all rights reserved.
