Please use this Handle URI to cite this document: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/94049
Full metadata record
DC Field | Value | Language
dc.contributor.advisor | 陳銘憲 | zh_TW
dc.contributor.advisor | Ming-Syan Chen | en
dc.contributor.author | 賴宥辰 | zh_TW
dc.contributor.author | Yu-Chen Lai | en
dc.date.accessioned | 2024-08-14T16:26:09Z | -
dc.date.available | 2024-08-15 | -
dc.date.copyright | 2024-08-13 | -
dc.date.issued | 2024 | -
dc.date.submitted | 2024-08-10 | -
dc.identifier.citation | [1] Ahmedhelow. Exploring LLMs’ performance on accounting exams. Published in Sage AI, March 27, 2024.
[2] Marc Eulerich, Aida Sanatizadeh, Hamid Vakilzadeh, and David A. Wood. Is it all hype? ChatGPT’s performance and disruptive potential in the accounting and auditing industries, November 17, 2023. Available at SSRN: https://ssrn.com/abstract=4452175 or http://dx.doi.org/10.2139/ssrn.4452175.
[3] Chris Gaetano. We had ChatGPT take the CPA exam - and it failed. Accounting Today, May 8, 2023.
[4] Marc Eulerich, Aida Sanatizadeh, Hamid Vakilzadeh, and David Wood. Can artificial intelligence pass accounting certification exams? ChatGPT: CPA, CMA, CIA, and EA? SSRN Electronic Journal, January 2023.
[5] OpenAI. GPT-3.5 technical report, 2022.
[6] Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, and Douwe Kiela. Retrieval-augmented generation for knowledge-intensive NLP tasks. In H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 9459–9474. Curran Associates, Inc., 2020.
[7] Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat, and Mingwei Chang. Retrieval augmented language model pre-training. In Hal Daumé III and Aarti Singh, editors, Proceedings of the 37th International Conference on Machine Learning, volume 119 of Proceedings of Machine Learning Research, pages 3929–3938. PMLR, 13–18 Jul 2020.
[8] Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, and Haofen Wang. Retrieval-augmented generation for large language models: A survey, 2024.
[9] Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, and Denny Zhou. Chain-of-thought prompting elicits reasoning in large language models, 2023.
[10] Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, and Yuan Cao. ReAct: Synergizing reasoning and acting in language models. In International Conference on Learning Representations (ICLR), 2023.
[11] ekmungai. Python accounting, 2024.
[12] LangChain Contributors. LangChain. https://github.com/hwchase17/langchain, 2024.
[13] Bernardino Romera-Paredes and Philip Torr. An embarrassingly simple approach to zero-shot learning. In Francis Bach and David Blei, editors, Proceedings of the 32nd International Conference on Machine Learning, volume 37 of Proceedings of Machine Learning Research, pages 2152–2161, Lille, France, 07–09 Jul 2015. PMLR.
[14] Michael Wooldridge and Nicholas R. Jennings. Intelligent agents: Theory and practice. The Knowledge Engineering Review, 10(2):115–152, 1995.
[15] Scott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Alexander Novikov, Gabriel Barth-Maron, Mai Gimenez, Yury Sulsky, Jackie Kay, Jost Tobias Springenberg, et al. A generalist agent. arXiv preprint arXiv:2205.06175, 2022.
[16] Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, et al. The rise and potential of large language model based agents: A survey. arXiv preprint arXiv:2309.07864, 2023.
[17] Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo, Junzhe Wang, Dingwen Yang, Chenyang Liao, Xin Guo, Wei He, et al. AgentGym: Evolving large language model-based agents across diverse environments. arXiv preprint arXiv:2406.04151, 2024.
[18] OpenAI. OpenAI embedding model. https://platform.openai.com/docs/guides/embeddings, 2024.
[19] Anton Troynikov and Suvansh Sanjeev. Chroma technical report. https://github.com/chroma-core/chroma, https://www.trychroma.com/, 2024.
-
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/94049 | -
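dc.description.abstract | The rapid growth in the use of large language models (LLMs) in artificial intelligence highlights their advanced capabilities in text processing and generation. However, their proficiency in specialized fields such as accounting and finance remains under scrutiny, particularly on complex tasks such as the Certified Public Accountant (CPA) exam. In the United States, the CPA exam is overseen by the American Institute of Certified Public Accountants (AICPA) and comprises four sections: Auditing and Attestation (AUD), Business Environment and Concepts (BEC), Financial Accounting and Reporting (FAR), and Regulation (REG). Previous studies show that LLMs, including ChatGPT, struggle with the complex problem-solving scenarios and diverse question types of the CPA exam, which indicates that further improvement is needed to handle such domain-specific tasks effectively.
To address the challenges of the CPA exam, a new method called Retrieval Augmented Reasoning (RAR) is introduced, raising the average pass rate from 0.5 to 0.62. RAR uses a task router to divide questions into knowledge-intensive and reasoning-intensive categories. For knowledge-intensive questions, it uses Retrieval Augmented Generation (RAG) to extract relevant information from an external database and improve answer accuracy. For reasoning-intensive questions, RAR adopts the ReAct (reasoning and acting), Agent, and Chain of Thought (CoT) methods and integrates external tools such as an accounting Python library, mimicking the real exam environment to solve complex problems effectively.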
zh_TW
dc.description.abstract | The surge in the use of Large Language Models (LLMs) in artificial intelligence highlights their advanced capabilities in text processing and generation. However, their proficiency in specialized fields such as accounting and finance remains under scrutiny, particularly on complex tasks like the Certified Public Accountant (CPA) examination. The CPA exam, overseen by the American Institute of CPAs (AICPA), comprises four sections: Auditing and Attestation (AUD), Business Environment and Concepts (BEC), Financial Accounting and Reporting (FAR), and Regulation (REG). Prior research indicates that LLMs, including ChatGPT, struggle with the exam's complex problem-solving scenarios and varied question types, which shows that further improvement is needed before they can handle such domain-specific tasks effectively.
To address the challenges of the CPA exam, this thesis introduces a new method called Retrieval Augmented Reasoning (RAR), which improves the average pass rate from 0.5 to 0.62. RAR employs a task router to classify questions into knowledge-intensive and reasoning-intensive categories. For knowledge-intensive questions, it uses Retrieval Augmented Generation (RAG) to extract relevant information from external databases, which improves answer accuracy. For reasoning-intensive questions, RAR combines the ReAct, Agent, and Chain of Thought (CoT) approaches and integrates external tools such as an accounting Python library, mimicking the real exam environment in order to solve complex problems effectively.
en
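The routing described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the thesis's implementation: the keyword rule in `route`, the word-overlap `retrieve`, and the `straight_line_depreciation` tool are hypothetical stand-ins for the LLM-based task router, the embedding-based retrieval, and the accounting Python library that the abstract mentions.

```python
import re
from dataclasses import dataclass

# Hypothetical cues: a question with digits plus one of these phrases is
# treated as reasoning-intensive. The thesis's actual router is not shown here.
CALC_CUES = ("calculate", "compute", "how much", "what amount")


@dataclass
class Routed:
    category: str  # "knowledge" or "reasoning"
    context: str   # retrieved passage or tool result handed to the model


def words(text: str) -> set[str]:
    """Lowercased alphabetic tokens of a string."""
    return set(re.findall(r"[a-z]+", text.lower()))


def route(question: str) -> str:
    """Classify a question as knowledge- or reasoning-intensive (toy rule)."""
    text = question.lower()
    has_number = re.search(r"\d", text) is not None
    has_cue = any(cue in text for cue in CALC_CUES)
    return "reasoning" if has_number and has_cue else "knowledge"


def retrieve(question: str, passages: list[str]) -> str:
    """Return the passage sharing the most words with the question.

    Word overlap stands in for the embedding search a real RAG system uses.
    """
    q_words = words(question)
    return max(passages, key=lambda p: len(q_words & words(p)))


def straight_line_depreciation(cost: float, salvage: float, years: int) -> float:
    """Example tool an agent could call instead of doing arithmetic itself."""
    return (cost - salvage) / years


def handle(question: str, passages: list[str]) -> Routed:
    category = route(question)
    if category == "knowledge":
        return Routed(category, retrieve(question, passages))
    # A ReAct-style agent would choose the tool and its arguments itself;
    # here the first three numbers are parsed in order as a stand-in, so a
    # question with fewer than three numbers raises ValueError.
    nums = [float(n) for n in re.findall(r"\d+(?:\.\d+)?", question.replace(",", ""))]
    cost, salvage, years = nums[:3]
    result = straight_line_depreciation(cost, salvage, int(years))
    return Routed(category, f"annual depreciation = {result}")
```

In a full pipeline the `context` field would be placed into the prompt sent to the language model. The split matters because retrieval helps when the answer is a fact to look up, while a calculation tool helps when the answer must be derived from numbers in the question.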
dc.description.provenance | Submitted by admin ntu (admin@lib.ntu.edu.tw) on 2024-08-14T16:26:09Z. No. of bitstreams: 0 | en
dc.description.provenance | Made available in DSpace on 2024-08-14T16:26:09Z (GMT). No. of bitstreams: 0 | en
dc.description.tableofcontents | Oral Examination Committee Certification
Acknowledgements
Abstract (Chinese)
Abstract
Contents
List of Tables
List of Figures
1 Introduction
2 Problems and Contribution
2.1 Problems
2.2 Contribution
3 Related Works
3.1 Use ChatGPT to take the CPA exam
3.1.1 Retrieval Augmented Generation (RAG)
3.1.2 ReAct
3.1.3 Agent
4 Methodology
4.1 Task router for classification
4.2 RAG for knowledge-intensive questions
4.3 RAR for reasoning-intensive questions
4.3.1 Agent from ReAct
4.3.2 Chain of Thought (CoT)
5 Experiments
5.1 Datasets and experiment setting
5.2 Zero-shot prompting
5.3 Results
5.3.1 Comparison of zero-shot prompting, RAG, and RAR
5.3.2 RAG on knowledge-intensive questions
5.3.3 RAR on reasoning-intensive questions
5.3.4 More analysis
6 Conclusion
7 Future Works
Reference
-
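dc.language.iso | en | -
dc.subject | Chain of Thought | zh_TW
dc.subject | CPA exam | zh_TW
dc.subject | Retrieval Augmented Reasoning | zh_TW
dc.subject | Retrieval Augmented Generation | zh_TW
dc.subject | Reasoning and Acting | zh_TW
dc.subject | Agent | zh_TW
dc.subject | ReAct | en
dc.subject | RAR | en
dc.subject | RAG | en
dc.subject | CoT | en
dc.subject | CPA | en
dc.title | Solving the CPA Exam with Retrieval Augmented Reasoning | zh_TW
dc.title | RAR: Tackling Reasoning-Intensive CPA Exams with Retrieval Augmented Reasoning | en
dc.type | Thesis | -
dc.date.schoolyear | 112-2 | -
dc.description.degree | Master's | -
dc.contributor.oralexamcommittee | 葉彌妍; 賴冠廷; 吳齊人 | zh_TW
dc.contributor.oralexamcommittee | Mi-Yen Yeh; Kuan-Ting Lai; Chi-Jen Wu | en
dc.subject.keyword | CPA exam, Retrieval Augmented Reasoning, Retrieval Augmented Generation, Reasoning and Acting, Agent, Chain of Thought | zh_TW
dc.subject.keyword | CPA, RAR, RAG, ReAct, CoT | en
dc.relation.page | 27 | -
dc.identifier.doi | 10.6342/NTU202402573 | -
dc.rights.note | Authorization granted (open access worldwide) | -
dc.date.accepted | 2024-08-13 | -
dc.contributor.author-college | College of Electrical Engineering and Computer Science | -
dc.contributor.author-dept | Department of Electrical Engineering | -
dc.date.embargo-lift | 2025-12-31 | -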
Appears in collections: Department of Electrical Engineering

Files in this item:
File | Size | Format
ntu-112-2.pdf | 1.91 MB | Adobe PDF