請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/61223
標題: | 應用巨量健保資料分析診斷治療之趨勢 Trend Analysis of Theranostics in Big Data Derived from the National Health Insurance Research Database |
作者: | Han-Fang Cheng 鄭涵方 |
指導教授: | 翁昭旼(Jau-Min Wong) |
共同指導教授: | 蔣以仁(I-Jen Chiang) |
關鍵字: | 巨量資料,NoSQL資料庫,MongoDB,Shard Key, Big Data,NoSQL database,MongoDB,Shard Key, |
出版年 : | 2013 |
學位: | 碩士 |
摘要: | 有別於傳統關聯式資料庫需依賴JOIN才能進行跨表單查詢的作法,非關聯式資料庫(NoSQL:Not Only SQL)具有Schema-free的資料儲存特性與Sharding的資料分片機制,因此適合被用來處理巨量資料。此外,依據Shard Key設定值執行Sharding,將巨量資料切割成小範圍區塊來加速查詢速度。即便如此,就我們所知,目前尚未有一個成熟且系統化的方式管理(包含檢索與視覺化)巨量健保資料。因此,本研究以病患歸人檔之文件導向方式儲存巨量健保資料,並探討健保資料庫所提供的欄位屬性,歸納出12項在診斷治療上的重要欄位,將此選定為Shard Key進行Sharding並執行目標查詢(Targeted Query)以提高檢索效率。並以一範例進行查詢時間效能測試,據實驗結果顯示,本研究所提出的資料處理方法確實能大幅度地縮短巨量的健保資料的查詢時間。 NoSQL (Not Only SQL) database has schema-free data format and the function of sharding. Comparing the NoSQL database with the relational database, the NoSQL database is more suitable to handle the big data. The big data is sharding into small blocks which are based on shard keys to speed up queries answering. To our knowledge, there is not yet a mature and systematic approach (including retrieval and visualization) to managing the big data derived from the National Health Insurance Research Database. Therefore, our research used patient document-oriented way to average the big medical data storage and to explore the field properties of the health insurance research database. By summarizing 12 important fields as shard keys in theranostics, before executing target queries can improve search efficiency. According to our experimental results, it shows that the proposed method of data processing can indeed significantly reduce the big data query time. |
URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/61223 |
全文授權: | 有償授權 |
顯示於系所單位: | 醫學工程學研究所 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-102-1.pdf 目前未授權公開取用 | 4.41 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。