請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/66422
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 闕志鴻 | |
dc.contributor.author | Hsi-Yu Schive | en |
dc.contributor.author | 薛熙于 | zh_TW |
dc.date.accessioned | 2021-06-17T00:35:04Z | - |
dc.date.available | 2012-03-19 | |
dc.date.copyright | 2012-03-19 | |
dc.date.issued | 2011 | |
dc.date.submitted | 2012-02-06 | |
dc.identifier.citation | [1] O. Agertz et al. MNRAS, 380:963, 2007.
[2] D. Aubert and R. Teyssier. ApJ, 724:244, 2010. [3] J. Barnes and P. Hut. Nature, 324:446, 1986. [4] M. J. Berger and P. Colella. J. Comput. Phys., 82:64, 1989. [5] M. J. Berger and J. Oliger. J. Comput. Phys., 53:484, 1984. [6] E. Bertschinger. ApJS, 58:39, 1985. [7] G. L. Bryan and M. L. Norman. In ASP Conf. Ser. 123, Computational Astrophysics, 1997. [8] P. Colella. J. Comput. Phys., 87:171, 1990. [9] P. Colella and P. R. Woodward. J. Comput. Phys., 54:174, 1984. [10] G. Efstathiou, M. Davis, C. S. Frenk, and S. D. M. White. ApJS, 57:241, 1985. [11] B. Einfeldt, C. D. Munz, P. L. Roe, and B. SjAogreen. J. Comput. Phys., 92:273, 1991. [12] S. A. E. G. Falle. MNRAS, 250:581, 1991. [13] M. Frigo and S. G. Johnson. In ICASSP Conf. Proc., volume 3, page 1381, 1998. [14] B. Fryxell et al. ApJS, 131:273, 2000. [15] E. Gaburov, S. Harfst, and S. Portegies Zwart. New Astron., 14:630, 2009. [16] T. A. Gardiner and J. M. Stone. J. Comput. Phys., 227:4123, 2008. [17] R. A. Gingold and J. J. Monaghan. MNRAS, 181:375, 1977. [18] S. K. Godunov. Math. Sbornik, 47:271, 1959. [19] T. Hamada, K. Nitadori, K. Benkrid, Y. Ohno, G. Morimoto, T. Masada, Y. Shibata, K. Oguri, and M. Taiji. CSRD, 24:21, 2009. [20] A. H. Hassan, C. J. Fluke, and D. G. Barnes. New Astron., 16:100, 2011. [21] W. Hu, R. Barkana, and A. Gruzinov. Phys. Rev. Lett., 85:1158, 2000. [22] S. Jin and Z. Xin. Commun. Pure Appl. Math., 48:235, 1995. [23] A. A. Klypin and S. F. Shandarin. MNRAS, 204:891, 1983. [24] A. Knebe, A. Green, and J. Binney. MNRAS, 325:845, 2001. [25] A. V. Kravtsov, A. A. Klypin, and A. M. Khokhlov. ApJS, 111:73, 1997. [26] L. D. Landau and E. M. Lifshitz. Fluid Mechanics. Oxford: Pergamon Press, 2 edition. [27] L. B. Lucy. AJ, 82:1013, 1977. [28] P. MacNeice, K. M. Olson, C. Mobarry, R. de Fainchtein, and C. Packer. Comput. Phys. Commun., 126:330, 2000. [29] D. F. Martin and P. Colella. J. Comput. Phys., 163:271, 2000. [30] F. Miniati and P. Colella. J. Comput. Phys., 227:400, 2007. [31] NVIDIA, Santa Clara, CA, USA. NVIDIA CUDA C Programming Guide, 4.0 edition, 2011. [32] U. Pen, P. Arras, and S. Wong. ApJS, 149:447, 2003. [33] T. Plewa and E. MAuller. Comput. Phys. Commun., 138:101, 2001. [34] W. H. Press, S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery. Numerical Recipes. The Art of Scientic Computing. Cambridge: Cambridge Univ. Press, 3 edition. [35] P. M. Ricker. ApJS, 176:293, 2008. [36] P. L. Roe. J. Comput. Phys., 43:357, 1981. [37] H. Schive, C. Chien, S.Wong, Y. Tsai, and T. Chiueh. New Astron., 13:418, 2008. [38] H. Schive, Y. Tsai, and T. Chiueh. ApJS, 186:457, 2010. [39] H. Schive, Y. Tsai, and T. Chiueh. In Computational Star Formation. Proceedings of IAU Symposium 270, volume 6, page 401, 2011. [40] H. Schive, U. Zhang, and T. Chiueh. IJHPCA, 2011. accepted for publication, DOI: 10.1177/1094342011428146. [41] U. Seljak and M. Zaldarriaga. ApJ, 469:437, 1996. [42] G. A. Sod. J. Comput. Phys., 27:1, 1978. [43] V. Springel, N. Yoshida, and S. D. M. White. New Astron., 6:79, 2001. [44] R. Spurzem, P. Berczik, I. Berentzen, W. Ge, X. Wang, H. Schive, K. Nitadori, T. Hamada, and J. Fiestas. chapter 3. Wiley Publishers, 2011. [45] J. M. Stone and T. A. Gardiner. New Astron., 14:139, 2009. [46] J. M. Stone, T. A. Gardiner, Hawley J. F. Teuben, P., and J. B. Simon. ApJS, 178:137, 2008. [47] G Strang. SIAM J. Numer. Anal., 5:506, 1968. [48] E. J. Tasker, R. Brunino, N. L. Mitchell, D. Michielsen, S. Hopton, F. R. Pearce, G. L. Bryan, and T. Theuns. MNRAS, 390:1267, 2008. [49] R. Teyssier. A & A, 385:337, 2002. [50] E. F. Toro. Riemann Solvers and Numerical Methods for Fluid Dynamics. A Practical Introduction. Berlin: Springer, 3 edition. [51] H. Trac and U. Pen. PASP, 115:303, 2003. [52] K. Umetsu, T. Broadhurst, A. Zitrin, E. Medezinski, D. Coe, and M. Postman. ApJ, 738:41, 2011. [53] B. van Leer. J. Comput. Phys., 14:361, 1974. [54] B. van Leer. J. Comput. Phys., 32:101, 1979. [55] P. Wang, T. Abel, and R. Kaehler. New Astron., 15:581, 2010. [56] T. Woo and T. Chiueh. ApJ, 697:850, 2009. [57] H. Xu, H. Li, D. Collins, S. Li, and M. L. Norman. ApJ, 681:61, 2008. [58] H. Xu, H. Li, D. Collins, S. Li, and M. L. Norman. ApJ, 698:14, 2009. [59] U. Ziegler. A&A, 435:385, 2005. | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/66422 | - |
dc.description.abstract | We present the newly developed code, GAMER (GPU-accelerated Adaptive MEsh Refinement code), which has adopted a novel approach to improve the performance of adaptive mesh refinement (AMR) astrophysical simulations by a large factor with the use of the graphic processing units (GPU). The AMR implementation is based on constructing a hierarchy of grid patches with an octree data structure. A hybrid CPU/GPU model is adopted, in which the time-consuming PDE solvers are implemented into GPUs and the complicated AMR data structure is manipulated by CPUs. The code is fully parallelized using either the rectangular domain decomposition or the Hilbert space-filling curve method. Several performance optimization strategies have been implemented in order to fully exploit the computing power in heterogeneous CPU/GPU clusters. To demonstrate the extraordinary capability of GAMER, we have implemented two astrophysical applications into the code, namely, the hydrodynamic system with self-gravity and the extremely light bosonic dark matter (ELBDM) model. In both cases, we first verify the accuracy of the code by performing several test problems with analytical solutions and estimate the numerical errors. We then demonstrate the high performance of GAMER by comparing the performances with and without GPU acceleration, in which maximum performance speed-ups of 68 and 35 are achieved in the hydrodynamic simulations and the ELBDM simulations, respectively. | en |
dc.description.provenance | Made available in DSpace on 2021-06-17T00:35:04Z (GMT). No. of bitstreams: 1 ntu-100-F92222006-1.pdf: 136171913 bytes, checksum: 1c4e06b0eb7f3338115cfac753e87289 (MD5) Previous issue date: 2011 | en |
dc.description.tableofcontents | 誌謝 iii
Abstract v 1 Introduction 1 1.1 Adaptive Mesh Refinement (AMR) . . . . . . . . . . . . . . . . . . . 1 1.2 Graphic Processing Units (GPU) . . . . . . . . . . . . . . . . . . . . 5 2 GAMER: an AMR + GPU Framework 9 2.1 AMR Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . 9 2.1.1 Data Structure . . . . . . . . . . . . . . . . . . . . . . . . . 9 2.1.2 Multi-level Integration . . . . . . . . . . . . . . . . . . . . . 11 2.1.3 Grid Refinement . . . . . . . . . . . . . . . . . . . . . . . . 12 2.1.4 Initialization . . . . . . . . . . . . . . . . . . . . . . . . . . 15 2.2 GPU Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . 17 2.3 Parallelization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 2.3.1 Rectangular Domain Decomposition . . . . . . . . . . . . . . 20 2.3.2 Hilbert Space-filling Curve . . . . . . . . . . . . . . . . . . . 23 2.4 Performance Optimization . . . . . . . . . . . . . . . . . . . . . . . 26 2.4.1 Asynchronous Memory Copy between CPU and GPU . . . . 27 2.4.2 Concurrent Execution between CPU and GPU . . . . . . . . 28 2.4.3 Hybrid OpenMP/MPI/GPU parallelization . . . . . . . . . . . 32 2.5 Out-of-core Computing . . . . . . . . . . . . . . . . . . . . . . . . . 34 3 Application I: Hydrodynamics with Self-gravity 39 3.1 Numerical Algorithms . . . . . . . . . . . . . . . . . . . . . . . . . 40 3.1.1 Directionally Split Hydrodynamic Schemes . . . . . . . . . . 40 3.1.2 Directionally Unsplit Hydrodynamic Schemes . . . . . . . . 44 3.1.3 Self-gravity . . . . . . . . . . . . . . . . . . . . . . . . . . . 47 3.2 GPU Implementations . . . . . . . . . . . . . . . . . . . . . . . . . 50 3.2.1 Directionally Split Hydrodynamic Schemes . . . . . . . . . . 50 3.2.2 Directionally Unsplit Hydrodynamic Schemes . . . . . . . . 54 3.2.3 Self-gravity . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 3.3 Accuracy Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 3.3.1 Acoustic Wave Test . . . . . . . . . . . . . . . . . . . . . . . 63 3.3.2 Shock Tube Test . . . . . . . . . . . . . . . . . . . . . . . . 65 3.3.3 Sedov-Taylor Blast Wave Test . . . . . . . . . . . . . . . . . 68 3.3.4 Spherical Collapse Test . . . . . . . . . . . . . . . . . . . . . 70 3.4 Performance Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75 3.4.1 Performance of the GPU Hydrodynamic Solvers . . . . . . . 75 3.4.2 Performance of the GPU Poisson Solver . . . . . . . . . . . . 79 3.4.3 Overall Performance in Uniform Mesh . . . . . . . . . . . . 79 3.4.4 Overall Performance in AMR . . . . . . . . . . . . . . . . . 83 3.4.5 Future Prospects . . . . . . . . . . . . . . . . . . . . . . . . 88 4 Application II: Extremely Light Bosonic Dark Matter 91 4.1 Numerical Algorithms . . . . . . . . . . . . . . . . . . . . . . . . . 92 4.1.1 Integration Schemes . . . . . . . . . . . . . . . . . . . . . . 93 4.2 GPU Implementations . . . . . . . . . . . . . . . . . . . . . . . . . 95 4.3 Accuracy Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 4.3.1 Linear Waves in the Comoving Frame . . . . . . . . . . . . . 97 4.3.2 Gaussian Packet . . . . . . . . . . . . . . . . . . . . . . . . 99 4.4 Performance Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101 4.5 Future Prospects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106 5 Discussions and Conclusions 109 Bibliography 113 | |
dc.language.iso | zh-TW | |
dc.title | 高速圖形顯示卡於自適性網格天文物理電腦模擬之應用與效能 | zh_TW |
dc.title | A Novel Adaptive Mesh Refinement Code with Graphic-Processing-Unit Acceleration and Its Applications to Astrophysics | en |
dc.type | Thesis | |
dc.date.schoolyear | 100-1 | |
dc.description.degree | 博士 | |
dc.contributor.oralexamcommittee | 黃偉彥,賴詩萍,麥善德(Sandor Molnar),梅津敬一(Keiichi Umetsu),湯姆 布羅德赫斯特(Tom Broadhurst) | |
dc.subject.keyword | 圖形顯示卡,自適性網格,流體力學,重力, | zh_TW |
dc.subject.keyword | graphic-processing-unit,adaptive-mesh-refinement,hydrodynamics,gravity, | en |
dc.relation.page | 116 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2012-02-06 | |
dc.contributor.author-college | 理學院 | zh_TW |
dc.contributor.author-dept | 物理研究所 | zh_TW |
顯示於系所單位: | 物理學系 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-100-1.pdf 目前未授權公開取用 | 132.98 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。