Astronomy and Astrophysics – Astrophysics – Instrumentation and Methods for Astrophysics
Scientific paper
2011-06-09
Journal of Computational Physics. Volume 231, Issue 7, 1 April 2012, Pages 2825-2839
Astronomy and Astrophysics
Astrophysics
Instrumentation and Methods for Astrophysics
Accepted version. Published in Journal of Computational Physics. 35 pages, 12 figures, single column
Scientific paper
10.1016/j.jcp.2011.12.024
We present parallel algorithms for constructing and traversing sparse octrees on graphics processing units (GPUs). The algorithms are based on parallel-scan and sort methods. To test the performance and feasibility, we implemented them in CUDA in the form of a gravitational tree-code which completely runs on the GPU.(The code is publicly available at: http://castle.strw.leidenuniv.nl/software.html) The tree construction and traverse algorithms are portable to many-core devices which have support for CUDA or OpenCL programming languages. The gravitational tree-code outperforms tuned CPU code during the tree-construction and shows a performance improvement of more than a factor 20 overall, resulting in a processing rate of more than 2.8 million particles per second.
Bédorf Jeroen
Gaburov Evghenii
Portegies Zwart Simon
No associations
LandOfFree
A sparse octree gravitational N-body code that runs entirely on the GPU processor does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with A sparse octree gravitational N-body code that runs entirely on the GPU processor, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A sparse octree gravitational N-body code that runs entirely on the GPU processor will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-319508