Efficient implementation of the overlap operator on multi-GPUs

Physics – High Energy Physics – High Energy Physics - Lattice

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

8 pages with 10 figures; accepted for presentation at the 2011 Symposium on Application Accelerators in High Performance Compu

Scientific paper

Lattice QCD calculations were one of the first applications to show the potential of GPUs in the area of high performance computing. Our interest is to find ways to effectively use GPUs for lattice calculations using the overlap operator. The large memory footprint of these codes requires the use of multiple GPUs in parallel. In this paper we show the methods we used to implement this operator efficiently. We run our codes both on a GPU cluster and a CPU cluster with similar interconnects. We find that to match performance the CPU cluster requires 20-30 times more CPU cores than GPUs.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Efficient implementation of the overlap operator on multi-GPUs does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Efficient implementation of the overlap operator on multi-GPUs, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Efficient implementation of the overlap operator on multi-GPUs will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-394102

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.