Communication-optimal Parallel and Sequential Cholesky Decomposition

Computer Science – Numerical Analysis

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

29 pages, 2 tables, 6 figures

Scientific paper

10.1137/090760969

Numerical algorithms have two kinds of costs: arithmetic and communication, by which we mean either moving data between levels of a memory hierarchy (in the sequential case) or over a network connecting processors (in the parallel case). Communication costs often dominate arithmetic costs, so it is of interest to design algorithms minimizing communication. In this paper we first extend known lower bounds on the communication cost (both for bandwidth and for latency) of conventional (O(n^3)) matrix multiplication to Cholesky factorization, which is used for solving dense symmetric positive definite linear systems. Second, we compare the costs of various Cholesky decomposition implementations to these lower bounds and identify the algorithms and data structures that attain them. In the sequential case, we consider both the two-level and hierarchical memory models. Combined with prior results in [13, 14, 15], this gives a set of communication-optimal algorithms for O(n^3) implementations of the three basic factorizations of dense linear algebra: LU with pivoting, QR and Cholesky. But it goes beyond this prior work on sequential LU by optimizing communication for any number of levels of memory hierarchy.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Communication-optimal Parallel and Sequential Cholesky Decomposition does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Communication-optimal Parallel and Sequential Cholesky Decomposition, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Communication-optimal Parallel and Sequential Cholesky Decomposition will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-718219

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.