Computer Science – Performance
Scientific paper
2009-10-26
Computer Science
Performance
18 pages
Scientific paper
The balance metric is a simple approach to estimate the performance of bandwidth-limited loop kernels. However, applying the method to in-cache situations and modern multi-core architectures yields unsatisfactory results. This paper analyzes the in uence of cache hierarchy design on performance predictions for bandwidth-limited loop kernels on current mainstream processors. We present a diagnostic model with improved predictive power, correcting the limitations of the simple balance metric. The importance of code execution overhead even in bandwidth-bound situations is emphasized. Finally we analyze the impact of synchronization overhead on multi-threaded performance with a special emphasis on the in uence of cache topology.
Hager Georg
Treibig Jan
Wellein Gerhard
No associations
LandOfFree
Multi-core architectures: Complexities of performance prediction and the impact of cache topology does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with Multi-core architectures: Complexities of performance prediction and the impact of cache topology, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Multi-core architectures: Complexities of performance prediction and the impact of cache topology will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-562757