Computer Science – Distributed – Parallel – and Cluster Computing
Scientific paper
2010-04-26
Computer Science
Distributed, Parallel, and Cluster Computing
10 pages, 11 figures. Some clarifications and corrections
Scientific paper
Exploiting the performance of today's processors requires intimate knowledge of the microarchitecture as well as an awareness of the ever-growing complexity in thread and cache topology. LIKWID is a set of command-line utilities that addresses four key problems: Probing the thread and cache topology of a shared-memory node, enforcing thread-core affinity on a program, measuring performance counter metrics, and toggling hardware prefetchers. An API for using the performance counting features from user code is also included. We clearly state the differences to the widely used PAPI interface. To demonstrate the capabilities of the tool set we show the influence of thread pinning on performance using the well-known OpenMP STREAM triad benchmark, and use the affinity and hardware counter tools to study the performance of a stencil code specifically optimized to utilize shared caches on multicore chips.
Hager Georg
Treibig Jan
Wellein Gerhard
No associations
LandOfFree
LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-31851