next up previous
Next: Introduction

Online Remote Trace Analysis of Parallel Applications on High-Performance Clusters

Holger Brunst1,2, Allen D. Malony1, Sameer S. Shende1, Robert Bell1

1 Department for Computer and Information Science
University of Oregon, Eugene, USA
{brunst, malony, sameer, bertie}@cs.uoregon.edu

2 Center for High Performance Computing
Dresden University of Technology, Germany
brunst@zhr.tu-dresden.de

Abstract:

The paper presents the design and development of an online remote trace measurement and analysis system. The work combines the strengths of the TAU performance system with that of the VNG distributed parallel trace analyzer. Issues associated with online tracing are discussed and the problems encountered in system implementation are analyzed in detail. Our approach should port well to parallel platforms. Future work includes testing the performance of the system on large-scale machines.

Keywords: Parallel Computing, Performance Analysis, Performance Steering, Tracing, Clusters





Sameer Suresh Shende 2003-09-12