next up previous
Next: About this document ... Up: Overhead Compensation in Performance Previous: Conclusion


D. Bailey, T. Harris, W. Saphir, R. van der Wijngaart, A. Woo, M. Yarrow, ``The NAS Parallel Benchmarks 2.0,'' Technical Report NAS-95-020, NASA Ames Research Center, 1995.

G. Bronevetsky, D. Marques, K. Pingali, and P. Stodghill, ``Collective Operations in an Application-level Fault Tolerant MPI System,'' International Conference on Supercomputing (ICS), 2003.

G. Bronevetsky, D. Marques, K. Pingali, and P. Stodghill, ``Automated Application-level Checkpointing of MPI Programs,'' Principles and Practice of Parallel Programming (PPoPP), 2003.

S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci, ``A Portable Programming Interface for Performance Evaluation on Modern Processors,'' International Journal of High Performance Computing Applications, 14(3):189-204, Fall 2000.

H. Brunst, M. Winkler, W. Nagel, H.-C. Hoppe, ``Performance Optimization for Large Scale Computing: The Scalable VAMPIR Approach,'' In V. Alexandrov, J. Dongarra, B. Juliano, R. Renner, K. Tan, (eds.), International Conference on Computational Science, Part II, LNCS 2074, Springer, pp. 751-760, 2001.

L. De Rose, ``The Hardware Performance Monitor Toolkit,'' Euro-Par Conference, 2001.

L. De Rose and F. Wolf ``CATCH - A Call-Graph Based Automatic Tool for Capture of Hardware Performance Metrics for MPI and OpenMP Applications,'' Euro-Par Conference, LNCS 2400, Springer, pp. 167-176, 2002.

Alain Fagot and Jacques Chassin de Kergommeaux, ``Systems Assessment of the Overhead of Tracing Parallel Programs,'' Euromicro Workshop on Parallel and Distributed Processing, pp. 179-186, 1996.

T. Fahringer and C. Seragiotto, ``Experience with Aksum: A Semi-Automatic Multi-Experiment Performance Analysis Tool for Parallel and Distributed Applications,'' Workshop on Performance Analysis and Distributed Computing, 2002.

S. Graham, P. Kessler, and M. McKusick, ``gprof: A Call Graph Execution Profiler,'' SIGPLAN Symposium on Compiler Construction, pp. 120-126, June 1982.

R. Hall, ``Call Path Profiling,'' International Conference on Software Engineering, pp. 296-306, 1992.

J. Hollingsworth and B. Miller, ``An Adaptive Cost System for Parallel Program Instrumentation,'' Euro-Par Conference, Volume I, pp. 88-97, August 1996.

IBM, ``Profiling Parallel Programs with Xprofiler,'' IBM Parallel Environment for AIX: Operation and Use, Volume 2.

C. Janssen, ``The Visual Profiler,'' cljanss/perf/vprof/.

D. Kranzlmüller, R. Reussner, and C. Schaubschläger, ``Monitor Overhead Measurement with SKaMPI,'' EuroPVM/MPI Conference, LNCS 1697, pp. 43-50, 1999.

D. Knuth, ``An Empirical Study of FORTRAN Programs,'' Software Practice and Experience, 1:105-133, 1971.

A. Malony, ``Performance Observability,'' Ph.D. thesis, University of Illinois, Urbana-Champaign, 1991.

A. Malony, D. Reed, and H. Wijshoff, ``Performance Measurement Intrusion and Perturbation Analysis,'' IEEE Transactions on Parallel and Distributed Systems, 3(4):433-450, July 1992.

A. Malony and D. Reed, ``Models for Performance Perturbation Analysis,'' ACM/ONR Workshop on Parallel and Distributed Debugging, pp. 1-12, May 1991.

A. Malony ``Event Based Performance Perturbation: A Case Study,'' Principles and Practices of Parallel Programming (PPoPP), pp. 201-212, April 1991.

A. Malony, S. Shende, ``Performance Technology for Complex Parallel and Distributed Systems,'' In G. Kotsis, P. Kacsuk (eds.), Distributed and Parallel Systems, From Instruction Parallelism to Cluster Computing, Third Workshop on Distributed and Parallel Systems (DAPSYS 2000), Kluwer, pp. 37-46, 2000.

J. Mellor-Crummey, R. Fowler, and G. Marin, ``HPCView: A Tool for Top-down Analysis of Node Performance,'' Journal of Supercomputing, 23:81-104, 2002.

P. Mucci, ``Dynaprof,'' mucci/dynaprof

D. Reed, L. DeRose, and Y. Zhang, ``SvPablo: A Multi-Language Performance Analysis System,'' International Conference on Performance Tools, pp. 352-355, September 1998.

S. Sarukkai and A. Malony, ``Perturbation Analysis of High-Level Instrumentation for SPMD Programs,'' Principles and Practices of Parallel Programming (PPoPP), pp. 44-53, May 1993.

Unix Programmer's Manual, ``prof command,'' Section 1, Bell Laboratories, Murray Hill, NJ, January 1979.

J. Vetter, ``Dynamic Statistical Profiling of Communication Activity in Distributed Applications,'' ACM SIGMETRICS Joint International Conference on Measurement and Modeling of Computer Systems, ACM, 2002.

F. Wolf and B. Mohr, ``Automatic Performance Analysis of SMP Cluster Applications,'' Technical Report IB 2001-05, Research Centre Juelich, 2001.

W. Williams, T. Hoel, and D. Pase, ``The MPP Apprentice Performance Tool: Delivering the Performance of the Cray T3D,'' Programming Environments for Massively Parallel Distributed Systems, North-Holland, 1994.

M. Zagha, B. Larson, S. Turner, and M. Itzkowitz, ``Performance Analysis Using the MIPS R10000 Performance Counters,'' Supercomputing Conference, November 1996.

Sameer Shende 2004-06-08