 
 
 
 
 
   
 Next: About this document ...
 Up: Overhead Compensation in Performance
 Previous: Conclusion
 
- 
 
- 1
- 
D. Bailey, T. Harris, W. Saphir, R. van der Wijngaart,
A. Woo, M. Yarrow, ``The NAS Parallel Benchmarks 2.0,'' Technical
Report NAS-95-020, NASA Ames Research Center, 1995.
 
- 2
- 
G. Bronevetsky, D. Marques, K. Pingali, and P. Stodghill,      
``Collective Operations in an Application-level
Fault Tolerant MPI System,''      
International Conference on Supercomputing (ICS), 2003.
 
- 3
- 
G. Bronevetsky, D. Marques, K. Pingali, and P. Stodghill,      
``Automated Application-level Checkpointing of MPI Programs,''
Principles and Practice of Parallel Programming (PPoPP), 2003.
 
- 4
- 
S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci,
``A Portable Programming Interface for Performance Evaluation on Modern
Processors,'' International Journal of High Performance Computing
Applications, 14(3):189-204, Fall 2000.
 
- 5
- 
H. Brunst, M. Winkler, W. Nagel, H.-C. Hoppe,
``Performance Optimization for Large Scale Computing: The Scalable VAMPIR
Approach,'' In V. Alexandrov, J. Dongarra, B. Juliano, R. Renner, K. Tan,
(eds.), International Conference on Computational Science, Part
II, LNCS 2074, Springer, pp. 751-760, 2001.
 
- 6
- 
L. De Rose,
``The Hardware Performance Monitor Toolkit,''
Euro-Par Conference, 2001.
 
- 7
- 
L. De Rose and F. Wolf
``CATCH - A Call-Graph Based Automatic Tool for Capture
of Hardware Performance Metrics for MPI and OpenMP Applications,''
Euro-Par Conference, LNCS 2400, Springer, pp. 167-176, 2002.
 
- 8
- 
Alain Fagot and Jacques Chassin de Kergommeaux, ``Systems Assessment
of the Overhead of Tracing Parallel Programs,'' Euromicro Workshop on
Parallel and Distributed Processing, pp. 179-186, 1996.
 
- 9
- 
T. Fahringer and C. Seragiotto,
``Experience with Aksum: A Semi-Automatic Multi-Experiment Performance
Analysis Tool for Parallel and Distributed Applications,''
Workshop on Performance Analysis and Distributed Computing, 2002.
 
- 10
- 
S. Graham, P. Kessler, and M. McKusick,
``gprof: A Call Graph Execution Profiler,''
SIGPLAN Symposium on Compiler Construction, pp. 120-126,
June 1982.
 
- 11
- 
R. Hall, ``Call Path Profiling,''
International Conference on Software Engineering,
pp. 296-306, 1992.
 
- 12
- 
J. Hollingsworth and B. Miller, ``An Adaptive Cost System for Parallel
Program Instrumentation,'' Euro-Par Conference,
Volume I, pp. 88-97, August
1996.
 
- 13
- 
IBM, ``Profiling Parallel Programs with Xprofiler,''
IBM Parallel Environment for AIX: Operation and Use, Volume 2.
 
- 14
- 
C. Janssen,
``The Visual Profiler,''
http://aros.ca.sandia.gov/~ cljanss/perf/vprof/.
 
- 15
- 
D. Kranzlmüller, R. Reussner, and C. Schaubschläger, ``Monitor
Overhead Measurement with SKaMPI,'' EuroPVM/MPI Conference,
LNCS 1697, pp. 43-50, 1999.
 
- 16
- 
D. Knuth,
``An Empirical Study of FORTRAN Programs,''
Software Practice and Experience, 1:105-133, 1971.
 
- 17
- 
A. Malony, ``Performance Observability,''
Ph.D. thesis, University of Illinois, Urbana-Champaign, 1991.
 
- 18
- 
A. Malony, D. Reed, and H. Wijshoff,                               
``Performance Measurement Intrusion and Perturbation Analysis,''
IEEE Transactions on Parallel and Distributed Systems,
3(4):433-450, July 1992.
 
- 19
- 
A. Malony and D. Reed,
``Models for Performance Perturbation Analysis,''
ACM/ONR Workshop on Parallel and Distributed Debugging,
pp. 1-12, May 1991.
 
- 20
- 
A. Malony
``Event Based Performance Perturbation: A Case Study,''
Principles and Practices of Parallel Programming (PPoPP),
pp. 201-212, April 1991.
 
- 21
- 
A. Malony, S. Shende, ``Performance Technology for Complex
Parallel and Distributed Systems,'' In G. Kotsis, P. Kacsuk (eds.),
Distributed and Parallel Systems, From Instruction Parallelism to Cluster
Computing, Third Workshop on Distributed and Parallel Systems
(DAPSYS 2000), Kluwer, pp. 37-46, 2000.
 
- 22
- 
J. Mellor-Crummey, R. Fowler, and G. Marin,
``HPCView: A Tool for Top-down Analysis of Node Performance,''
Journal of Supercomputing, 23:81-104, 2002.
 
- 23
- 
P. Mucci, ``Dynaprof,'' http://www.cs.utk.edu/~ mucci/dynaprof
 
- 24
- 
D. Reed, L. DeRose, and Y. Zhang,
``SvPablo: A Multi-Language Performance Analysis System,''
International Conference on Performance Tools,
pp. 352-355, September 1998.
 
- 25
- 
S. Sarukkai and A. Malony,
``Perturbation Analysis of High-Level Instrumentation for
SPMD Programs,''
Principles and Practices of Parallel Programming (PPoPP),
pp. 44-53, May 1993.
 
- 26
- 
Unix Programmer's Manual,
``prof command,''
Section 1, Bell Laboratories, Murray Hill, NJ, January 1979.
 
- 27
- 
J. Vetter, ``Dynamic Statistical Profiling of
Communication Activity in Distributed Applications,''
ACM SIGMETRICS Joint International Conference on Measurement and
Modeling of Computer Systems, ACM, 2002.
 
- 28
- 
F. Wolf and B. Mohr,
``Automatic Performance Analysis of SMP Cluster Applications,''
Technical Report IB 2001-05, Research Centre Juelich, 2001.
 
- 29
- 
W. Williams, T. Hoel, and D. Pase, ``The MPP
Apprentice Performance Tool: Delivering the Performance of the Cray
T3D,''
Programming Environments for Massively Parallel Distributed
Systems, North-Holland, 1994.
 
- 30
- 
M. Zagha, B. Larson, S. Turner, and M. Itzkowitz,
``Performance Analysis Using the MIPS R10000 Performance Counters,''
Supercomputing Conference, November 1996.
 
Sameer Shende
2004-06-08