Bibliography

1
S. Shende and A. Malony, ``The TAU Parallel Performance System,'' In International Journal of High Performance Computing Applications, ACTS Collection Special Issue, Summer 2006.

2
A. Malony and S. Shende, ``Performance Technology for Complex Parallel and Distributed Systems,'' In G. Kotsis, P. Kacsuk (eds.), Distributed and Parallel Systems, From Instruction Parallelism to Cluster Computing, Third Workshop on Distributed and Parallel Systems (DAPSYS 2000), Kluwer, pp. 37-46, 2000.

3
A. D. Malony, S. Shende, and A. Morris, ``Phase-Based Parallel Performance Profiling,'' In Proceedings of the PARCO 2005 conference, 2005.

4
S. Shende, A. D. Malony, and A. Morris, ``Optimization of Instrumentation in Parallel Performance Evaluation Tools,'' Proc. of PARA 2006 conference, June 2006.

5
R. Bell, A. D. Malony, and S. Shende, ``A Portable, Extensible, and Scalable Tool for Parallel Performance Profile Analysis,'' Proc. Euro-Par 2003 Conference, LNCS 2790, Springer, Berlin, pp. 17-26, 2003.

6
A. D. Malony and S. S. Shende, ``Overhead Compensation in Performance Profiling,'' Proc. Euro-Par 2004 Conference, LNCS 3149, Springer, pp. 119-132, 2004.

7
S. Shende, A. D. Malony, A. Morris, and F. Wolf, ``Performance Profiling Overhead Compensation for MPI Programs,'' in Proc. EuroPVM/MPI 2005 Conference, (eds. B. Di Martino et al.), LNCS 3666, Springer, pp. 359-367, 2005.

8
S. Shende, A. D. Malony, A. Morris, and F. Wolf, ``Performance Profiling Overhead Compensation for MPI Programs,'' in Proc. EuroPVM/MPI 2005 Conference, (eds. B. Di Martino et al.), LNCS 3666, Springer, pp. 359-367, 2005.

9
F. Song and F. Wolf, ``CUBE User Manual,'' ICL Technical Report, ICL-UT-04-01, February 2, 2004.

10
S. Graham, P. Kessler, and M. McKusick, ``gprof: A Call Graph Execution Profiler,'' SIGPLAN Symposium on Compiler Construction, pp. 120-126, June 1982.

11
S. Graham, P. Kessler, and M. McKusick, ``An Execution Profiler for Modular Programs,'' Software: Practice and Experience, Volume 13, pp. 671-685, August 1983.

12
B. Mohr and F. Wolf, ``KOJAK - A Tool Set for Automatic Performance Analysis of Parallel Applications,'' Proc. of the European Conference on Parallel Computing, Springer-Verlag, LNCS 2790, pp. 1301-1304, August 26-29, 2003.

13
A. Malony, S. Shende, N. Trebon, J. Ray, R. Armstrong, C. Rasmussen, and M. Sottile, ``Performance Technology for Parallel and Distributed Component Software,'' Concurrency and Computation: Practice and Experience, Vol. 17, Issue 2-4, pp. 117-141, John Wiley & Sons, Ltd., Feb - Apr, 2005.

14
A. D. Malony, S. Shende, R. Bell, K. Li, L. Li, and N. Trebon, ``Advances in the TAU Performance System,'' chapter in ``Performance Analysis and Grid Computing,'' (eds. V. Getov, M. Gerndt, A. Hoisie, A. Malony, B. Miller), Kluwer, Norwell, MA, pp. 129-144, 2003.

15
N. Trebon, A. Morris, J. Ray, S. Shende, and A. Malony, ``Performance Modeling of Component Assemblies with TAU,'' Proc. Workshop on Component Models and Frameworks in High Performance Computing (CompFrame 2005).

16
K. Lindlan, J. Cuny, A. Malony, S. Shende, B. Mohr, R. Rivenburgh, and C. Rasmussen, ``A Tool Framework for Static and Dynamic Analysis of Object-Oriented Software with Templates,'' SC 2000 conference, 2000.

17
K. A. Huck, A. D. Malony, R. Bell, and A. Morris, ``Design and Implementation of a Parallel Performance Data Management Framework,'' In Proceedings of International Conference on Parallel Processing (ICPP 2005), IEEE Computer Society, 2005.

18
K. A. Huck and A. D. Malony, ``PerfExplorer: A Performance Data Mining Framework for Large-Scale Parallel Computing,'' In Proceedings of SC 2005 conference, ACM, 2005.

19
S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci, ``A Portable Programming Interface for Performance Evaluation on Modern Processors,'' International Journal of High Performance Computing Applications, 14(3):189-204, Fall 2000.

20
B. Buck and J. Hollingsworth, ``An API for Runtime Code Patching,'' International Journal of High Performance Computing Applications, 14(4):317-329, 2000.

21
M. Syamlal, W. Rogers, and T. O'Brien, ``MFIX documentation: Theory Guide, Technical Note,'' DOE/METC-95/1013, 1993.

22
M. Syamlal, ``MFIX documentation: Numerical Technique,'' EG&G Technical Report DE-AC21-95MC31346, 1998.

23
MFIX, URL: http://www.mfix.org, 2006.

24
C. Fryer and O. E. Potter, ``Experimental Investigation of Models for Fluidized Bed Catalytic Reactors,'' AIChE J., 22, 38-47, 1976.

25
GNU, ``GNU Fortran 95,'' URL: http://gcc.gnu.org/fortran/, 2006.

26
G. Watson, ``Debug Malloc Library,'' URL: http://www.dmalloc.com, 2006.

27
Silicon Graphics Inc., ``Open SpeedShop For Linux,'' URL: http://oss.sgi.com/projects/openspeedshop/, 2006.

28
L. DeRose, ``Performance Visualization on the Cray XT3,'' URL: http://www.psc.edu/training/XT3_Oct05/lectures/CrayXT3Apprentice.pdf, 2006.

29
V. Herrarte and E. Lusk, ``Studying Parallel Program Behavior with Upshot,'' Technical Report ANL-91/15, Mathematics and Computer Science Division, Argonne National Laboratory, Aug. 1991.



Scott Biersdorff 2006-05-05