TAU Publications

  1. S. Shende, A. D. Malony, J. Cuny, K. Lindlan, P. Beckman and S. Karmesin, Portable Profiling and Tracing for Parallel Scientific Applications using C++, Appears in: Proceedings of SPDT'98: ACM SIGMETRICS Symposium on Parallel and Distributed Tools, pp. 134-145, Aug. 1998.

    Summary: descibes the TAU portable profiling package and its application to the ACTS Toolkit.

  2. A. Malony and S. Shende, Performance Technology for Complex Parallel and Distributed Systems, Proc. Third Austrian-Hungarian Workshop on Distributed and Parallel Systems, DAPSYS 2000, "Distributed and Parallel Systems: From Concepts to Applications," (Eds. G. Kotsis and P. Kacsuk) Kluwer, Norwell, MA, pp. 37-46, 2000.

    Summary: describes how TAU is applied to a multi-threaded multi-context execution model.

  3. S. Shende, and A. D. Malony, Integration and Application of the TAU Performance System in Parallel Java Environments Proceedings of the Joint ACM Java Grande - ISCOPE 2001 Conference, June 2001.

    Summary: describes TAU's application to mpiJava, for Parallel Java programs.

  4. S. Shende, A. D. Malony, and R. Ansell-Bell, Instrumentation and Measurement Strategies for Flexible and Portable Empirical Performance Evaluation, Proceedings Tools and Techniques for Performance Evaluation Workshop, PDPTA'01 , C.S.R.E.A., June 2001.

    Summary: describes TAU's instrumentation alternatives including DyninstAPI for parallel programs.

  5. T. Sheehan, A. Malony, S. Shende, A Runtime Monitoring Framework for the TAU Profiling System, Proceedings of the Third International Symposium on Computing in Object-Oriented Parallel Environments (ISCOPE'99), San Francisco, CA, December 1999.

    Summary: describes the TAU Monitoring Framework.

  6. S. Shende, A. D. Malony and S. Hackstadt, Dynamic Performance Callstack Sampling: Merging TAU and DAQV, Appears in: B. Kågström, J. Dongarra, E. Elmroth and J. Wasniewski (editors). Applied Parallel Computing, 4th International Workshop, PARA'98, Lecture Notes in Computer Science, No. 1541, Springer-Verlag, Berlin, 1998. (c) Springer-Verlag

    Summary: describes how callstacks can be sampled in TAU using DAQV-II.

  7. S. Shende, Profiling and Tracing in Linux, Appears in: Proceedings of the Extreme Linux Workshop #2, USENIX, Monterey CA, June 1999.

    Summary: gives a brief overview of profiling and tracing tools in the context of Linux operating system.

  8. Advanced Computing Laboratory, Los Alamos National Laboratory: TAU: Tuning and Analysis Utilities , Los Alamos National Laboratory Publication LALP-99-205, November 1999.

    Summary: A short four page summary of TAU. Produced as a flyer for Supercomping '99.

  9. S. Shende, J. Cuny, L. Hansen, J. Kundu, S. McLaughry and O. Wolf, Event and State-Based Debugging in TAU:A Prototype, Appears in: Proceedings of SPDT'96: ACM SIGMETRICS Symposium on Parallel and Distributed Tools, pp. 21-30, May 1996.

    Summary: describes a multilevel debugging strategy that combines both event- and state-based debugging approaches within the TAU program analysis environment for pC++.

  10. K. Windisch, B. Mohr, A. Malony, A Brief Technical Overview of the TAU Tools. Unpublished.

    Summary: A very brief look at the design of the TAU environment.

  11. B. Mohr, A. Malony, J. Cuny, TAU. In G. Wilson, editor, Parallel Programming using C++, M.I.T. Press, 1996.

    Summary: gives the most complete description of the TAU environment.

  12. D. Brown, A. Malony, B. Mohr, Language-based Parallel Program Interaction: the Breezy Approach, Appears in: Proceedings of the International Conference on High Performance Computing (HiPC'95),India, December 1995.

    Summary: describes the design and architecture of the breezy tool.

  13. K. Shanmugam, A. Malony, B. Mohr, Speedy: An Integrated Performance Extrapolation Tool for pC++ Programs, Appears in: Proceedings of the Joint Conference PERFORMANCE TOOLS'95 and MMB'95, 20th-22nd September, 1995, Heidelberg, Germany.

    Summary: A new TAU tool, speddy, is described which is a graphical interface to the pC++ performance extrapolation tool ExtraP. Speedy/Extrap allow analyzing the performance of pC++ programs without actually running them on a parallel computer.

  14. A. Malony, B. Mohr, P. Beckman, D. Gannon, Program Analysis and Tuning Tools for a Parallel Object Oriented Language: An Experiment with the TAU System, Appears in: Proceedings of the Workshop on Parallel Scientific Computing, Cape Cod, Maine, October 1994.

    Summary: The use of the TAU tools is illustrated from the perspective of the design and evaluation of a single application in pC++: a bitonic sort module that is used as part of a large N-Body simulation of cosmological evolution.

  15. S. Hackstadt, A. Malony, B. Mohr, Scalable Performance Visualization for Data-Parallel Programs, Proceedings of the Scalable High Performance Computing Conference (SHPCC), Knoxville, Tennessee, May 1994.

    Summary: presents several performance visualization techniques based on the context of data-parallel programming and execution that demonstrate good visual scalability properties.

  16. D. Brown, S. Hackstadt, A. Malony, B. Mohr, Program Analysis Environments for Parallel Language Systems: The TAU Environment, Proceedings of the 2nd Workshop on Environments and Tools For Parallel Scientific Computing, Townsend, Tennessee, pp. 162-171, May 1994.

    Summary: A companion paper to the CONPAR94 article. After an overview about the TAU program analysis tools is given, the barrier breakpoint debugger, breezy, is discussed.

  17. B. Mohr, D. Brown, A. Malony, TAU: A Portable Parallel Program Analysis Environment for pC++, Proceedings of CONPAR 94 - VAPP VI, University of Linz, Austria, LNCS 854, pp. 29-40, September 1994.

    Summary: describes the TAU program analysis tools: fancy (file and class browser), cagey (callgraph browser), classy (class hierarchy browser), racy (profile data browser), and easy (event and state viewer)

  18. A. Malony, B. Mohr, P. Beckman, D. Gannon, S. Yang, F. Bodin, Performance Analysis of pC++: A Portable Data-Parallel Programming System for Scalable Parallel Computers, Proceedings of the 8th International Parallel Processing Symbosium (IPPS), Cancún, Mexico, pp. 75-85, April 1994.

    Summary: describes profiling and tracing capabilities of pC++ and gives detailed results of speedup measurements for four benchmark programs

  19. F. Bodin, P. Beckman, D. Gannon, S. Yang, S. Kesavan, A. Malony, B. Mohr, Implementing a Parallel C++ Runtime System for Scalable Parallel Systems, Proceedings of the 1993 Supercomputing Conference, Portland, Oregon, pp. 588-597, November 1993.

    Summary: gives an overview of pC++, a parallel version of C++ and the implemementation of its runtime system on a variety of distributed and shared memory machines

  20. B. Mohr, Standardization of Event Traces Considered Harmful or Is an Implementation of Object-Independent Event Trace Monitoring and Analysis Systems Possible?, Proceedings of the CNRS-NSF Workshop on Environments and Tools For Parallel Scientific Computing, St. Hilaire du Touvet, France, Elsevier, Advances in Parallel Computing, Vol. 6, pp. 103-124, September 1992.

    Summary: describes how to write event trace analysis tools in a way that they are able to read and analyze traces of arbitrary origin

There is more documentation available through the pC++ / Sage++ archive.

Next: Additional Information Up: TAU Home Page Prev: TAU for HPF

Thu Jan 5 1995