TAU: The TAU parallel performance system. The ability of performance technology to keep pace with the growing complexity of parallel and distributed systems depends on robust performance frameworks that can at once provide system-specific performance capabilities and support high-level performance problem solving. Flexibility and portability in empirical methods and processes are influenced primarily by the strategies available for instrmentation and measurement, and how effectively they are integrated and composed. This paper presents the TAU (Tuning and Analysis Utilities) parallel performance sytem and describe how it addresses diverse requirements for performance observation and analysis.

References in zbMATH (referenced in 20 articles )

Showing results 1 to 20 of 20.
Sorted by year (citations)

  1. Antonelli, Laura; Corsaro, Stefania; Marino, Zelda; Rizzardi, Mariarosaria: Algorithm 944: Talbot Suite: parallel implementations of Talbot’s method for the numerical inversion of Laplace transforms (2014)
  2. Böhm, Stanislav; Běhálek, Marek; Meca, Ondřej; Šurkovský, Martin: Kaira: development environment for MPI applications (2014) ioport
  3. de la Encina, Alberto; Rodríguez, Ismael; Rubio, Fernando: pHood: tool description, analysis techniques, and case studies (2014) ioport
  4. Ding, Chen; Xiang, Xiaoya; Bao, Bin; Luo, Hao; Luo, Ying-Wei; Wang, Xiao-Lin: Performance metrics and models for shared cache (2014) ioport
  5. Chandrasekaran, Sunita; Shanbagh, Shilpa; Jayaraman, Ramkumar; Maskell, Douglas L.; Cheah, Hui Yan: C2FPGA -- a dependency-timing graph design methodology (2013) ioport
  6. De Cecchis, D.; Drummond, L. A.; Castillo, J. E.: Design of a distributed coupling toolkit for high performance computing environment (2013)
  7. Qawasmeh, Ahmad; Malik, Abid; Chapman, Barbara; Huck, Kevin; Malony, Allen: Open source task profiling by extending the OpenMP runtime API (2013) ioport
  8. Amritkar, Amit; Tafti, Danesh; Liu, Rui; Kufrin, Rick; Chapman, Barbara: OpenMP parallelism for fluid and fluid-particulate systems (2012) ioport
  9. Camata, J. J.; Rossa, A. L.; Valli, Andrea M. P.; Catabriga, Lucia; Carey, Graham F.; Coutinho, Alvaro L. G. A.: Reordering and incomplete preconditioning in serial and parallel adaptive mesh refinement and coarsening flow solutions (2012)
  10. Mohror, Kathryn; Karavanic, Karen L.: Trace profiling: scalable event tracing on high-end parallel systems (2012) ioport
  11. Rutar, Nick; Hollingsworth, Jeffrey K.: Data centric techniques for mapping performance data to program variables (2012) ioport
  12. Vömel, Christof; Tomov, Stanimire; Dongarra, Jack: Divide and conquer on hybrid GPU-accelerated multicore systems (2012)
  13. Crivellini, A.; Bassi, F.: An implicit matrix-free discontinuous Galerkin solver for viscous and turbulent aerodynamic simulations (2011)
  14. Elias, Renato N.; Camata, Jose J.; Aveleda, Albino; Coutinho, Alvaro L. G. A.: Evaluation of message passing communication patterns in finite element solution of coupled problems (2011)
  15. Liu, Xu; Zhan, Jianfeng; Zhan, Kunlin; Shi, Weisong; Yuan, Lin; Meng, Dan; Wang, Lei: Automatic performance debugging of SPMD-style parallel programs (2011) ioport
  16. Ray, J.; Armstrong, R.; Safta, C.; Debusschere, B. J.; Allan, B. A.; Najm, H. N.: Computational frameworks for advanced combustion simulations (2011)
  17. Geimer, Markus; Kuhlmann, Björn; Pulatova, Farzona; Wolf, Felix; Wylie, Brian J. N.: Scalable collation and presentation of call-path profile data with CUBE (2008)
  18. Morris, Alan; Malony, Allen D.; Shende, Sameer S.: Supporting nested OpenMP parallelism in the TAU performance system (2008) ioport
  19. Terrel, A. R.; Scott, L. R.; Knepley, M. G.; Kirby, R. C.: Automated FEM discretizations for the Stokes equation (2008)
  20. Morris, Alan; Malony, Allen D.; Shende, Sameer S.: Supporting nested OpenMP parallelism in the TAU performance system (2007)

Further publications can be found at: https://www.cs.uoregon.edu/research/tau/pubs.php