Algorithm 679

Algorithm 679: A set of Level 3 Basic Linear Algebra Subprograms. This paper describes a model implementation and test software for the Level 3 Basic Linear Algebra Subprograms (Level 3 BLAS). The Level 3 BLAS are targeted at matrix-matrix operations, with the aim of providing more efficient, but portable, implementations of algorithms on high-performance computers. The model implementation provides a portable set of FORTRAN 77 Level 3 BLAS for machines where specialized implementations do not exist or are not required. The test software aims to verify that specialized implementations meet the specification of the Level 3 BLAS and that implementations are correctly installed.
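As a rough illustration of the kind of matrix-matrix operation the Level 3 BLAS named in the title cover, the sketch below computes the general matrix multiply update C := alpha*A*B + beta*C. It is an unoptimized reference version in Python, not the FORTRAN 77 code from the paper; the function name `gemm_reference` and the list-of-lists storage are assumptions made for this example, and the transpose options of the real routine are omitted.

```python
def gemm_reference(alpha, a, b, beta, c):
    """Return alpha * (a * b) + beta * c for matrices stored as lists of rows.

    Hypothetical helper for illustration only; it mirrors the definition of
    the GEMM operation, not the model implementation described above.
    """
    m, k, n = len(a), len(b), len(b[0])
    if any(len(row) != k for row in a):
        raise ValueError("inner dimensions of a and b do not match")
    if len(c) != m or any(len(row) != n for row in c):
        raise ValueError("c must be m-by-n")

    # Start from beta * c, then accumulate alpha * a * b into it.
    out = [[beta * c[i][j] for j in range(n)] for i in range(m)]
    for i in range(m):
        for p in range(k):
            aip = alpha * a[i][p]
            for j in range(n):
                out[i][j] += aip * b[p][j]
    return out


a = [[1.0, 2.0], [3.0, 4.0]]
b = [[5.0, 6.0], [7.0, 8.0]]
c = [[1.0, 1.0], [1.0, 1.0]]
result = gemm_reference(2.0, a, b, 0.5, c)
```

A slow but obviously correct routine like this can be compared against the output of an optimized implementation on small inputs, which is similar in spirit to what the test software is described as doing.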

This software is also peer-reviewed by the journal TOMS.

References in zbMATH (referenced in 65 articles)

Showing results 1 to 20 of 65.

  1. Abdelfattah, Ahmad; Costa, Timothy; Dongarra, Jack; Gates, Mark; Haidar, Azzam; Hammarling, Sven; Higham, Nicholas J.; Kurzak, Jakub; Luszczek, Piotr; Tomov, Stanimire; Zounon, Mawussi: A set of batched basic linear algebra subprograms and LAPACK routines (2021)
  2. Ji, Hao; Mascagni, Michael; Li, Yaohang: Gaussian variant of Freivalds’ algorithm for efficient and reliable matrix product verification (2020)
  3. Amestoy, Patrick R.; de la Kethulle de Ryhove, Sébastien; L’Excellent, Jean-Yves; Moreau, Gilles; Shantsev, Daniil V.: Efficient use of sparsity by direct solvers applied to 3D controlled-source EM problems (2019)
  4. Dongarra, Jack; Gates, Mark; Haidar, Azzam; Kurzak, Jakub; Luszczek, Piotr; Wu, Panruo; Yamazaki, Ichitaro; Yarkhan, Asim; Abalenkovs, Maksims; Bagherpour, Negin; Hammarling, Sven; Šístek, Jakub; Stevens, David; Zounon, Mawussi; Relton, Samuel D.: PLASMA: Parallel linear algebra software for multicore using OpenMP (2019)
  5. Lang, Bruno: Efficient reduction of banded Hermitian positive definite generalized eigenvalue problems to banded standard eigenvalue problems (2019)
  6. Springer, Paul; Bientinesi, Paolo: Design of a high-performance GEMM-like tensor-tensor multiplication (2018)
  7. Elmar Peise; Paolo Bientinesi: Algorithm 979: Recursive Algorithms for Dense Linear Algebra - The ReLAPACK Collection (2017) not zbMATH
  8. Ji, Hao; Li, Yaohang: Block conjugate gradient algorithms for least squares problems (2017)
  9. Peise, Elmar; Bientinesi, Paolo: Algorithm 979: Recursive algorithms for dense linear algebra -- the ReLAPACK collection (2017)
  10. Springer, Paul; Hammond, Jeff R.; Bientinesi, Paolo: TTC: a high-performance compiler for tensor transpositions (2017)
  11. Hager, William W.; Zhang, Hongchao: Projection onto a polyhedron that exploits sparsity (2016)
  12. Veremieiev, S.; Thompson, H. M.; Gaskell, P. H.: Free-surface film flow over topography: full three-dimensional finite element solutions (2015)
  13. Ballard, G.; Carson, E.; Demmel, J.; Hoemmen, M.; Knight, N.; Schwartz, O.: Communication lower bounds and optimal algorithms for numerical linear algebra (2014)
  14. Gustavson, Fred G.; Waśniewski, Jerzy; Dongarra, Jack J.; Herrero, José R.; Langou, Julien: Level-3 Cholesky factorization routines improve performance of many Cholesky algorithms (2013)
  15. Utsuno, Yutaka; Shimizu, Noritaka; Otsuka, Takaharu; Abe, Takashi: Efficient computation of Hamiltonian matrix elements between non-orthogonal Slater determinants (2013)
  16. Kacem, S.; Eichwald, O.; Ducasse, O.; Renon, N.; Yousfi, M.; Charrada, K.: Full multi grid method for electric field computation in point-to-plane streamer discharge in air at atmospheric pressure (2012)
  17. D’Alberto, Paolo; Bodrato, Marco; Nicolau, Alexandru: Exploiting parallelism in matrix-computation kernels for symmetric multiprocessor systems: matrix-multiplication and matrix-addition algorithm optimizations by software pipelining and threads allocation (2011)
  18. Drmač, Zlatko: A global convergence proof for cyclic Jacobi methods with block rotations (2010)
  19. Gustavson, Fred G.; Waśniewski, Jerzy; Dongarra, Jack J.; Langou, Julien: Rectangular full packed format for Cholesky’s algorithm: factorization, solution, and inversion (2010)
  20. Baboulin, Marc; Buttari, Alfredo; Dongarra, Jack; Kurzak, Jakub; Langou, Julie; Langou, Julien; Luszczek, Piotr; Tomov, Stanimire: Accelerating scientific computations with mixed precision algorithms (2009)
