• CUDA

  • Referenced in 1325 articles [sw03258]
  • GPU-accelerated applications. The CUDA Toolkit includes a compiler for NVIDIA GPUs, math libraries...
  • CUBLAS

  • Referenced in 84 articles [sw06880]
  • CUBLAS library is an implementation of BLAS (Basic Linear Algebra Subprograms ... computational resources of NVIDIA Graphics Processing Unit (GPU), but does not auto-parallelize across multiple ... CUBLAS library, the application must allocate the required matrices and vectors in the GPU memory ... from the GPU memory space back to the host. The CUBLAS library also provides helper...
  • Keras

  • Referenced in 204 articles [sw15491]
  • Keras if you need a deep learning library that: allows for easy and fast prototyping ... output training). runs seamlessly on CPU and GPU. Read the documentation at Keras.io. Keras...
  • AmgX

  • Referenced in 14 articles [sw13440]
  • AmgX: a library for GPU accelerated algebraic multigrid and preconditioned iterative methods. The solution ... implementation of the AmgX library, which provides drop-in GPU acceleration of distributed algebraic multigrid ... AmgX library achieves $2$-$5\times$ speedup on a single GPU against a competitive implementation...
  • OpenGL

  • Referenced in 136 articles [sw06740]
  • OpenGL (Open Graphics Library) is a cross-language, multi-platform application programming interface ... interact with a Graphics processing unit (GPU), to achieve hardware-accelerated rendering. OpenGL was developed...
  • MAGMA

  • Referenced in 55 articles [sw12741]
  • algebra library similar to LAPACK but for heterogeneous/hybrid architectures, starting with current "Multicore+GPU" systems...
  • cuRAND

  • Referenced in 19 articles [sw11536]
  • CUDA Random Number Generation library (cuRAND) delivers high performance GPU-accelerated random number generation ... cuRAND library delivers high quality random numbers 8x faster using hundreds of processor cores available ... within your CUDA functions/kernels running on the GPU. A variety of RNG algorithms and distribution...
  • torchdiffeq

  • Referenced in 59 articles [sw35082]
  • PyTorch Implementation of Differentiable ODE Solvers. This library provides ordinary differential equation (ODE) solvers implemented ... fully supported to run on the GPU...
  • cuFFT

  • Referenced in 24 articles [sw11258]
  • cuFFT library provides a simple interface for computing FFTs on an NVIDIA GPU, which allows ... parallelism of the GPU in a highly optimized and tested FFT library. The cuFFT product...
  • Theano

  • Referenced in 95 articles [sw05894]
  • Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions ... integration with numpy, transparent use of a GPU, efficient symbolic differentiation, speed and stability optimizations...
  • Kokkos

  • Referenced in 28 articles [sw20455]
  • from Github. Kokkos is a templated C++ library that provides two key abstractions ... GPU, Intel Phi, or many-core CPU. The Kokkos library also provides data abstractions...
  • Horovod

  • Referenced in 5 articles [sw28748]
  • complications. First, the training library must support inter-GPU communication. Depending on the particular methods ... take advantage of inter-GPU communication. Depending on the training library’s API, the modification ... methods for enabling multi-GPU training under the TensorFlow library entail non-negligible communication overhead ... single-GPU training. In this paper we introduce Horovod, an open source library that improves...
  • GPflow

  • Referenced in 25 articles [sw21518]
  • using tensorflow. GPflow is a Gaussian process library that uses TensorFlow for its core computations ... software testing and is able to exploit GPU hardware...
  • GLift

  • Referenced in 5 articles [sw06826]
  • generic template library for defining complex, random-access graphics processor (GPU) data structures ... Like modern CPU data structure libraries, Glift enables GPU programmers to separate algorithms from data...
  • CUDAnative.jl

  • Referenced in 6 articles [sw31975]
  • CUDAdrv.jl: interfacing with the CUDA driver library; CUDAnative.jl: GPU kernel programming support; CuArrays.jl: array programming...
  • EAGL

  • Referenced in 2 articles [sw08231]
  • EAGL: an elliptic curve arithmetic GPU-based library for bilinear pairing. In this paper ... based library (EAGL), a self-contained GPU library, to support parallel computing of bilinear pairings ... advantage of the parallel processing power of GPU, with no shared memory bank conflict...
  • CatBoost

  • Referenced in 7 articles [sw31560]
  • popular publicly available datasets. The library has a GPU implementation of learning algorithm ... significantly faster than other gradient boosting libraries on ensembles of similar sizes...
  • TORC

  • Referenced in 4 articles [sw18633]
  • Runtime Library for Platform-Independent Task Parallelism. With the increasing diversity of computing systems ... extended with the advent of general-purpose GPU devices. In this paper we present ... task library for heterogeneous and GPU clusters. The library offers an intuitive programming interface...
  • SkePU

  • Referenced in 5 articles [sw14958]
  • multi-backend skeleton programming library for multi-GPU systems. We present SkePU, a C++ template ... library which provides a simple and unified interface for specifying data-parallel computations with ... parallel OpenMP backend. It also supports multi-GPU systems. Copying data between the host...
  • moderngpu

  • Referenced in 2 articles [sw23687]
  • Modern GPU library. moderngpu is a productivity library for general-purpose computing on GPUs...