• Sailfish

  • Referenced in 13 articles [sw16828]
  • Sailfish: a flexible multi-GPU implementation of the lattice Boltzmann method. We present Sailfish...
  • TheLMA

  • Referenced in 11 articles [sw12960]
  • TheLMA project: Multi-GPU implementation of the lattice Boltzmann method. In this paper, we describe...
  • SkePU

  • Referenced in 5 articles [sw14958]
  • multi-backend skeleton programming library for multi-GPU systems. We present SkePU, a C++ template ... parallel OpenMP backend. It also supports multi-GPU systems. Copying data between the host...
  • Horovod

  • Referenced in 5 articles [sw28748]
  • training code to take advantage of inter-GPU communication. Depending on the training library ... minimal. Existing methods for enabling multi-GPU training under the TensorFlow library entail non-negligible...
  • SimNet

  • Referenced in 4 articles [sw39551]
  • framework. We present SimNet, an AI-driven multi-physics simulation framework, to accelerate simulations across ... computing, and offers scalable performance for multi-GPU and multi-Node implementation with accelerated linear...
  • KBLAS

  • Referenced in 4 articles [sw17481]
  • tuning parameters, KBLAS efficiently runs on various GPU architectures while avoiding code rewriting and retaining ... kernels have been leveraged to a multi-GPU environment, which requires the introduction...
  • G-DNA

  • Referenced in 2 articles [sw22764]
  • highly efficient multi-GPU/MPI tool for aligning nucleotide reads. G-DNA (GPU-based ... software is very efficient on both multi-GPU machines and MPI+GPU clusters. It computes...
  • HTR solver

  • Referenced in 3 articles [sw40994]
  • open-source exascale-oriented task-based multi-GPU high-order code for hypersonic aerothermodynamics...
  • HPC2

  • Referenced in 3 articles [sw26407]
  • GPUs of AMD and NVIDIA. The multi-GPU scalability is demonstrated up to 256 devices...
  • CLAC

  • Referenced in 2 articles [sw14041]
  • Multi-GPU numerical simulation of electromagnetic waves. In this paper, we present three-dimensional numerical...
  • CUDA-Zero

  • Referenced in 2 articles [sw14130]
  • process of parallelization to multi-GPUs. Starting from a GPU program written in shared memory ... Zero can achieve efficient parallelization in multi-GPU environment...
  • RETURNN

  • Referenced in 2 articles [sw26580]
  • recurrent neural networks in a multi-GPU environment. Features include: Mini-batch training of feed...
  • Gluon

  • Referenced in 1 article [sw41760]
  • produce D-IrGL, the first multi-GPU distributed-memory graph analytics system. Our experiments were ... roughly 70,000 threads and on multi-GPU clusters with up to 64 GPUs...
  • NMF-mGPU

  • Referenced in 1 article [sw26273]
  • mGPU: Non-negative matrix factorization on multi-GPU systems. NMF-mGPU implements the Non-negative ... Device Architecture) framework for GPU Computing. CUDA represents a GPU device as a programmable general ... main memory to the GPU’s memory and processed accordingly. In addition, NMF-mGPU ... Finally, NMF-mGPU also provides a multi-GPU version that makes use of multiple...
  • WarpDrive

  • Referenced in 1 article [sw35093]
  • WarpDrive: Massively Parallel Hashing on Multi-GPU Nodes. Hash maps are among the most versatile ... hash maps supported by existing single-GPU hashing implementations is restricted by the limited amount ... WarpDrive - a scalable, distributed single-node multi-GPU implementation for the construction and querying...
  • MCMG

  • Referenced in 1 article [sw10705]
  • model heterogeneous architectures like CPU and GPU devices and their interactions as computing patterns move ... paper, we introduce MCMG (Multi-CPU Multi-GPU) simulator, a cycle accurate, modular and open...
  • G-MSA

  • Referenced in 1 article [sw22763]
  • common personal computer equipped with NVIDIA GPU (G80, GT200 or Fermi). Extensive tests show ... results remained very high. Moreover, multi-GPU support influences the execution time considerably...
  • PtychoLib

  • Referenced in 1 article [sw33339]
  • PtychoLib: PtychoLib is a parallel Multi-GPU open source library for real-time ptychographic phase...
  • Pkwrap

  • Referenced in 1 article [sw35383]
  • includes the parallel training ability when multi-GPU environments are unavailable and decode with graphs...
  • NaturalCC

  • Referenced in 1 article [sw36364]
  • providing (1) an efficient computation with multi-GPU and mixed-precision data processing for fast...