• StarPU

  • Referenced in 41 articles [sw14216]
  • computer mixing IBM Cell Broadband Engines and AMD opteron processors. Other architectures, featuring GPU accelerators...
  • Jacket

  • Referenced in 4 articles [sw11529]
  • enables standard Matlab. It greatly simplifies GPU computing for engineers, scientists, and technical computing professionals ... Jacket, most Matlab users are using the GPU and obtaining significant performance gains...
  • GPflowOpt

  • Referenced in 4 articles [sw33396]
  • TensorFlow including automatic differentiation, parallelization and GPU computations for Bayesian optimization. Design goals focus...
  • geomstats

  • Referenced in 11 articles [sw24373]
  • keras. We have enabled GPU implementation and integrated geomstats manifold computations into keras deep learning...
  • Gunrock

  • Referenced in 3 articles [sw27063]
  • expressiveness by coupling high performance GPU computing primitives and optimization strategies with a high-level ... primitives with small code size and minimal GPU programming knowledge...
  • CULA

  • Referenced in 11 articles [sw12745]
  • modern graphics processing unit (GPU) found in many standard personal computers is a highly parallel ... ratio. High-level linear algebra operations are computationally intense, often requiring O(N3) operations ... processing power of the GPU. Our work is on CULA, a GPU accelerated implementation...
  • SpGEMM

  • Referenced in 5 articles [sw14033]
  • well as with three GPU-based implementations. Measurements performed for computing the matrix square ... GPU caching architecture. An improved performance was also found for computing Galerkin products which...
  • ABC-SysBio

  • Referenced in 8 articles [sw24739]
  • SysBio: Approximate Bayesian Computation in Python with GPU support. MOTIVATION: The growing field of systems ... dynamical systems in an approximate Bayesian computation (ABC) framework. ABC-SysBio combines three algorithms...
  • ProxylessNAS

  • Referenced in 6 articles [sw42534]
  • However, the prohibitive computational demand of conventional NAS algorithms (e.g. 104 GPU hours) makes ... network architecture but suffers from the high GPU memory consumption issue (grow linearly w.r.t. candidate ... reduce the computational cost (CPU hours and GPU memory) to the same level of regular...
  • BSGP

  • Referenced in 5 articles [sw08995]
  • programming language for general purpose computation on the GPU. A BSGP program looks much...
  • MPSGPU-SJTU

  • Referenced in 2 articles [sw29217]
  • novel acceleration technique, graphics processing unit (GPU) parallel computing, is applied in MPS. Based ... GPU technique, an in-house solver MPSGPU-SJTU has been developed by using compute unified ... accuracy of GPU solver is verified by these comparisons. Moreover, the computation time of every ... results show that computational efficiency is improved dramatically by employing GPU acceleration technique...
  • RSVDPACK

  • Referenced in 6 articles [sw13832]
  • computing partial singular value decompositions via randomized sampling on single core, multi core, and GPU ... randomized algorithms for computing partial Singular Value Decompositions (SVDs). The techniques largely follow the prescriptions ... implement a number of low rank SVD computing routines for three different sets of hardware ... multi core CPU, and (3) massively multicore GPU...
  • BLAZE-DEM

  • Referenced in 2 articles [sw14055]
  • specifically targeted for Graphical Processing Unit (GPU) platforms. BLAZE-DEM uses actual polyhedral particle representations ... different geometries [4]. The use of computational modeling tools is essential in evaluation of various ... designs and processes as computational power increases [5]. However current DEM simulations are only able ... clusters [6]. The dramatic increase in GPU computing power has enabled the computational simulation...
  • EAGL

  • Referenced in 2 articles [sw08231]
  • EAGL), a self-contained GPU library, to support parallel computing of bilinear pairings based ... GPU pipeline vs. memory access latency are highly complex for parallelization of pairing computations. Overall ... main performance bottleneck for pairing computations on the tested GPU device, and the lazy reduction ... offer substantial performance improvement for GPU-based pairing computations...
  • Parallel Colt

  • Referenced in 2 articles [sw17569]
  • Colt, a multithreaded Java library for scientific computing and image processing. In addition to describing ... library. Performance comparisons with MATLAB, including GPU computations via AccelerEyes’ Jacket toolbox are also given...
  • LightSpMV

  • Referenced in 2 articles [sw23689]
  • some alternative storage formats for GPU computing. Unfortunately, these alternatives are incompatible with most ... from CSR at runtime, thus incurring significant computational and storage overheads. We present LightSpMV ... reveals that on the same Tesla K40c GPU, LightSpMV is superior to both CUSP...
  • Ripser++

  • Referenced in 2 articles [sw35629]
  • Ripser++: GPU-accelerated computation of Vietoris–Rips persistence barcodes. Ripser++ utilizes the massive parallelism hidden ... computation of Vietoris-Rips persistence barcodes by taking mathematical and algorithmic oppurtunities we have identified ... GPU compared to that on CPU for Ripser. After dimension 0 persistence computation, there ... from matrix reduction all on GPU, leaving the computation of submatrix reduction on the remaining...
  • Horovod

  • Referenced in 5 articles [sw28748]
  • often provided by GPUs. Scaling computation from one GPU to many can enable much faster...
  • BioEM

  • Referenced in 1 article [sw16813]
  • BioEM: GPU-accelerated computing of Bayesian inference of electron microscopy images. In cryo-electron microscopy ... demanding. Here we present highly parallelized, GPU-accelerated computer software that performs this task efficiently ... parallelization combined with both CPU and GPU computing. The resulting BioEM software scales nearly ideally ... both on pure CPU and on CPU+GPU architectures, thus enabling Bayesian analysis of tens...
  • Fireflies

  • Referenced in 1 article [sw14288]
  • interactively exploring dynamical systems using GPU computing. In nonlinear systems, where explicit analytic solutions usually ... power of graphical processing unit (GPU) computing to produce spectacular interactive visualizations of arbitrary systems ... massively parallel nature of GPU hardware, Fireflies is able to simulate millions of trajectories ... parallel (even on standard desktop computer hardware), producing “swarms” of particles that move around...