
StarPU
 Referenced in 41 articles
 computer mixing IBM Cell Broadband Engines and AMD Opteron processors. Other architectures, featuring GPU accelerators...

Jacket
 Referenced in 4 articles
 enables standard Matlab. It greatly simplifies GPU computing for engineers, scientists, and technical computing professionals ... Jacket, most Matlab users are using the GPU and obtaining significant performance gains...

GPflowOpt
 Referenced in 4 articles
 TensorFlow including automatic differentiation, parallelization and GPU computations for Bayesian optimization. Design goals focus...

geomstats
 Referenced in 11 articles
 keras. We have enabled GPU implementation and integrated geomstats manifold computations into keras deep learning...

Gunrock
 Referenced in 3 articles
 expressiveness by coupling high performance GPU computing primitives and optimization strategies with a high-level ... primitives with small code size and minimal GPU programming knowledge...

CULA
 Referenced in 11 articles
 modern graphics processing unit (GPU) found in many standard personal computers is a highly parallel ... ratio. High-level linear algebra operations are computationally intense, often requiring O(N^3) operations ... processing power of the GPU. Our work is on CULA, a GPU-accelerated implementation...

SpGEMM
 Referenced in 5 articles
 well as with three GPU-based implementations. Measurements performed for computing the matrix square ... GPU caching architecture. An improved performance was also found for computing Galerkin products which...

ABCSysBio
 Referenced in 8 articles
 SysBio: Approximate Bayesian Computation in Python with GPU support. MOTIVATION: The growing field of systems ... dynamical systems in an approximate Bayesian computation (ABC) framework. ABCSysBio combines three algorithms...

ProxylessNAS
 Referenced in 6 articles
 However, the prohibitive computational demand of conventional NAS algorithms (e.g. 10^4 GPU hours) makes ... network architecture but suffers from the high GPU memory consumption issue (grow linearly w.r.t. candidate ... reduce the computational cost (CPU hours and GPU memory) to the same level of regular...

BSGP
 Referenced in 5 articles
 programming language for general purpose computation on the GPU. A BSGP program looks much...

MPSGPUSJTU
 Referenced in 2 articles
 novel acceleration technique, graphics processing unit (GPU) parallel computing, is applied in MPS. Based ... GPU technique, an in-house solver MPSGPUSJTU has been developed by using compute unified ... accuracy of GPU solver is verified by these comparisons. Moreover, the computation time of every ... results show that computational efficiency is improved dramatically by employing GPU acceleration technique...

RSVDPACK
 Referenced in 6 articles
 computing partial singular value decompositions via randomized sampling on single core, multi core, and GPU ... randomized algorithms for computing partial Singular Value Decompositions (SVDs). The techniques largely follow the prescriptions ... implement a number of low rank SVD computing routines for three different sets of hardware ... multi core CPU, and (3) massively multicore GPU...

BLAZEDEM
 Referenced in 2 articles
 specifically targeted for Graphical Processing Unit (GPU) platforms. BLAZEDEM uses actual polyhedral particle representations ... different geometries [4]. The use of computational modeling tools is essential in evaluation of various ... designs and processes as computational power increases [5]. However current DEM simulations are only able ... clusters [6]. The dramatic increase in GPU computing power has enabled the computational simulation...

EAGL
 Referenced in 2 articles
 EAGL), a self-contained GPU library, to support parallel computing of bilinear pairings based ... GPU pipeline vs. memory access latency are highly complex for parallelization of pairing computations. Overall ... main performance bottleneck for pairing computations on the tested GPU device, and the lazy reduction ... offer substantial performance improvement for GPU-based pairing computations...

Parallel Colt
 Referenced in 2 articles
 Colt, a multithreaded Java library for scientific computing and image processing. In addition to describing ... library. Performance comparisons with MATLAB, including GPU computations via AccelerEyes’ Jacket toolbox are also given...

LightSpMV
 Referenced in 2 articles
 some alternative storage formats for GPU computing. Unfortunately, these alternatives are incompatible with most ... from CSR at runtime, thus incurring significant computational and storage overheads. We present LightSpMV ... reveals that on the same Tesla K40c GPU, LightSpMV is superior to both CUSP...

Ripser++
 Referenced in 2 articles
 Ripser++: GPU-accelerated computation of Vietoris–Rips persistence barcodes. Ripser++ utilizes the massive parallelism hidden ... computation of Vietoris–Rips persistence barcodes by taking mathematical and algorithmic opportunities we have identified ... GPU compared to that on CPU for Ripser. After dimension 0 persistence computation, there ... from matrix reduction all on GPU, leaving the computation of submatrix reduction on the remaining...

Horovod
 Referenced in 5 articles
 often provided by GPUs. Scaling computation from one GPU to many can enable much faster...

BioEM
 Referenced in 1 article
 BioEM: GPU-accelerated computing of Bayesian inference of electron microscopy images. In cryo-electron microscopy ... demanding. Here we present highly parallelized, GPU-accelerated computer software that performs this task efficiently ... parallelization combined with both CPU and GPU computing. The resulting BioEM software scales nearly ideally ... both on pure CPU and on CPU+GPU architectures, thus enabling Bayesian analysis of tens...

Fireflies
 Referenced in 1 article
 interactively exploring dynamical systems using GPU computing. In nonlinear systems, where explicit analytic solutions usually ... power of graphical processing unit (GPU) computing to produce spectacular interactive visualizations of arbitrary systems ... massively parallel nature of GPU hardware, Fireflies is able to simulate millions of trajectories ... parallel (even on standard desktop computer hardware), producing “swarms” of particles that move around...