
LSDYNA
 Referenced in 309 articles
[sw03068]
 problems. Reduces customer costs by enabling massively parallel processing. Multicore processors have resulted ... DYNA, LSTC, continuously recodes existing algorithms and develops more efficient methodologies...

p4est
 Referenced in 81 articles
[sw06796]
 octrees. p4est is designed to work in parallel and scale to hundreds of thousands ... processor cores. We explain the concepts and algorithms behind p4est in this article...

SCASY
 Referenced in 12 articles
[sw03251]
 SCASY library software: Recursive blocked and parallel algorithms for Sylvestertype matrix equations with some ... loop nests of a singleelement algorithm so that the computations are performed on submatrices ... combine recursion and blocking. We consider parallelization of algorithms for reduced matrix equations ... reduced triangular systems. Parallelization of recursive blocked algorithms is done in two ways. The simplest...

FLAME
 Referenced in 38 articles
[sw00293]
 algorithms that are incorporated in the libraries. In combination with an extension of the parallel ... path from algorithm to MATLAB implementation to highperformance sequential implementation to parallel implementation. Finally...

NESL
 Referenced in 13 articles
[sw16627]
 various ideas from the theory community (parallel algorithms), the languages community (functional languages ... ideas behind NESL are Nested data parallelism: this feature offers the benefits of data parallelism ... debug, while being well suited for irregular algorithms, such as algorithms on trees, graphs ... NESL was to make parallel programming easy and portable. Algorithms are typically significantly more concise...

ALPS
 Referenced in 13 articles
[sw00036]
 ALPS), a framework for implementing scalable, parallel algorithms based on tree search. ALPS is specifically ... node in the search tree. Implementing such algorithms in a scalable manner is challenging both ... that supports the implementation of parallel branch and bound algorithms in which the bounds...

SPRINT
 Referenced in 37 articles
[sw11760]
 SPRINT: a scalable parallel classifier for data mining. Classification is an important data mining problem ... studied problem, most of the current classification algorithms require that all or a portion ... scalable. The algorithm has also been designed to be easily parallelized, allowing many processors ... build a single consistent model. This parallelization, also presented here, exhibits excellent scalability as well...

DibaP
 Referenced in 14 articles
[sw08343]
 diffusionbased multilevel algorithm for computing graph partitions. Graph partitioning requires the division ... sequential nature, KL is not easy to parallelize. Its use as a load balancer ... developed previously an inherently parallel algorithm, called BubbleFOS/C [H. Meyerhenke, B. Monien ... Schamberger, Accelerating shape optimizing load balancing for parallel FEM simulations by algebraic multigrid, in: Proceedings...

par2Dhp
 Referenced in 15 articles
[sw30807]
 work addresses parallelization of each stage of the automatic (hp)adaptive algorithm, including decomposition ... redistribution, a parallel frontal solver, and algorithms for parallel mesh refinement and mesh reconciliation...

Parsol
 Referenced in 20 articles
[sw00684]
 semiautomatic parallelisation of dataparallel (especially linear algebra) algorithms. It is written ... debugs it. Once done, the parallel version of the algorithm is created by substituting some...

SPIKE
 Referenced in 35 articles
[sw02780]
 linear solver SPIKE is proposed as a parallel environment for solving banded systems that ... sparse within the band. The SPIKE algorithm is a domain decomposition technique that allows performing ... architecture of the highend parallel computing platform. Numerical experiments are presented that demonstrate ... effectiveness of our parallel scheme. Comparison with the corresponding algorithms of ScaLAPACK are also provided...

DDESpecialSolutions
 Referenced in 35 articles
[sw12343]
 steps of the algorithm. Through discussion and example, parallels are drawn to the tanhmethod...

GraphLab
 Referenced in 22 articles
[sw12830]
 implementing efficient, provably correct parallel machine learning (ML) algorithms is challenging. Existing highlevel parallel ... like MapReduce by compactly expressing asynchronous iterative algorithms with sparse computational dependencies while ensuring data ... consistency and achieving a high degree of parallel performance. We demonstrate the expressiveness...

OSIRIS
 Referenced in 29 articles
[sw02458]
 system dependent code and the parallelization of the algorithms involved. We also discuss the implementation...

RAxML
 Referenced in 33 articles
[sw07716]
 present a nondeterministic parallel implementation of our algorithm which in some cases yields super...

ParaDisEO
 Referenced in 42 articles
[sw01948]
 features including evolutionary algorithms (EA), local searches (LS), the most common parallel and distributed models...

slimgb
 Referenced in 18 articles
[sw00878]
 ordering. Further key features of the algorithm are parallel reductions, exchanging members of the generating ... extended version of the product criterion. The algorithm is very flexible, the strategy is controlled...

BDDC
 Referenced in 27 articles
[sw07232]
 Parallel implementation of multilevel BDDC In application of the Balancing Domain Decomposition by Constraints (BDDC ... role of elements. In this way, the algorithm of threelevel BDDC method is obtained ... description of a recently developed parallel implementation of this algorithm. The implementation is applied...

HOGWILD
 Referenced in 42 articles
[sw28396]
 Parallelizing Stochastic Gradient Descent. Stochastic Gradient Descent (SGD) is a popular algorithm that can achieve ... Several researchers have recently proposed schemes to parallelize SGD, but all require performancedestroying memory ... aims to show using novel theoretical analysis, algorithms, and implementation that SGD can be implemented...

PNFFT
 Referenced in 8 articles
[sw07583]
 Parallel threedimensional nonequispaced fast Fourier transforms and their application to particle simulation Starting from ... serial algorithm, we develop a new parallel algorithm for calculating nonequispaced fast Fourier transforms ... massively parallel distributed memory architectures. We demonstrate how to deal with the inherent load imbalance ... Furthermore, we derive a new parallel distributed memory algorithm for the fast computation of fully...