  • LS-DYNA

  • Referenced in 309 articles [sw03068]
  • problems. Reduces customer costs by enabling massively parallel processing. Multicore processors have resulted ... DYNA, LSTC, continuously recodes existing algorithms and develops more efficient methodologies...
  • p4est

  • Referenced in 81 articles [sw06796]
  • octrees. p4est is designed to work in parallel and scale to hundreds of thousands ... processor cores. We explain the concepts and algorithms behind p4est in this article...
  • SCASY

  • Referenced in 12 articles [sw03251]
  • SCASY library software: Recursive blocked and parallel algorithms for Sylvester-type matrix equations with some ... loop nests of a single-element algorithm so that the computations are performed on submatrices ... combine recursion and blocking. We consider parallelization of algorithms for reduced matrix equations ... reduced triangular systems. Parallelization of recursive blocked algorithms is done in two ways. The simplest...
  • FLAME

  • Referenced in 38 articles [sw00293]
  • algorithms that are incorporated in the libraries. In combination with an extension of the parallel ... path from algorithm to MATLAB implementation to high-performance sequential implementation to parallel implementation. Finally...
  • NESL

  • Referenced in 13 articles [sw16627]
  • various ideas from the theory community (parallel algorithms), the languages community (functional languages ... ideas behind NESL are Nested data parallelism: this feature offers the benefits of data parallelism ... debug, while being well suited for irregular algorithms, such as algorithms on trees, graphs ... NESL was to make parallel programming easy and portable. Algorithms are typically significantly more concise...
  • ALPS

  • Referenced in 13 articles [sw00036]
  • ALPS), a framework for implementing scalable, parallel algorithms based on tree search. ALPS is specifically ... node in the search tree. Implementing such algorithms in a scalable manner is challenging both ... that supports the implementation of parallel branch and bound algorithms in which the bounds...
  • SPRINT

  • Referenced in 37 articles [sw11760]
  • SPRINT: a scalable parallel classifier for data mining. Classification is an important data mining problem ... studied problem, most of the current classification algorithms require that all or a portion ... scalable. The algorithm has also been designed to be easily parallelized, allowing many processors ... build a single consistent model. This parallelization, also presented here, exhibits excellent scalability as well...
  • DibaP

  • Referenced in 14 articles [sw08343]
  • diffusion-based multilevel algorithm for computing graph partitions. Graph partitioning requires the division ... sequential nature, KL is not easy to parallelize. Its use as a load balancer ... developed previously an inherently parallel algorithm, called Bubble-FOS/C [H. Meyerhenke, B. Monien ... Schamberger, Accelerating shape optimizing load balancing for parallel FEM simulations by algebraic multigrid, in: Proceedings...
  • par2Dhp

  • Referenced in 15 articles [sw30807]
  • work addresses parallelization of each stage of the automatic (hp)-adaptive algorithm, including decomposition ... redistribution, a parallel frontal solver, and algorithms for parallel mesh refinement and mesh reconciliation...
  • Parsol

  • Referenced in 20 articles [sw00684]
  • semi-automatic parallelisation of data-parallel (especially linear algebra) algorithms. It is written ... debugs it. Once done, the parallel version of the algorithm is created by substituting some...
  • SPIKE

  • Referenced in 35 articles [sw02780]
  • linear solver SPIKE is proposed as a parallel environment for solving banded systems that ... sparse within the band. The SPIKE algorithm is a domain decomposition technique that allows performing ... architecture of the high-end parallel computing platform. Numerical experiments are presented that demonstrate ... effectiveness of our parallel scheme. Comparisons with the corresponding algorithms of ScaLAPACK are also provided...
  • DDESpecialSolutions

  • Referenced in 35 articles [sw12343]
  • steps of the algorithm. Through discussion and example, parallels are drawn to the tanh-method...
  • GraphLab

  • Referenced in 22 articles [sw12830]
  • implementing efficient, provably correct parallel machine learning (ML) algorithms is challenging. Existing high-level parallel ... like MapReduce by compactly expressing asynchronous iterative algorithms with sparse computational dependencies while ensuring data ... consistency and achieving a high degree of parallel performance. We demonstrate the expressiveness...
  • OSIRIS

  • Referenced in 29 articles [sw02458]
  • system dependent code and the parallelization of the algorithms involved. We also discuss the implementation...
  • RAxML

  • Referenced in 33 articles [sw07716]
  • present a non-deterministic parallel implementation of our algorithm which in some cases yields super...
  • ParaDisEO

  • Referenced in 42 articles [sw01948]
  • features including evolutionary algorithms (EA), local searches (LS), the most common parallel and distributed models...
  • slimgb

  • Referenced in 18 articles [sw00878]
  • ordering. Further key features of the algorithm are parallel reductions, exchanging members of the generating ... extended version of the product criterion. The algorithm is very flexible, the strategy is controlled...
  • BDDC

  • Referenced in 27 articles [sw07232]
  • Parallel implementation of multilevel BDDC. In application of the Balancing Domain Decomposition by Constraints (BDDC ... role of elements. In this way, the algorithm of three-level BDDC method is obtained ... description of a recently developed parallel implementation of this algorithm. The implementation is applied...
  • HOGWILD

  • Referenced in 42 articles [sw28396]
  • Parallelizing Stochastic Gradient Descent. Stochastic Gradient Descent (SGD) is a popular algorithm that can achieve ... Several researchers have recently proposed schemes to parallelize SGD, but all require performance-destroying memory ... aims to show using novel theoretical analysis, algorithms, and implementation that SGD can be implemented...
  • PNFFT

  • Referenced in 8 articles [sw07583]
  • Parallel three-dimensional nonequispaced fast Fourier transforms and their application to particle simulation. Starting from ... serial algorithm, we develop a new parallel algorithm for calculating nonequispaced fast Fourier transforms ... massively parallel distributed memory architectures. We demonstrate how to deal with the inherent load imbalance ... Furthermore, we derive a new parallel distributed memory algorithm for the fast computation of fully...