
MapReduce
 Referenced in 267 articles
[sw00546]
 model initially developed for largescale web content processing. Data analysis meets the issue ... calculation over extremely large datasets. The arrival of MapReduce provides a chance ... utilize commodity hardware for massively parallel data analysis applications. The translation and optimization from relational...

ROOT
 Referenced in 55 articles
[sw06817]
 Object Oriented framework for large scale data analysis. ROOT written in C++, contains, among others ... database, a C++ interpreter, advanced statistical analysis (multidimensional histogramming, fitting, minimization, cluster finding algorithms ... language is C++ (using the interpreter) and large scripts can be compiled and dynamically linked...

clusfind
 Referenced in 476 articles
[sw27805]
 analysis. The programs are described and illustrated in the book ”Finding Groups in Data ... computes dissimilarities); Chapter 2: PAM.FOR (partitions the data set into clusters with a new method ... using medoids); Chapter 3: CLARA.FOR (for clustering large applications); Chapter 4: FANNY.FOR (a new method...

EnKF
 Referenced in 441 articles
[sw02066]
 Filter The EnKF is a sophisticated sequental data assimilation method. It applies an ensemble ... forward in time, and it uses an analysis scheme which operates directly on the ensemble ... efficiently handle strongly nonlinear dynamics and large state spaces and is now used in realistic...

Scilab
 Referenced in 175 articles
[sw00834]
 graphical functions. A large number of functionalities is included in Scilab: Maths & Simulation: For usual ... science applications including mathematical operations and data analysis. 2D & 3D Visualization: Graphics functions...

ParaView
 Referenced in 233 articles
[sw06128]
 analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using ... qualitative and quantitative techniques. The data exploration can be done interactively in 3D or programmatically ... capabilities. ParaView was developed to analyze extremely large datasets using distributed memory computing resources...

SNAP
 Referenced in 179 articles
[sw04184]
 purpose, high performance system for analysis and manipulation of large networks. Graphs consists of nodes ... graph nodes. Networks are graphs with data on nodes and/or edges of the network...

GloptiPoly
 Referenced in 329 articles
[sw04343]
 mathematics such as algebra, Fourier analysis, functional analysis, operator theory, probability and statistics, to cite ... short formulation, the GPM has a large number of important applications in various fields such ... handle moment problems with polynomial data. Many important applications in e.g. optimization, probability, financial economics...

Healpix
 Referenced in 79 articles
[sw08860]
 HighResolution Discretization and Fast Analysis of Data Distributed on the Sphere. HEALPix—the Hierarchical ... versatile structure for the pixelization of data on the sphere. An associated library of computational ... large volumes of astronomical data. Originally developed to address the data processing and analysis needs...

KONECT
 Referenced in 64 articles
[sw17480]
 their analysis. In the cited areas, a surprisingly large number of very heterogeneous data ... Matlab toolbox for network analysis and (3) a website giving a compact overview the various...

mixOmics
 Referenced in 29 articles
[sw09508]
 joint analysis. However, mixOmics can also be applied to any other large data sets where ... version of CCA to deal with the large number of variables. sPLS allows variable selection ... frameworks are proposed: regression and canonical analysis. Numerous graphical outputs are provided to help interpreting ... Analysis, Independent Principal Component Analysis and multilevel analysis using variance decomposition of the data...

MEGAN
 Referenced in 17 articles
[sw33189]
 MEGAN analysis of metagenomic data. Metagenomics is the study of the genomic content ... known sequences. Most published studies use the analysis of pairedend reads, complete sequences ... computer program that allows laptop analysis of large metagenomic data sets. In a preprocessing step...

PLINK
 Referenced in 68 articles
[sw04581]
 designed to perform a range of basic, largescale analyses in a computationally efficient manner ... PLINK is purely on analysis of genotype/phenotype data, so there is no support for steps...

Mixmod
 Referenced in 37 articles
[sw06991]
 data set for the purposes of density estimation, clustering or discriminant analysis. A large variety ... sensible maximum for the likelihood (or completedata likelihood) function. MIXMOD is currently intended ... suitability depending on the particular perspective (cluster analysis or discriminant analysis). Written in C++, MIXMOD...

raster
 Referenced in 37 articles
[sw08287]
 data analysis and modeling. Reading, writing, manipulating, analyzing and modeling of gridded spatial data ... highlevel functions. Processing of very large files is supported...

SHOGUN
 Referenced in 104 articles
[sw03517]
 called SHOGUN, which is designed for unified largescale learning for a broad range ... Markov models, multiple kernel learning, linear discriminant analysis, and more. Most of the specific algorithms ... able to deal with several different data classes. We have used this toolbox in several...

LSSVMlab
 Referenced in 26 articles
[sw07367]
 spectral clustering, data visualization and dimensionality reduction, and survival analysis. For very large scale problems...

ROOT
 Referenced in 5 articles
[sw20052]
 container is optimized for statistical data analysis over very large data sets by using vertical ... storage techniques. These containers can span a large number of files on local disks ... file systems. In order to analyze this data, the user can chose ... minimization, and various methods for performing regression analysis (fitting). In particular, the RooFit package allows...

ShapeNet
 Referenced in 17 articles
[sw35059]
 Repository. We present ShapeNet: a richlyannotated, largescale repository of shapes represented ... object attributes, promote datadriven geometric analysis, and provide a largescale quantitative benchmark...

yEd
 Referenced in 10 articles
[sw04417]
 external data for analysis. Our automatic layout algorithms arrange even large data sets with just...