• MapReduce

  • Referenced in 253 articles [sw00546]
  • calculation over extremely large datasets. The arrival of MapReduce provides a chance to utilize commodity...
  • ParaView

  • Referenced in 177 articles [sw06128]
  • ParaView was developed to analyze extremely large datasets using distributed memory computing resources...
  • Pegasos

  • Referenced in 98 articles [sw08752]
  • especially suited for learning from large datasets. Our approach also extends to non-linear kernels ... algorithm is particularly well suited for large text classification problems, where we demonstrate an order...
  • DistAl

  • Referenced in 99 articles [sw01746]
  • data mining and knowledge acquisition from large datasets. The paper presents results of experiments using several...
  • FRK

  • Referenced in 93 articles [sw19172]
  • spatial/spatio-temporal modelling and prediction with large datasets. The approach, discussed in Cressie and Johannesson...
  • GOLEM

  • Referenced in 52 articles [sw24695]
  • mesh design. GOLEM copes efficiently with large datasets. It achieves this efficiency because it avoids...
  • RSVM

  • Referenced in 53 articles [sw15261]
  • little as 1% of a large dataset for its explicit evaluation. To generate this nonlinear...
  • CHomP

  • Referenced in 46 articles [sw09358]
  • through which the information hidden in large datasets can be reduced to compact algebraic expressions...
  • LabelMe

  • Referenced in 44 articles [sw36633]
  • annotation tool, we have collected a large dataset that spans many object categories, often containing...
  • WebGraph

  • Referenced in 48 articles [sw30097]
  • provides simple ways to manage very large graphs, exploiting modern compression techniques. More precisely ... provide a high compression ratio (see our datasets). The algorithms are controlled by several parameters ... actually necessary. Algorithms for analysing very large graphs, such as HyperBall, which has been used ... experiment with various settings. Datasets for very large graphs (e.g., a billion links). These...
  • RainForest

  • Referenced in 17 articles [sw20993]
  • Fast Decision Tree Construction of Large Datasets. Classification of large datasets is an important data...
  • NetCDF

  • Referenced in 24 articles [sw04611]
  • Scalable. A small subset of a large dataset may be accessed efficiently. Appendable. Data...
  • MULTIMIX

  • Referenced in 34 articles [sw03250]
  • program is used to cluster a large medical dataset...
  • DataCutter

  • Referenced in 20 articles [sw09615]
  • Distributed processing of very large datasets with DataCutter. We describe a framework, called DataCutter, that...
  • Signal-CF

  • Referenced in 30 articles [sw26854]
  • particularly useful for the analysis of large-scale datasets. Signal-CF is freely available...
  • Dendroscope

  • Referenced in 12 articles [sw21230]
  • editing phylogenetic trees, for increasingly large datasets, such as arise in expression analysis ... major operating systems. Although a large number of tree visualization tools are freely available, some ... phylogenetic trees, for both small and large datasets...
  • Hive

  • Referenced in 16 articles [sw25391]
  • software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure...
  • MatConvNet

  • Referenced in 16 articles [sw15651]
  • allowing complex models to be trained on large datasets such as ImageNet ILSVRC containing millions...
  • PseAAC-Builder

  • Referenced in 25 articles [sw22404]
  • some limitations in dealing with large-scale datasets. Here, we propose a new cross-platform...
  • BEDTools

  • Referenced in 12 articles [sw17302]
  • based methods is complicated by the massive datasets that are routinely produced with current sequencing ... allow the user to compare large datasets (e.g. next-generation sequencing data) with both public ... quickly answer intricate questions of large genomic datasets. Availability and implementation: BEDTools was written...