• MapReduce

  • Referenced in 248 articles [sw00546]
  • calculation over extremely large datasets. The arrival of MapReduce provides a chance to utilize commodity...
  • ParaView

  • Referenced in 154 articles [sw06128]
  • ParaView was developed to analyze extremely large datasets using distributed memory computing resources...
  • DistAl

  • Referenced in 99 articles [sw01746]
  • datamining and knowledge acquisition from large datasets. The paper presents results of experiments using several...
  • Pegasos

  • Referenced in 93 articles [sw08752]
  • especially suited for learning from large datasets. Our approach also extends to non-linear kernels ... algorithm is particularly well suited for large text classification problems, where we demonstrate an order...
  • FRK

  • Referenced in 90 articles [sw19172]
  • spatial/spatio-temporal modelling and prediction with large datasets. The approach, discussed in Cressie and Johannesson...
  • GOLEM

  • Referenced in 51 articles [sw24695]
  • mesh design. GOLEM copes efficiently with large datasets. It achieves this efficiency because it avoids...
  • RSVM

  • Referenced in 52 articles [sw15261]
  • little as 1% of a large dataset for its explicit evaluation. To generate this nonlinear...
  • CHomP

  • Referenced in 46 articles [sw09358]
  • through which the information hidden in large datasets can be reduced to compact algebraic expressions...
  • WebGraph

  • Referenced in 45 articles [sw30097]
  • provides simple ways to manage very large graphs, exploiting modern compression techniques. More precisely ... provide a high compression ratio (see our datasets). The algorithms are controlled by several parameters ... actually necessary. Algorithms for analysing very large graphs, such as HyperBall, which has been used ... experiment with various settings. Datasets for very large graphs (e.g., a billion links). These...
  • RainForest

  • Referenced in 17 articles [sw20993]
  • Fast Decision Tree Construction of Large Datasets. Classification of large datasets is an important data...
  • NetCDF

  • Referenced in 23 articles [sw04611]
  • Scalable. A small subset of a large dataset may be accessed efficiently. Appendable. Data...
  • MULTIMIX

  • Referenced in 34 articles [sw03250]
  • program is used to cluster a large medical dataset...
  • DataCutter

  • Referenced in 20 articles [sw09615]
  • Distributed processing of very large datasets with DataCutter. We describe a framework, called DataCutter, that...
  • Signal-CF

  • Referenced in 29 articles [sw26854]
  • particularly useful for the analysis of large-scale datasets. Signal-CF is freely available...
  • Dendroscope

  • Referenced in 12 articles [sw21230]
  • editing phylogenetic trees, for increasingly very large datasets, such as arise in expression analysis ... major operating systems. Although a large number of tree visualization tools are freely available, some ... phylogenetic trees, for both small and large datasets...
  • Hive

  • Referenced in 17 articles [sw25391]
  • software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure...
  • PseAAC-Builder

  • Referenced in 25 articles [sw22404]
  • some limitations in dealing with large-scale datasets. Here, we propose a new cross-platform...
  • APPL

  • Referenced in 14 articles [sw12861]
  • been used for decades to analyze large datasets or to perform mathematically intractable statistical methods...
  • BEDTools

  • Referenced in 11 articles [sw17302]
  • based methods is complicated by the massive datasets that are routinely produced with current sequencing ... allow the user to compare large datasets (e.g. next-generation sequencing data) with both public ... quickly answer intricate questions of large genomic datasets. Availability and implementation: BEDTools was written...
  • NUS-WIDE

  • Referenced in 20 articles [sw23017]
  • used for evaluation. Based on this dataset, we highlight characteristics of Web image collections ... learn effective models from sufficiently large image dataset to facilitate general image retrieval...