PowerGraph: Distributed graph-parallel computation on natural graphs. Large-scale graph-structured computation is central to tasks ranging from targeted advertising to natural language processing and has led to the development of several graph-parallel abstractions including Pregel and GraphLab. However, the natural graphs commonly found in the real-world have highly skewed power-law degree distributions, which challenge the assumptions made by these abstractions, limiting performance and scalability. In this paper, we characterize the challenges of computation on natural graphs in the context of existing graphparallel abstractions. We then introduce the PowerGraph abstraction which exploits the internal structure of graph programs to address these challenges. Leveraging the PowerGraph abstraction we introduce a new approach to distributed graph placement and representation that exploits the structure of power-law graphs. We provide a detailed analysis and experimental evaluation comparing PowerGraph to two popular graph-parallel systems. Finally, we describe three different implementation strategies for PowerGraph and discuss their relative merits with empirical evaluations on large-scale real-world problems demonstrating order of magnitude gains.
Keywords for this software
References in zbMATH (referenced in 6 articles )
Showing results 1 to 6 of 6.
- Brandt, Sebastian; Wattenhofer, Roger: Approximating small balanced vertex separators in almost linear time (2019)
- Morihata, Akimasa; Emoto, Kento; Matsuzaki, Kiminori; Hu, Zhenjiang; Iwasaki, Hideya: Optimizing declarative parallel distributed graph processing by using constraint solvers (2018)
- Hong, Jihye; Park, Kisung; Han, Yongkoo; Rasel, Mostofa Kamal; Vonvou, Dawanga; Lee, Young-Koo: Disk-based shortest path discovery using distance index over large dynamic graphs (2017)
- Slota, George M.; Madduri, Kamesh; Rajamanickam, Sivasankaran: Complex network partitioning using label propagation (2016)
- Christopher R. Aberger, Susan Tu, Kunle Olukotun, Christopher Ré: EmptyHeaded: A Relational Engine for Graph Processing (2015) arXiv
- Mahani, Alireza S.; Sharabiani, Mansour T. A.: SIMD parallel MCMC sampling with applications for big-data Bayesian analytics (2015)