hiCUDA: High-Level GPGPU Programming.

This project aims to create a high-level interface for GPGPU programming. Specifically, we have defined a directive-based language called hiCUDA (for high-level CUDA) for programming NVIDIA GPUs. It gives the programmer high-level abstractions for the main tasks of GPU programming, namely identifying and extracting GPU computation and managing GPU memory, and these abstractions are applied directly to the original sequential code. More importantly, the use of hiCUDA directives makes it easier to experiment with different ways of carrying out these tasks.

Along with the language, we have designed and implemented a prototype source-to-source compiler that translates a hiCUDA program (i.e., a sequential C program with hiCUDA directives) into an equivalent CUDA program. A hiCUDA program can therefore be compiled to a binary using the existing CUDA compiler toolchain from NVIDIA.

There are two aspects of hiCUDA we would like to evaluate. The first is performance, i.e., how much slower a hiCUDA program runs than a hand-written CUDA version of the same algorithm. Using seven CUDA benchmarks (most of them from the Parboil suite developed at UIUC), we found that the performance of the compiler-generated CUDA code is very close to that of the hand-written version, even though we had to modify the sequential program so that it implements the same algorithm as the CUDA version. This result encourages us to share the hiCUDA language and its compiler support with the GPGPU programming community, and leads to the second aspect of evaluation: usability. We welcome you to try hiCUDA and give us feedback so that we can improve both the language design and the compiler implementation.
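To illustrate what "a sequential C program with hiCUDA directives" can look like, here is a minimal sketch of a matrix multiplication annotated with directives. The directive spellings (`global alloc`, `kernel`, `loop_partition`, `kernel_end`, `global copyout`, `global free`) are assumed to follow the matrix-multiplication example in the hiCUDA publication and should be verified against the hiCUDA documentation before use. The array sizes, the thread-block geometry, and the helper `fill_inputs` are hypothetical and chosen only for illustration. A standard C compiler ignores the unknown pragmas, so the code also runs as plain sequential C.

```c
#include <stddef.h>

/* Illustrative sizes only (assumption, not from the hiCUDA distribution). */
float A[64][128];
float B[128][32];
float C[64][32];

/* Hypothetical helper: fill both input matrices with constant values. */
void fill_inputs(float a_val, float b_val)
{
    size_t i, j;
    for (i = 0; i < 64; ++i)
        for (j = 0; j < 128; ++j)
            A[i][j] = a_val;
    for (i = 0; i < 128; ++i)
        for (j = 0; j < 32; ++j)
            B[i][j] = b_val;
}

/* Sequential matrix multiply C = A * B, annotated with hiCUDA-style
 * directives. Directive syntax is an assumption; check it against the
 * hiCUDA documentation. Without a hiCUDA compiler, the pragmas are
 * ignored and the loops run sequentially on the CPU. */
void matrix_mul(void)
{
    int i, j, k;

    /* Data directives: allocate GPU memory and copy the inputs in. */
#pragma hicuda global alloc A[*][*] copyin
#pragma hicuda global alloc B[*][*] copyin
#pragma hicuda global alloc C[*][*]

    /* Computation directives: mark the kernel region and its geometry
     * (4x2 thread blocks of 16x16 threads covers the 64x32 output). */
#pragma hicuda kernel matrixMul tblock(4,2) thread(16,16)

#pragma hicuda loop_partition over_tblock over_thread
    for (i = 0; i < 64; ++i) {
#pragma hicuda loop_partition over_tblock over_thread
        for (j = 0; j < 32; ++j) {
            float sum = 0.0f;
            for (k = 0; k < 128; ++k)
                sum += A[i][k] * B[k][j];
            C[i][j] = sum;
        }
    }

#pragma hicuda kernel_end

    /* Copy the result back to the host and release GPU memory. */
#pragma hicuda global copyout C[*][*]
#pragma hicuda global free A B C
}
```

Because the directives sit on top of unmodified sequential loops, changing the thread-block geometry or the data-transfer points only requires editing pragmas, which is the kind of experimentation the description refers to.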
References in zbMATH (referenced in 6 articles)
- Andon, F. I.; Doroshenko, A. E.; Beketov, A. G.; Iovchev, V. A.; Yatsenko, E. A.: Software tools for automation of parallel programming on the basis of algebra of algorithms (2015)
- Liao, Xiang-Ke; Yang, Can-Qun; Tang, Tao; Yi, Hui-Zhan; Wang, Feng; Wu, Qiang; Xue, Jingling: OpenMC: towards simplifying programming for TianHe supercomputers (2014)
- Lowell, Daniel; Godwin, Jeswin; Holewinski, Justin; Karthik, Deepan; Choudary, Chekuri; Mametjanov, Azamat; Norris, Boyana; Sabin, Gerald; Sadayappan, P.; Sarich, Jason: Stencil-aware GPU optimization of iterative solvers (2013)
- Bakhtin, V. A.; Klinov, M. S.; Krukov, V. A.; Podderyugina, N. V.; Pritula, M. N.; Sazanov, Yu. L.: Extension of DVM parallel programming model for clusters with heterogeneous nodes (2012)
- Yang, Xuejun; Tang, Tao; Wang, Guibin; Jia, Jia; Xu, Xinhai: MPtostream: an OpenMP compiler for CPU-GPU heterogeneous parallel systems (2012)
- Andon, P. I.; Doroshenko, A. Yu.; Zhereb, K. A.: Programming high-performance parallel computations: formal models and graphics processing units (2011)