OpenACC Use Case
From HP-SEE Wiki
Matrix Matrix multiplication
We briefly discuss the development effort needed to implement a simple matrix matrix multiplication algorithm using OpenACC directives on GPU resources and showcase timing and performance results obtained via several development approaches (simple algorithm for CPU, usage of BLAS/LAPACK libraries, CUDA, CuBLAS and OpenACC).