OpenACC Use Case
From HP-SEE Wiki
Matrix Matrix multiplication
We briefly discuss the development effort needed to implement a simple matrix matrix multiplication algorithm using OpenACC directives on GPU resources and showcase timing and performance results obtained via several development approaches (simple algorithm for CPU, BLAS, CUDA, CuBLAS and OpenACC).