OpenACC Use Case

From HP-SEE Wiki

Revision as of 13:12, 15 July 2013 by Pkoro (Talk | contribs)
Matrix-matrix multiplication

We briefly discuss the development effort needed to implement a simple matrix-matrix multiplication algorithm on GPU resources using OpenACC directives, and present timing and performance results for several development approaches: a simple CPU algorithm, the BLAS/LAPACK libraries, CUDA, cuBLAS, and OpenACC.
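To give a sense of the development effort involved, here is a minimal sketch of what an OpenACC version of the simple algorithm might look like. It is not the code used for the measurements on this page. The function name matmul_acc, the single-precision type, the square n x n shape, and the row-major flat-array layout are all assumptions made for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: computes C = A * B for square n x n matrices
 * stored row-major in flat arrays. Name and layout are assumptions. */
void matmul_acc(int n, const float *restrict a,
                const float *restrict b, float *restrict c)
{
    assert(a != NULL && b != NULL && c != NULL);

    /* Copy A and B to the device, copy C back, and spread the
     * (i, j) iteration space across the accelerator. */
    #pragma acc parallel loop collapse(2) \
        copyin(a[0:n*n], b[0:n*n]) copyout(c[0:n*n])
    for (int i = 0; i < n; ++i) {
        for (int j = 0; j < n; ++j) {
            float sum = 0.0f;
            /* Each (i, j) element accumulates its dot product serially. */
            #pragma acc loop seq
            for (int k = 0; k < n; ++k) {
                sum += a[i * n + k] * b[k * n + j];
            }
            c[i * n + j] = sum;
        }
    }
}
```

The loop nest is the same as in the plain CPU version, and only the directives are new. A compiler without OpenACC support ignores the pragmas and produces an ordinary serial build. With an OpenACC-capable compiler, the directives are enabled by a flag such as -fopenacc (GCC) or -acc (NVIDIA HPC SDK).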
