Efficient out-of-core GEMM solvers for GPUs
Downloaded 43048 times
The package contains efficient out-of-core solvers performing the following Matrix-Matrix multiplication on one or more GPUs.
C = alpha * op(A) * op(B) + beta * C
where A, B, C are dense matrices of sizes m*k, k*n, and m*n respectively.
Original Authors
Ziming Zhong
Vladimir Rychkov
Alexey Lastovetsky
Current Developers
Ravi Reddy Manumachu
Hamidreza Khaleghzadeh
Downloads
Attachment | Size |
---|---|
zzgemmooc-v1.0.tar.gz | 57 KB |
- 143895 reads