Tensara Logo

tensara

4D Tensor-Matrix Multiplication

cuda

T4

Submitted by

a-hamdi

5/5/2025, 5:33:56 PM

Accepted

RUNTIME

958.23ms

PERFORMANCE

251.74GFLOPS

TEST CASES

4/4passed

Benchmark Results

Test CaseRuntime (ms)GFLOPS
16x256x512x256 x 256x7683641.69226.45
8x128x256x128 x 128x512154.89221.85
32x64x128x64 x 64x25636.13238.07
4x32x64x32 x 32x1280.21320.58

Submitted Code