A platform for GPU programming challenges. Write efficient GPU kernels and compare your solutions with other developers.
Want to submit from your IDE? Check out our CLI tool.
vector-add.cu
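#include <cuda_runtime.h>
// Minimal sketch of the kernel body (assumed): each thread adds one element pair.
__global__ void vectorAdd(const float* A, const float* B, float* C, int N) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;  // global thread index
    if (i < N) C[i] = A[i] + B[i];                  // skip threads past the end
}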
Tensara provides a unique platform for honing your GPU programming skills through competitive challenges and detailed benchmarking.
Submissions are run on standardized GPU hardware for fair and accurate performance comparisons.
See how your solutions stack up against others on detailed leaderboards for each problem.
Discuss strategies, share insights, and learn from fellow GPU programming enthusiasts.
Added support for Mojo submissions. (3 days ago)
New rating system for user rankings. (2 weeks ago)
Added support for Triton-based kernel submissions. (3 weeks ago)

Fully integrated CLI tool for Tensara. (1 hour ago)
Allows direct submissions via CLI. (5 hours ago)
Initial release of the Tensara CLI. (1 month ago)

Image processing problems in progress. (1 day ago)
New set of convolution challenges available. (1 week ago)
Added new matrix multiplication problems. (2 weeks ago)
© 2025 Tensara. All rights reserved.