Tensara Logo

tensara

Optimize
Benchmark
Repeat

A platform for GPU programming challenges. Write efficient GPU kernels and compare your solutions with other developers.

Need help getting started? Join our community on Discord

vector-add.cu

#include <cuda_runtime.h>

__global__ void vectorAdd(const float* A, const float* B, float* C, int N) {

}

Why Tensara?

Tensara provides a unique platform for honing your GPU programming skills through competitive challenges and detailed benchmarking.

Real Hardware Benchmarking

Submissions are run on standardized GPU hardware for fair and accurate performance comparisons.

Competitive Leaderboards

See how your solutions stack up against others on detailed leaderboards for each problem.

Community & Collaboration

Discuss strategies, share insights, and learn from fellow GPU programming enthusiasts.

Latest Updates

Core Platform

Rating System

FEATURE

New rating system for user rankings.

2 weeks ago

Triton Kernel Support

FEATURE

Added support for Triton-based kernel submissions.

3 weeks ago

Error Handling

IMPROVEMENT

Improved error handling and rate limiting.

3 weeks ago

CLI Tool

CLI Submissions

IN PROGRESS

Working on allowing direct submissions via CLI.

1 week ago

CLI v0.1 Release

RELEASE

Initial release of the Tensara CLI.

1 month ago

Local Benchmarking

IMPROVEMENT

Improved local benchmarking accuracy.

1 month ago

Problems

Convolution Problems

FEATURE

New set of convolution challenges available.

1 week ago

3D/4D Tensor Matmul Problems

FEATURE

Added new matrix multiplication problems.

2 weeks ago

Problem Difficulty Tags

IMPROVEMENT

Added difficulty tags to problems.

2 weeks ago

Ready to Optimize?

Dive into our GPU programming challenges, submit your kernels, and climb the leaderboards.

Tensara

GPU Programming Challenges & Benchmarking Platform

Navigate

ProblemsLeaderboardsContestsBlog

© 2025 Tensara. All rights reserved.