tensara

Optimize. Benchmark. Repeat.

A platform for GPU programming challenges. Write efficient GPU kernels and compare your solutions with other developers.

Want to submit from your IDE? Check out our CLI tool

vector-add.cu

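#include <cuda_runtime.h>

// Element-wise vector addition: C[i] = A[i] + B[i] for 0 <= i < N.
__global__ void vectorAdd(const float* A, const float* B, float* C, int N) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;  // global thread index
    if (i < N) {                                    // skip threads past the end
        C[i] = A[i] + B[i];
    }
}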

Why Tensara?

Tensara provides a unique platform for honing your GPU programming skills through competitive challenges and detailed benchmarking.

Real Hardware Benchmarking

Submissions are run on standardized GPU hardware for fair and accurate performance comparisons.

Competitive Leaderboards

See how your solutions stack up against others on detailed leaderboards for each problem.

Community & Collaboration

Discuss strategies, share insights, and learn from fellow GPU programming enthusiasts.

Powered by Modal

Latest Activity

Fresh PRs, new problems, and recent community submissions.

Ready to Optimize?

Dive into our GPU programming challenges, submit your kernels, and climb the leaderboards.

Tensara

GPU Programming Challenges & Benchmarking Platform

© 2026 Tensara. All rights reserved.

Metric Migration

We're migrating our primary metric from FLOPS to runtime. Details on the changes will be posted soon.