Tensara Logo

tensara

All Problems

2D Average Pooling

MEDIUM

Perform 2D average pooling on an input tensor:

output[i,j]=1k2m=0k1n=0k1input[Si+mP,Sj+nP]\text{output}[i,j] = \frac{1}{k^2}\sum_{m=0}^{k-1}\sum_{n=0}^{k-1} \text{input}[S \cdot i + m - P, S \cdot j + n - P]

The average pooling operation slides a window of size k×kk \times k over the input tensor with stride SS and padding PP, computing the average value within each window position.

Input:

  • Matrix input of size H×W\text{H} \times \text{W} (input tensor)
  • kernel_size (kk): Size of the pooling window
  • stride (SS): Step size between window positions
  • padding (PP): Number of zero-padding elements added on all sides

Output:

  • Matrix output of size Hout×Wout\text{H}_{\text{out}} \times \text{W}_{\text{out}} where: Hout=H+2PkS+1\text{H}_{\text{out}} = \left\lfloor\frac{\text{H} + 2P - k}{S} + 1\right\rfloor Wout=W+2PkS+1\text{W}_{\text{out}} = \left\lfloor\frac{\text{W} + 2P - k}{S} + 1\right\rfloor

Notes:

  • All matrices are stored in row-major order
  • Zero padding is applied when specified by the padding parameter
  • For values outside the input boundaries (after padding), use zero values in the average computation
  • The denominator (k2k^2) should always be the full kernel size, even when some elements are outside the input boundaries
  • This problem is adapted from KernelBench
Console

Sample Run Results

Hit "Run" to test your code with sample inputs

Loading...

Loading editor...

CUDA C++ environment

Desktop Required for Code Submission

For the best coding experience, please switch to a desktop device to write and submit your solution.