Implement Batch Normalization over the batch dimension (B) for each feature channel in a 4D tensor.
The formula for Batch Normalization is:
y = (x − E[x]) / √(Var[x] + ϵ)
where the mean E[x] and variance Var[x] are computed over the batch dimension (B) for each feature channel independently. ϵ is a small value added to the variance for numerical stability.
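As a quick numeric illustration of the formula (a sketch using NumPy; the sample values are invented), normalizing two batch samples of a single feature:

```python
import numpy as np

x = np.array([1.0, 3.0])  # two batch samples of one feature
mean = x.mean()           # E[x] = 2.0
var = x.var()             # Var[x] = 1.0 (population variance)
eps = 1e-5
y = (x - mean) / np.sqrt(var + eps)
# y is approximately [-1.0, 1.0]: zero mean, unit variance
```

The normalized values have mean 0 and variance very close to 1 (slightly less, because of the ϵ in the denominator).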
Input:
Tensor X of shape (B,F,D1,D2) (input data)
Epsilon ϵ (a small float, typically 1e-5)
Output:
Tensor Y of shape (B,F,D1,D2) (normalized data)
Notes:
Compute the mean and variance across the batch dimension B only, independently for each feature channel F and each spatial location (D1,D2); the resulting statistics therefore have shape (F,D1,D2).
Use ϵ = 1e-5.
For simplicity, this implementation focuses on the core normalization without learnable parameters (gamma and beta) and without tracking running statistics.
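A minimal NumPy sketch of the specification above (the function name `batch_norm` is my own choice, not given by the problem). Per the notes, statistics are reduced over the batch axis only, so each feature channel and each spatial location gets its own mean and variance:

```python
import numpy as np

def batch_norm(X: np.ndarray, eps: float = 1e-5) -> np.ndarray:
    """Normalize X of shape (B, F, D1, D2) over the batch axis.

    Mean and variance are computed over axis 0 only, independently
    for each (F, D1, D2) position, with no learnable gamma/beta and
    no running statistics, as the notes specify.
    """
    mean = X.mean(axis=0, keepdims=True)  # shape (1, F, D1, D2)
    var = X.var(axis=0, keepdims=True)    # population variance
    return (X - mean) / np.sqrt(var + eps)
```

Keeping `keepdims=True` lets NumPy broadcasting subtract the (1, F, D1, D2) statistics from the (B, F, D1, D2) input directly. After normalization, Y has approximately zero mean and unit variance along the batch axis at every position.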