we believe that we don’t need to look far away from the center location to access what is going on. V is referred to as a convolution kernel, a filter, or simply the layer’s weights that are often learnable parameters.