date: 2024-10-25
title: CNN
status: TOBECONTINUED
author:
- AllenYGY
tags:
- NOTE
- CNN
publish: True
CNN
2D Convolutional Kernel 原点在正中间
Height
Width
If the input data is an image of size
So, the output size is
Preserve Spatial Dimensions (Same Padding)
For example:
Prevent Information Loss at Borders
Without padding, the convolution operation does not consider the edges and corners of an image as much as the center regions. Padding ensures the borders are included in the computation, retaining more information.
Enable Deeper Networks
By maintaining consistent feature map sizes throughout the layers, padding allows for the design of deeper networks without rapidly shrinking the feature map size.
Control Output Size
Padding can be adjusted to produce desired output dimensions for specific applications (e.g., “same” padding or “valid” padding).
Improve Symmetry for Feature Extraction
Padding ensures that the kernel interacts symmetrically with all regions of the input image, which improves the extraction of features near edges.
Assume we have a matrix A with size
Assume we have a matrix
Assume we have a 8×8 matrix (image) B:
The Laplacian filter in 2D (for edge detection) is commonly represented as:
If we use the matrix to represent the Laplacian, the matrix is:
And the result is:
Part I: Convolution and image filtering
1. Comparing different filters?
2. Comparing different scales / size of filter?
3.Separability property of a filter / convolution?
4. Convolution and correlation? How
5. How to work on a kernel approximating a 1st, 2nd derivative?
What is Fourier Transform? What is the usage? How to calculate in 1D? 2D? Padding if necessary.
Convolution in image domain is equivalent to multiplication in frequency domain. Why? Verify?
8. How to obtain image pyramid? Gaussian, Laplacian, Steerable? Calculate? What are the usages / applications of them?