Content abstract

This paper will introduce the generation process of neural network. By generating a neural network [2, 5, 3, 1], the calculation process of each step will be explained in detail. [2, 5, 3, 1] represents: two inputs, 5 neurons in the first layer, 3 neurons in the second layer and 1 neuron in the third layer.

  • Generate input data
Np.random. Seed (1) # Draw a circle train_X, train_Y = sklearn.datasets. Make_circles (n_samples=300, noise=.05) # Visualize the data plt.scatter(train_X[:, 0], train_X[:, 1], c=train_Y, s=40, cmap=plt.cm.Spectral);Copy the code

Let’s assume that the input is a point (x1,y1). There are 300 points, which are randomly distributed elliptically. These points form a matrix of all the input parameters, which we’ll call your X.



The X generated here is a 300 by 2 matrix. As follows:



Transpose X, you get a 2 by 300 matrix, as follows

  • The entire network weight is initialized to 0
def initialize_parameters_zeros(layers_dims): """ Arguments: layer_dims -- python array (list) containing the size of each layer. Returns: parameters -- python dictionary containing your parameters "W1", "b1", ... , "WL", "bL": W1 -- weight matrix of shape (layers_dims[1], layers_dims[0]) b1 -- bias vector of shape (layers_dims[1], 1) ... WL -- weight matrix of shape (layers_dims[L], layers_dims[L-1]) bL -- bias vector of shape (layers_dims[L], 1) "" parameters = {layers_dims = len(layers_dims) # number of layers in the network "" "" for l in range(1, l): parameters['W' + str(l)] = np.zeros((layers_dims[l], Layers_dims [1])) parameters['b' + STR (l)] = 0 layers_dims = [x.shape [0] 1] #[2, 5, 3, 1] parameters = initialize_parameters_zeros(layers_dims)Copy the code

At this point we have initialized the weights of each layer to zero. Now let’s analyze the weight details of each layer.

The first layer w1:5 by 2 matrix

B1:5*1 column vectors

The second layer w2:3 *5 matrix

B2:3 * 1

The third layer w3:1 *3 matrix

b3 : 0

Forward propagation calculation process

  • Propagated code and comments forward
def forward_propagation(X, parameters): """ Implements the forward propagation (and computes the loss) presented in Figure 2. Arguments: X -- input dataset, of shape (input size, number of examples) Y -- true "label" vector (containing 0 if cat, 1 if non-cat) parameters -- python dictionary containing your parameters "W1", "b1", "W2", "b2", "W3", "b3": W1 -- weight matrix of shape () b1 -- bias vector of shape () W2 -- weight matrix of shape () b2 -- bias vector of shape  () W3 -- weight matrix of shape () b3 -- bias vector of shape () Returns: loss -- the loss function (vanilla logistic loss) """ # retrieve parameters W1 = parameters["W1"] b1 = parameters["b1"] W2 = parameters["W2"] b2 = parameters["b2"] W3 = parameters["W3"] b3 = parameters["b3"] # LINEAR -> RELU -> LINEAR -> RELU -> LINEAR -> SIGMOID z1 = np.dot(W1, X) + b1 #W1: [5, 2] X[2, 300], matrix multiplication multiplication conditions: Z2 = np.dot(W2, a1) + b2 #W2:[3, 5] a1[5,300] z2:[3,300] a2 = relu(z2) z3 = np. Dot (W3, A1) + b3 #W3:[1,3] a2:[3,300] z3:[1,300] a3 = sigmoid(z3) cache = (z1, a1, W1, b1, z2, a2, W2, b2, z3, a3, W3, B3) return A3, cache # A3 is the last record Y 'value, and then carries out error back propagation according to the expected Y value in advance.Copy the code

The matrix form of neural network layer number and weight W has the following corresponding form: Assuming that neural network is [2, 10, 20, 5, 1], it is obvious that this is a 2-classification problem. So W1 is a 10*2 matrix b1:10*1 w2:20 *10 B2:20*1 W3:5 *20 B3:5*1 W4:1 *5 b4:1 *1 I think the pattern is pretty clear. Here W is each line that connects to the neuron, and B is each neuron.