Neural Networks

These models aim to mimic the human brain, or are at least inspired by it.

The basic unit in a neural network is a neuron. A neuron computes a weighted sum of its inputs and then an activation is computed.

X=[x0x1....xd]X=\begin{bmatrix}x_0\\x_1\\.\\.\\.\\.\\x_d\end{bmatrix}, W=[w0w1....wd]W=\begin{bmatrix}w_0\\w_1\\.\\.\\.\\.\\w_d\end{bmatrix}and x0=1x_0 = 1.

The sigmoid activation function 11+eWTX\frac{1}{1+e^{-W^TX}} gives a probability as an output. The threshold/step activation function outputs 1 if WTX>0W^TX > 0 and 0 otherwise. The linear activation function simply outputs WTXW^TX.

Some common activation functions:

A neural network (also known as a Multi Layer Perceptron (MLP)) has multiple layers of neurons. The most common problem faced in neural networks is the credit assignment problem: it is difficult to determine which neurons are to be given credit/blame for an increase/decrease in accuracy.

Last updated