Multilayer Networks

Next: Sigmoid Threshold Unit Up: Neural Network Learning Previous: Multilayer Networks & Nonlinear

Multiple layers of linear units still produce only linear functions
Perceptrons have a discontinuous threshold which is undifferentiable and therefore unsuitable for gradient descent
We want a unit whose output is a nonlinear, differentiable function of the inputs
One solution is a sigmoid unit
Like perceptrons it computes a linear combination of its inputs and then applies a threshold to the result. But the threshold output is a continuous function of its input which ranges from 0 to 1.
It is often referred to as a squashing function.

Patricia Riddle
Fri May 15 13:00:36 NZST 1998