Next: Sigmoid Threshold Unit
Up: Neural Network Learning
Previous: Multilayer Networks & Nonlinear
- Multiple layers of linear units still produce only linear functions
- Perceptrons have a discontinuous threshold which is
undifferentiable and therefore unsuitable for gradient descent
- We want a unit whose output is a nonlinear, differentiable function of the inputs
- One solution is a sigmoid unit
- Like perceptrons it computes a linear combination of its inputs
and then applies a threshold to the result. But the threshold output
is a continuous function of its input which ranges from 0 to 1.
- It is often referred to as a squashing function.
Patricia Riddle
Fri May 15 13:00:36 NZST 1998