Next: Impact of Reduced Error 
Up: Decision Tree Learning
 Previous: Approaches to Overfitting
 
-  Consider each node for pruning
 -  Pruning = removing the subtree at that node, make it a leaf and
assign the most common class at that node
 -  A node is removed if the resulting tree performs no worse then
the original on the validation set - removes coincidences and errors
 -  Nodes are removed iteratively choosing the node whose removal
most increases the decision tree accuracy on the graph
 -  Pruning continues until further pruning is harmful
 -  uses training, validation & test sets - effective approach if a
large amount of data is available
 
 
Patricia Riddle 
Fri May 15 13:00:36 NZST 1998