Next: Impact of Reduced Error
Up: Decision Tree Learning
Previous: Approaches to Overfitting
- Consider each node for pruning
- Pruning = removing the subtree at that node, make it a leaf and
assign the most common class at that node
- A node is removed if the resulting tree performs no worse then
the original on the validation set - removes coincidences and errors
- Nodes are removed iteratively choosing the node whose removal
most increases the decision tree accuracy on the graph
- Pruning continues until further pruning is harmful
- uses training, validation & test sets - effective approach if a
large amount of data is available
Patricia Riddle
Fri May 15 13:00:36 NZST 1998