Comparing Algorithms w/ Small Samples

Next: Question 9 Up: Evaluating Hypothesis and Experimental Previous: Question 8

do 5*2 cross validation
each time train both algorithms and test them on each set - this gives 4 error estimates - ,
use these measures in the augmented t test as follows
is for algorithm and trained on and tested on
subtract corresponding error estimates to get two estimated differences and likewise for
the estimated variance is ( is 2 in this case)
do this for each cross validation so is the variance computed for the repetition
5x2cv statistic is:
used 2-fold cv gives large test sets and disjoint training sets, but but training sets are only 1/2 the size which may cause problems
5 replications shown to produce lowest Type I error (incorrectly detecting a difference when no difference exists)

Patricia Riddle
Fri May 15 13:00:36 NZST 1998