Next: Question 9
Up: Evaluating Hypothesis and Experimental
Previous: Question 8
- do 5*2 cross validation
- each time train both algorithms and test them on each set - this
gives 4 error estimates - ,
- use these measures in the augmented t test as follows
- is for algorithm and
trained on and tested on
- subtract corresponding error estimates to get two estimated
differences and likewise for
- the estimated variance is ( is 2 in this case)
-
- do this for each cross validation so is the variance
computed for the repetition
- 5x2cv statistic is:
- used 2-fold cv gives large test sets and disjoint training sets,
but but training sets are only 1/2 the size which may cause problems
- 5 replications shown to produce lowest Type I error (incorrectly
detecting a difference when no difference exists)
Patricia Riddle
Fri May 15 13:00:36 NZST 1998