Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: missing edit-log entry for this revision

...

This is a log of what I did to run a 10-fold cross-validation test of the perceptron vs the GA when testing bug 2910 ( , http://bugzilla.spamassassin.org/show_bug.cgi?id=2910 ) – (-- JustinMason 21/01/04).

First, I checked out the source:

...

Results will appear in "tenpass_results" – over the course of 4 days. (wink)THE PERCEPTRON:

10-Fold Testing With The Perceptron Instead of GA

If all goes well, the Perceptron will take over from the GA as the main way we generate scores; in that case, this section will be obsolete.

copied ./tenpass/10pass-run to ./10pass-run-perceptron .

...

No Format
  make clean >> make.output
  make -C perceptron_c clean >> make.output
  make tmp/tests.h >> make.output 2>&1
  rm -rf perceptron_c/tmp; cp -r tmp perceptron_c/tmp
  make -C perceptron_c >> make.output
  ( cd perceptron_c ; ./perceptron -p 0.75 -e 100 )
  pwd; date

Change

No Format
  cp craig-evolve.scores tenpass_results/scores.$id

to

No Format
   cpperl -pe 's/^(score\s+\S+\s+)0\s+/$1/gs;' \
      < perceptron_c/perceptron.scores \
      > tenpass_results/scores.$id

(required to work around an extra digit output by the perceptron app) and run ./10pass-run-perceptron . This one runs quicker completes a lot more quickly (wink)