This Confluence has been LDAP enabled, if you are an ASF Committer, please use your LDAP Credentials to login. Any problems file an INFRA jira ticket please.

Page tree

Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: converted to 1.6 markup

Creating New Uploaded Corpora Users

TODO: this is obsolete and should be ignored! We're no longer using client/server mode for uploaded-corpora mass-checks....

These are instructions for PMC members who wish to set up a new account for users to upload their HandClassifiedCorpora for us to use in the NightlyMassCheck runs.

This method uses mass-check's client-server mode to distribute the load. However this seems to be occasionally buggy. If the resulting mass-checks run into problems, alternatively see UploadedCorporaIndependentMassCheck to set up a mass-check uid in traditional, non-client-server mode.

Log in to the zone and run:

No Format
MCPWD=[random password]

sudo mkdir /export/home/nitemc/$MCUSER
sudo chmod 1777 /export/home/nitemc/$MCUSER
cd /export/home/nitemc/$MCUSER
echo "$MCPWD" > rsync_password
chmod 600 rsync_password

sed -e "s/MCUSER/$MCUSER/" -e "s/MCPWD/$MCPWD/" > .corpus

And paste in these lines:

No Format
opts_weekly="--net -j 8 --reuse --cache --cachedir=/tmpfs/aicache_nightly --cs_schedule_cache --cs_cachedir=/export/home/nitemc/cache --restart=500 ham:detect:/export/home/bbmass/uploadedcorpora/MCUSER/ham/* --after="-15552000" --tail=25000 spam:detect:/export/home/bbmass/uploadedcorpora/MCUSER/spam/*"
opts_nightly=" --reuse --cache --cachedir=/tmpfs/aicache_nightly --cs_schedule_cache --cs_cachedir=/export/home/nitemc/cache --restart=500 ham:detect:/export/home/bbmass/uploadedcorpora/MCUSER/ham/* --after="-15552000" --tail=25000 spam:detect:/export/home/bbmass/uploadedcorpora/MCUSER/spam/*"

Then CTRL-D to end cat.

No Format
mkdir tmp
svn co svn
[accept certificate 'p'ermanently]

sudo chown -R nitemc .

In SVN trunk, edit build/nightlymc/run_nitemc, add their username to the list, check that file in.

Then in the zone, as the uid "automc", do this:

No Format
  cd /home/automc/svn/spamassassin
  svn up

so that that latest script is updated for when cron runs.

Finally, edit /home/corpus-rsync/secrets and add a line to the end, like so:

No Format

e.g. if MCUSER was "jm" and the generated MCPWD was "Wi0FdPWg":

No Format

That's it. Now tell the user their username (with the "bb-" prefix) and password.