This Confluence has been LDAP enabled, if you are an ASF Committer, please use your LDAP Credentials to login. Any problems file an INFRA jira ticket please.

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: converted to 1.6 markup

Creating New Uploaded Corpora Users

TODO: this is obsolete and should be ignored! We're no longer using client/server mode for uploaded-corpora mass-checks....

These are instructions for PMC members who wish to set up a new account for users to upload their HandClassifiedCorpora for us to use in the NightlyMassCheck runs.

This method uses mass-check's client-server mode to distribute the load. However this seems to be occasionally buggy. If the resulting mass-checks run into problems, alternatively see UploadedCorporaIndependentMassCheck to set up a mass-check uid in traditional, non-client-server mode.

Log in to the zone and run:

No Format
MCUSER=[username]
MCPWD=[random password]

sudo mkdir /export/home/nitemc/$MCUSER
sudo chmod 1777 /export/home/nitemc/$MCUSER
cd /export/home/nitemc/$MCUSER
echo "$MCPWD" > rsync_password
chmod 600 rsync_password

sed -e "s/MCUSER/$MCUSER/" -e "s/MCPWD/$MCPWD/" > .corpus

And paste in these lines:

No Format
opts_weekly="--net -j 8 --reuse --cache --cachedir=/tmpfs/aicache_nightly --cs_schedule_cache --cs_cachedir=/export/home/nitemc/cache --restart=500 ham:detect:/export/home/bbmass/uploadedcorpora/MCUSER/ham/* --after="-15552000" --tail=25000 spam:detect:/export/home/bbmass/uploadedcorpora/MCUSER/spam/*"
opts_nightly=" --reuse --cache --cachedir=/tmpfs/aicache_nightly --cs_schedule_cache --cs_cachedir=/export/home/nitemc/cache --restart=500 ham:detect:/export/home/bbmass/uploadedcorpora/MCUSER/ham/* --after="-15552000" --tail=25000 spam:detect:/export/home/bbmass/uploadedcorpora/MCUSER/spam/*"
tmp=$HOME/tmp
tree=$HOME/svn
prefs_weekly=$HOME/user_prefs.weekly
prefs_nightly=$HOME/user_prefs.nightly
username=bb-MCUSER
password=__RSYNC_PASSWORD__
serverhost=spamassassin.zones.apache.org.:38899
clienthosts=__CLIENTHOSTS__
clienttree=nightlymc_MCUSER

Then CTRL-D to end cat.

No Format
mkdir tmp
svn co http://svn.apache.org/repos/asf/spamassassin/trunk svn
[accept certificate 'p'ermanently]

sudo chown -R nitemc .

In SVN trunk, edit build/nightlymc/run_nitemc, add their username to the list, check that file in.

Then in the zone, as the uid "automc", do this:

No Format
  cd /home/automc/svn/spamassassin
  svn up

so that that latest script is updated for when cron runs.

Finally, edit /home/corpus-rsync/secrets and add a line to the end, like so:

No Format
bb-$MCUSER:$MCPWD

e.g. if MCUSER was "jm" and the generated MCPWD was "Wi0FdPWg":

No Format
bb-jm:Wi0FdPWg

That's it. Now tell the user their username (with the "bb-" prefix) and password.