Running Nutch with Mac OSX
Downloading and setting up Tomcat
Download Tomcat (http://tomcat.apache.org/). The latest versions require J2SE 1.5 which can be downloaded from www.apple.com (Tiger users only). I downloaded apache-tomcat-5.5.12.tar.gz.
Open a terminal window and copy the file to /usr/local (cp apache-tomcat-5.5.12.tar.gz /usr/local) tar -zxvf apache-tomcat-5.5.12.tar.gz Start Tomcat (see below)
You will see something like:
Check that tomcat is running by opening http://localhost:8080. This should bring up Tomcat's Welcome Page.
Finally edit tomcat-users.xml which is in your Tomcat/conf Directory and add a 'manager' role.
Downloading and setting up Nutch
Download nutch-0.7.1.tar.gz or some other release and place the file somewhere in your Home directory. Expand the file using Stuffit Expander or the tar command. Open http://localhost:8080 and click on the link 'Tomcat Manager' Click select WAR file to upload. Browse to the Nutch Directory and select the file 'nutch-0.7.1.war' which is located in the nutch root folder. Click 'Deploy' Check http://localhost:8080/nutch-0.7.1/en/search.html. You should see the Nutch Search Form.
Note that the nutch command line tool (in our case nutch-0.7.1/bin/nutch) is not installed under the Tomcat web-application ($CATALINA_HOME/webapps/nutch-0.7.1/WEB-INF/...). You can either leave it there or move it manually to your tomcat/webapps/nutch/WEB-INF/classes. In the first case you will have to do some classpath configuring or maintain two nutch-site.xml files (one for indexing and one for searching).
Using Terminal, cd to the directory where your bin/nutch is located. From here you can follow the instructions from the tutorial.
Just like any other mac application the Terminal is scriptable which is a nice feature. The applescript below will start a crawl just by doubleclicking it's icon.