Command Line Options of bin/nutch
See each entry for datails of the command arguments and options.
command |
function |
Web page and link database administration, including creation |
|
Adjust database link-analysis scoring |
|
Perform complete crawling and indexing of a set of root urls |
|
NDFS data node |
|
Deletes duplicate documents in a set of segment indexes |
|
Fetch a segment's pages |
|
Print the fetchlist of a segment |
|
Generate new segments to fetch |
|
Run the indexer on a segment's fetcher output |
|
Inject new urls into the web page and link database |
|
Merge several segment indexes |
|
Merges multiple segments & removes duplicates |
|
NDFS name node |
|
NDFS administrative access |
|
Parse contents in one segment |
|
Prunes existing Nutch indexes of unwanted content |
|
Read data from the web page and link db |
|
Read data in an existing segment |
|
Divide data from one segement into several segments |
|
Run a search server of IPC connections |
|
Updates the web page and link db from the segment fetcher output |
|
|
|