Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 4.0

This page is in draft. Refer to the dev mail list for more information

Droids Web Services is a proposed module (i.e. not yet implemented) that offer web crawling functionalities in cloud computing platform. It works as follows:

  • A web application that expose Droids core functions in Web APIs
    • support URL fetching, HTML/Image parsing, and data extraction
    • Spring HTTP Invoker is chosen. (any binary web remoting technology is fine.)
  • The original Droids client component , with configuration to call remote Worker rather than using the local workerthat is configured to use a remote worker
    • The worker will no longer make local request to do fetch. Instead, it make remoting call to the web services and collect results.

Requirement

  • unlimited scalability / extreme throughput
  • support any cloud computing platform, e.g. Google App Engine, Amazon EC2 etc.
  • share nothing in the server application. no use of session. every remote method call is a complete process.

...