Skip to end of metadata
Go to start of metadata

Public search engines using Nutch

Please sort by name alphabetically

  • Able Grape is a vertical search engine for the world of wine.
  • AskAboutOil is a vertical search portal for the petroleum industry.
  • Asbestos is a vertical search portal and discussion forum for the asbestos and related information.
  • Biible is the first biblical search engine that allows people to search the web for comments of biblical passage. 6 major languages are fully recognized and 150 partially for now.
  • Baynote provides free hosted Nutch search for businesses.
  • Central Budapest Search is a search engine for English language sites focussing on Budapest news, restaurants, accommodation, life and events.
  • Circuit Scout is a search engine for electrical circuits.
  • Coder-Suche.de searchs for coding stuff like apis, documentations, tutorials, openBooks and more. Its origin is german, its contents are mainly english.
  • Dadi360 Usee nutch search engine for providing search of Chinese language websites in North America.
  • Ecolhub Web Search an E. coli specific search engine based on Nutch. EcoliHub WebSearch includes only those sites relevant to E. coli, thereby reducing the number of spurious hits. Searches can be optionally limited to your choice of resources. More than 110,000 pages to search. More resources getting added.
  • Epivista is a search engine of epilepsy related web sites.
  • ertech uses nutch as its search engine. It is integrated with the CMS system aarcat from aarboard.
  • Erzsuche.de is a local search engine for the Erzgebirge (For what? It is the home of the nutcracker) With spell check feature
  • FileRatings Search is a search engine of software product.
  • Gina Wild Erotic Search Engine is based on nutch and uses the language identifier modul to present results according to the choosen language. (Warning: NSFW)
  • Jambot General search engine, uses nutch open source software to crawl and deliver its web search results.
  • jboss homepage The jboss (tm) homepage runs a nutch as homepage search engine.
  • J&C Intersonic uses nutch as its search engine.
  • Jumble Fox - The Australian Search Engine
  • Krugle uses Nutch to crawl web pages for code, archives and technically-interesting content. We also use a modified version of Nutch to crawl CVS/Subversion repositories, and the NutchBean/distributed searcher support to search and generate hits for code and tech info queries.
  • LabforCulture - The essential tool for everyone in arts and culture who creates, collaborates, shares and produces across borders in Europe.
  • Mountbatten Search a search engine that crawls only the part of the Internet located in Uganda.
  • Nsyght.com is a real time search and aggregation service that leverages users social graph.
  • Nurse Web Search - Health Internet Search Engine.
  • Netluchs.de Searchengine for german language websites.
  • Openindex runs numerous small Nutch instances for crawling customer websites and operates a medium sized Hadoop cluster crawling the entire .nl-cctld.
  • Portuguese Web Archive Preserves web contents and provides a search service over information that is no longer available online.
  • Government of Quebec websites Over 400 websites of the government of Quebec (Canada) are indexed by Nutch. The Web application has been developped by Doculibre inc.
  • search2.net general search engine with an international index based on Nutch.
  • SearchMitchell.com is a community search engine for businesses and organizations in Mitchell, SD.
  • Seegnify - Enterprise search engine for private networks or the Web. It is free, open and transparent.
  • TamilSWeb Search is a search engine geared toward south asian web content.
  • SimilarPages - A web discovery and search add-on. Computes similarity between pages using Nutch crawls.
  • SymbolHound - A search engine targeted toward programming- and math- related queries. Allows users to search for queries with special characters that most other search engines ignore, like &, %, #, ¤, ¬, and µ.
  • UmkreisFinder.de is running the GeoPosition plugin for local searches in Germany and in German. Please insert a search term in the first field, a German city name in the second field and choose a perimeter at the last field.
  • UtilitySearch is a search engine for the regulated utility industries (Electricity, Water, Gas, and Telecommunications) in the United States and Canada.
  • Webharvest.gov offers full-text search of nearly 100 million resources collected from US Federal Government websites as part of the National Archive and Records Administration's 2004 Presidential Term Web Harvest

Not working, using different engine, cybersquatted

I propose the deletion or update to the current location for following:

  • BeThere BeSquare is an Event Search Engine for the San Francisco Bay Area that allows users to specify keywords, date, city, address, and category and get details about events in 4 different views. (cybersquatted)
  • Bigsearch.ca uses nutch open source software to deliver its search results. (server not found)
  • BusyTonight: Search for any event in the United States, by keyword, location, and date. Event listings are automatically crawled and updated from original source Web sites. (redirects to Google)
  • Comtec Search is a search engine for UK Tour Operator Package Holiday Brochures. (server not found)
  • eroscanner is a search engine for german adult stuff. Watching the quality of ranking in this hard-fought area might be very interesting. (Warning: NSFW) (cybersquatted)
  • GenSphere - Genealogy Search Engine based on Nutch. (server not found)
  • KnowMyDestination - Search Engine for Travel related stuff. We have created this search engine by using Google WebAPIs to fetch relavant start URLs and then use Nutch to crawl and index those URLs. (server not found)
  • LDSsearch.com - Search engine which indexes sites with a positive bias toward the mormon church. (Google search engine)
  • LOOQ.EU - European search engine which indexes sites in Europe. (server not found)
  • The Million Pixel Search Page - Search engine for Alex Tew's Million Dollar Homepage. (redirects to performancehosting.net)
  • Misterbot.fr a search engine for french language web sites. (blank page)
  • MyOpensourcejobs A Opensource skills jobs site using NUTCH and LAMP based DRUPAL CMS. (server not found)
  • NowAccepting.com is a job search engine. (Error 500)
  • Playfuls.com is a search engine that indexes the most important english gaming-related websites. (cybersquatted)
  • Synoo.com is a small web search engine (cybersquatted)
  • Cornell University Library is collaborating with the research group of Thorsten Joachims to develop a learning search engine for library web pages based on Nutch. The nutch-based search engine is near the bottom of the page.
  • Tokenizer is an online shopping search engine partially powered by Nutch
  • mozDex.com Running Nutch SVN release with Clustering & Ontology support enabled.
  • Creative Commons is a search engine for creative commons licensed material.
  • DiscoverEd is both an open-source software project and an experimental project from Creative Commons which attempts to provide a solution for scalable search and discovery of educational resources on the web.
  • No labels