Thursday, July 5, 2012

Configuring Nutch In Eclipse

Configuring nutch in eclipse can really be nightmare to developers, which may eventually lead them to unfollow the project.

Here are the simplest steps that can let any developer run nutch within eclipse.


  • Get eclipse
  • Get a copy of nutch source. 
  • Save it somewhere in your eclipse workspace. Name it as NUTCH_HOME
  • In the terminal, cd NUTCH_HOME
  • In the terminal, run "ant" command to build the nutch project.
  • now remove build directory, rm -rf build
  • Now eclipse part. Create new Java project., name it as NUTCH_HOME...dont click finish...click Next.
  • In the next screen, change the build directory from NUTCH_HOME/build to NUTCH_HOME/tmp-build
  • Go to libraries tab, add src/conf to the class folder
  • Go to order & export tab and move NUTCH_HOME/conf to top
  • open the file NUTCH_HOME/conf/nutch-defaul.xml and change plugins.folder property to ./src/plugins
  • set the agent name property.
  • clean and build project...and you are finish

Happy Coding...!

2 comments:

Unknown said...

This is deprecated. The updated steps are here: https://wiki.apache.org/nutch/RunNutchInEclipse

Arpit Joshi said...

Thanks Tejas,
this is a year old post..ill update it when i get the time.