Check out and build the source code
$ git clone git://gitorious.org/discovered/repo.git discovered $ cd discovered $ ant
Add a curator and a feed
DiscoverEd uses feeds to help identify resources to crawl. Feeds are provided by curators, who can also provide metadata about resources.
$ ./bin/feeds addcurator "ND OCW" http://ocw.nd.edu/ $ ./bin/feeds addfeed rss http://ocw.nd.edu/front-page/courselist/rss http://ocw.nd.edu/
Aggregate and crawl resources
$ ./bin/feeds aggregate $ mkdir seed $ ./bin/feeds seed > seed/urls.txt $ ant -f dedbuild.xml crawl
Run the web application
Edit conf/nutch-site.xml to point to your crawl location.
$ ant war $ [copy the war file to your J2EE container]