Difference between revisions of "DiscoverEd/Install manually"

From Creative Commons

Revision as of 07:43, 11 June 2010

Check out and build the source code

$ git clone git://gitorious.org/discovered/repo.git discovered
$ cd discovered
$ ant

Add a curator and a feed

DiscoverEd uses feeds to help identify resources to crawl. Feeds are provided by curators, who can also provide metadata about resources.

$ ./bin/feeds addcurator "ND OCW" http://ocw.nd.edu/ 
$ ./bin/feeds addfeed rss http://ocw.nd.edu/front-page/courselist/rss http://ocw.nd.edu/

Aggregate and crawl resources

$ ./bin/feeds aggregate
$ mkdir seed
$ ./bin/feeds seed > seed/urls.txt
$ ant -f dedbuild.xml crawl

Run the web application

Edit conf/nutch-site.xml to point to your crawl location.
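
A minimal sketch of that edit, assuming the web application reads the crawl location from the stock Nutch 1.x searcher.dir property (this page does not name the property, so check it against your Nutch version). The path shown is a hypothetical placeholder. If conf/nutch-site.xml already has a configuration element, add only the property element inside it.

```xml
<?xml version="1.0"?>
<configuration>
  <!-- Assumption: searcher.dir is the Nutch 1.x property for the crawl root. -->
  <property>
    <name>searcher.dir</name>
    <!-- Hypothetical path: replace with the directory your crawl wrote to. -->
    <value>/path/to/discovered/crawl</value>
  </property>
</configuration>
```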

$ ant war
$ [copy the war file to your J2EE container]