Difference between revisions of "DiscoverEd/Install manually"
Paulproteus (talk | contribs) |
(→Run the web application) |
||
Line 31: | Line 31: | ||
Edit conf/nutch-site.xml to point to your crawl location. | Edit conf/nutch-site.xml to point to your crawl location. | ||
− | < | + | <pre> |
$ ant war | $ ant war | ||
$ [copy the war file to your J2EE container] | $ [copy the war file to your J2EE container] | ||
− | </ | + | </pre> |
=== Switching to MySQL === | === Switching to MySQL === |
Revision as of 02:01, 24 August 2010
This article is a stub. You can help the Creative Commons by expanding it.
Contents
Check out and build the source code
$ git clone git://gitorious.org/discovered/repo.git discovered $ cd discovered $ ant
Add a curator and a feed
DiscoverEd uses feeds to help identify resources to crawl. Feeds are provided by curators, who can also provide metadata about resources.
$ ./bin/feeds addcurator "ND OCW" http://ocw.nd.edu/ $ ./bin/feeds addfeed rss http://ocw.nd.edu/front-page/courselist/rss http://ocw.nd.edu/
Aggregate and crawl resources
$ ./bin/feeds aggregate $ mkdir seed $ ./bin/feeds seed > seed/urls.txt $ ant -f dedbuild.xml crawl
Run the web application
Edit conf/nutch-site.xml to point to your crawl location.
$ ant war $ [copy the war file to your J2EE container]
Switching to MySQL
By default, DiscoverEd (at least on the next branch) uses an on-disk database called Derby for storing resource metadata. You should use a different database, like MySQL, in production.
To do that, edit conf/discovered.xml and update the following sections as appropriate:
<property> <name>rdfstore.db.driver</name> <value>com.mysql.jdbc.Driver</value> </property> <property> <name>rdfstore.db.url</name> <value>jdbc:mysql://localhost/discovered?autoReconnect=true</value> </property> <property> <name>rdfstore.db.user</name> <value>discovered</value> </property> <property> <name>rdfstore.db.password</name> <value></value> </property>