Difference between revisions of "CcNutch"
(minor changes) |
Jon Phillips (talk | contribs) |
||
Line 10: | Line 10: | ||
==Build== | ==Build== | ||
− | + | {{incomplete}} | |
− | |||
==Crawl== | ==Crawl== | ||
− | + | {{incomplete}} | |
− | == | + | == Roadmap == |
* Fill in documentation above | * Fill in documentation above |
Revision as of 17:02, 2 July 2007
Creative Commons plugin for the open source Nutch search engine. Module ccnutch in the cctools sourceforge repository.
There was a running instance at http://search.creativecommons.org. Commercial search engines support CC well enough now that it was turned off. http://search.creativecommons.org now offers a selection of these. Nutch may be revived if we want to explore search features that do not yet have commercial interest.
Build
Crawl
Roadmap
- Fill in documentation above
- Add feature requests here
- Update CCNutch plugin for current Nutch version (may be no-op apart from testing)
- Add support for parsing RDFa (currently embedded RDF/XML is supported)
- Add support for indexing assertions about objects other than the current document (eg image, audio, video).
- Add support for indexing specific attribution metadata