CcNutch
Revision as of 23:33, 19 October 2006 by Mike Linksvayer
CcNutch is a Creative Commons plugin for the open source Nutch search engine. The code is the cc-nutch-plugin module in the cctools SourceForge repository.
A running instance used to be available at http://search.creativecommons.org/
It was turned off once commercial search engines supported CC well enough; http://search.creativecommons.org now offers a selection of these instead. Nutch may be revived if we want to explore search features that do not yet have commercial interest.
Build
todo
Crawl
todo
TODO
- fill in documentation above
- add feature requests here
- update CC Nutch plugin for current Nutch version (may be no-op apart from testing)
- add support for parsing RDFa (currently embedded RDF/XML is supported)
- add support for indexing assertions about objects other than the current document (e.g. image, audio, video)
- add support for indexing specific attribution metadata
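
The "embedded RDF/XML" mentioned in the list above refers to license metadata carried as an RDF/XML block inside the page, typically wrapped in an HTML comment. As a rough illustration of what extracting it involves, here is a minimal sketch in Python. It is not the plugin's actual code (Nutch plugins are written in Java). The function name find_license_uris, the sample page, and the regex-based approach are all assumptions made for this example; the sketch also assumes the block uses the literal rdf: prefix and scans the whole page rather than only comments.

```python
import re
import xml.etree.ElementTree as ET

# Namespaces used by embedded Creative Commons RDF/XML metadata.
CC_NS = "http://web.resource.org/cc/"
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"

# Assumption: the block is delimited by literal <rdf:RDF ...> ... </rdf:RDF> tags.
RDF_BLOCK = re.compile(r"<rdf:RDF\b.*?</rdf:RDF>", re.DOTALL)

# Hypothetical sample page with RDF/XML embedded in an HTML comment.
SAMPLE_HTML = """<html><body>
<p>Some licensed work.</p>
<!--
<rdf:RDF xmlns="http://web.resource.org/cc/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
<Work rdf:about="">
  <license rdf:resource="http://creativecommons.org/licenses/by/2.5/" />
</Work>
</rdf:RDF>
-->
</body></html>"""


def find_license_uris(html):
    """Return the license URIs asserted in RDF/XML blocks found in html."""
    uris = []
    for block in RDF_BLOCK.findall(html):
        try:
            root = ET.fromstring(block)
        except ET.ParseError:
            continue  # skip blocks that are not well-formed XML
        # cc:license elements point at the license via rdf:resource.
        for lic in root.iter("{%s}license" % CC_NS):
            uri = lic.get("{%s}resource" % RDF_NS)
            if uri:
                uris.append(uri)
    return uris
```

Calling find_license_uris(SAMPLE_HTML) returns a list of the license URIs found in the page, which an indexer could then store as a searchable field. RDFa support would need a different approach, since RDFa expresses the same assertions through attributes on ordinary HTML elements rather than in a separate XML block.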