CcNutch
Jon Phillips
Revision as of 16:26, 19 October 2006
Creative Commons plugin for the open source Nutch search engine. Module cc-nutch-plugin in the cctools sourceforge repository.
Running instance at http://search.creativecommons.org/
Build
todo
Crawl
todo
TODO
- fill in documentation above
- add feature requests here
- update CC Nutch plugin for current Nutch version (may be no-op apart
from testing)
- add support for parsing RDFa (currently embedded RDF/XML is supported)
- add support for indexing assertions about objects other than the
current document (e.g. image, audio, video)
- add support for indexing specific attribution metadata
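As a rough illustration of the RDFa item above: in RDFa, a license assertion about the current document is commonly written as a link carrying rel="license". The sketch below is not the plugin's code (the plugin is a Java Nutch plugin) and not a full RDFa parser; it only collects the href of such links using Python's standard html.parser. The names LicenseLinkParser and find_license_links are hypothetical, chosen for this example.

```python
from html.parser import HTMLParser


class LicenseLinkParser(HTMLParser):
    """Collect href values of <a>/<link> elements whose rel includes 'license'.

    Hypothetical helper for illustration; it ignores RDFa features such as
    the 'about' attribute, so it only covers the current document.
    """

    def __init__(self):
        super().__init__()
        self.licenses = []

    def handle_starttag(self, tag, attrs):
        # Self-closing tags (<link ... />) are routed here as well by default.
        if tag not in ("a", "link"):
            return
        attr_map = dict(attrs)
        # rel may hold several space-separated tokens, e.g. "license nofollow".
        rel_tokens = (attr_map.get("rel") or "").lower().split()
        href = attr_map.get("href")
        if "license" in rel_tokens and href:
            self.licenses.append(href)


def find_license_links(html):
    """Return license URLs asserted via rel="license", in document order."""
    parser = LicenseLinkParser()
    parser.feed(html)
    parser.close()
    return parser.licenses
```

Assertions about other objects (image, audio, video) would additionally need the subject of each statement, which this sketch does not track.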