Difference between revisions of "CcNutch"
Line 3: | Line 3: | ||
[[Category:Technology]] | [[Category:Technology]] | ||
[[Category:Developer]] | [[Category:Developer]] | ||
− | |||
− | |||
Creative Commons plugin for the open source [http://nutch.org Nutch] search engine. Module cc-nutch-plugin in the cctools [[Source_Repository_Information|sourceforge repository]]. | Creative Commons plugin for the open source [http://nutch.org Nutch] search engine. Module cc-nutch-plugin in the cctools [[Source_Repository_Information|sourceforge repository]]. | ||
− | + | There '''was''' a running instance at http://search.creativecommons.org/ | |
+ | |||
+ | Commercial search engines support CC well enough now that it was turned off -- http://search.creativecommons.org now offers a selection of these. Nutch may be revived if we want to explore search features that do not yet have commercial interest. | ||
==Build== | ==Build== | ||
Line 22: | Line 22: | ||
* fill in documentation above | * fill in documentation above | ||
* add feature requests here | * add feature requests here | ||
− | * update CC Nutch plugin for current Nutch version (may be no-op apart | + | * update CC Nutch plugin for current Nutch version (may be no-op apart from testing) |
− | from testing) | ||
* add support for parsing RDFa (currently embedded RDF/XML is supported) | * add support for parsing RDFa (currently embedded RDF/XML is supported) | ||
− | * add support for indexing assertions about objects other than the | + | * add support for indexing assertions about objects other than the current document (eg image, audio, video). |
− | current document (eg image, audio, video). | ||
* add support for indexing specific attribution metadata | * add support for indexing specific attribution metadata |
Revision as of 23:33, 19 October 2006
Creative Commons plugin for the open source Nutch search engine. Module cc-nutch-plugin in the cctools sourceforge repository.
There was a running instance at http://search.creativecommons.org/
Commercial search engines support CC well enough now that it was turned off -- http://search.creativecommons.org now offers a selection of these. Nutch may be revived if we want to explore search features that do not yet have commercial interest.
Build
todo
Crawl
todo
TODO
- fill in documentation above
- add feature requests here
- update CC Nutch plugin for current Nutch version (may be no-op apart from testing)
- add support for parsing RDFa (currently embedded RDF/XML is supported)
- add support for indexing assertions about objects other than the current document (eg image, audio, video).
- add support for indexing specific attribution metadata