Difference between revisions of "CcNutch"

From Creative Commons
Jump to: navigation, search
Line 3: Line 3:
 
[[Category:Technology]]
 
[[Category:Technology]]
 
[[Category:Developer]]
 
[[Category:Developer]]
[[Category:TODO]]
 
{{incomplete}}
 
  
 
Creative Commons plugin for the open source [http://nutch.org Nutch] search engine.  Module cc-nutch-plugin in the cctools [[Source_Repository_Information|sourceforge repository]].
 
Creative Commons plugin for the open source [http://nutch.org Nutch] search engine.  Module cc-nutch-plugin in the cctools [[Source_Repository_Information|sourceforge repository]].
  
Running instance at http://search.creativecommons.org/
+
There '''was''' a running instance at http://search.creativecommons.org/
 +
 
 +
Commercial search engines support CC well enough now that it was turned off -- http://search.creativecommons.org now offers a selection of these.  Nutch may be revived if we want to explore search features that do not yet have commercial interest.
  
 
==Build==
 
==Build==
Line 22: Line 22:
 
* fill in documentation above
 
* fill in documentation above
 
* add feature requests here
 
* add feature requests here
* update CC Nutch plugin for current Nutch version (may be no-op apart
+
* update CC Nutch plugin for current Nutch version (may be no-op apart from testing)
from testing)
 
 
* add support for parsing RDFa (currently embedded RDF/XML is supported)
 
* add support for parsing RDFa (currently embedded RDF/XML is supported)
* add support for indexing assertions about objects other than the
+
* add support for indexing assertions about objects other than the current document (eg image, audio, video).
current document (eg image, audio, video).
 
 
* add support for indexing specific attribution metadata
 
* add support for indexing specific attribution metadata

Revision as of 23:33, 19 October 2006


Creative Commons plugin for the open source Nutch search engine. Module cc-nutch-plugin in the cctools sourceforge repository.

There was a running instance at http://search.creativecommons.org/

Commercial search engines support CC well enough now that it was turned off -- http://search.creativecommons.org now offers a selection of these. Nutch may be revived if we want to explore search features that do not yet have commercial interest.

Build

todo

Crawl

todo

TODO

  • fill in documentation above
  • add feature requests here
  • update CC Nutch plugin for current Nutch version (may be no-op apart from testing)
  • add support for parsing RDFa (currently embedded RDF/XML is supported)
  • add support for indexing assertions about objects other than the current document (eg image, audio, video).
  • add support for indexing specific attribution metadata