Difference between revisions of "CcNutch"

From Creative Commons
Jump to: navigation, search
(minor changes)
Line 10: Line 10:
 
==Build==
 
==Build==
  
todo
+
{{incomplete}}
 
 
 
==Crawl==
 
==Crawl==
  
todo
+
{{incomplete}}
  
==TODO==
+
== Roadmap ==
  
 
* Fill in documentation above
 
* Fill in documentation above

Revision as of 18:02, 2 July 2007


Creative Commons plugin for the open source Nutch search engine. Module ccnutch in the cctools sourceforge repository.

There was a running instance at http://search.creativecommons.org. Commercial search engines support CC well enough now that it was turned off. http://search.creativecommons.org now offers a selection of these. Nutch may be revived if we want to explore search features that do not yet have commercial interest.

Build

Crawl

Roadmap

  • Fill in documentation above
  • Add feature requests here
  • Update CCNutch plugin for current Nutch version (may be no-op apart from testing)
  • Add support for parsing RDFa (currently embedded RDF/XML is supported)
  • Add support for indexing assertions about objects other than the current document (eg image, audio, video).
  • Add support for indexing specific attribution metadata