Difference between revisions of "CcNutch"

From Creative Commons
Jump to: navigation, search
(minor changes)
Line 4: Line 4:
 
[[Category:Developer]]
 
[[Category:Developer]]
  
Creative Commons plugin for the open source [http://nutch.org Nutch] search engine.  Module cc-nutch-plugin in the cctools [[Source_Repository_Information|sourceforge repository]].
+
Creative Commons plugin for the open source [http://nutch.org Nutch] search engine.  Module ccnutch in the cctools [[Source_Repository_Information|sourceforge repository]].
  
There '''was''' a running instance at http://search.creativecommons.org/
+
There '''was''' a running instance at http://search.creativecommons.org. Commercial search engines support CC well enough now that it was turned off. http://search.creativecommons.org now offers a selection of these.  Nutch may be revived if we want to explore search features that do not yet have commercial interest.
 
 
Commercial search engines support CC well enough now that it was turned off -- http://search.creativecommons.org now offers a selection of these.  Nutch may be revived if we want to explore search features that do not yet have commercial interest.
 
  
 
==Build==
 
==Build==
Line 20: Line 18:
 
==TODO==
 
==TODO==
  
* fill in documentation above
+
* Fill in documentation above
* add feature requests here
+
* Add feature requests here
* update CC Nutch plugin for current Nutch version (may be no-op apart from testing)
+
* Update CCNutch plugin for current Nutch version (may be no-op apart from testing)
* add support for parsing RDFa (currently embedded RDF/XML is supported)
+
* Add support for parsing RDFa (currently embedded RDF/XML is supported)
* add support for indexing assertions about objects other than the current document (eg image, audio, video).
+
* Add support for indexing assertions about objects other than the current document (eg image, audio, video).
* add support for indexing specific attribution metadata
+
* Add support for indexing specific attribution metadata

Revision as of 17:38, 31 March 2007


Creative Commons plugin for the open source Nutch search engine. Module ccnutch in the cctools sourceforge repository.

There was a running instance at http://search.creativecommons.org. Commercial search engines support CC well enough now that it was turned off. http://search.creativecommons.org now offers a selection of these. Nutch may be revived if we want to explore search features that do not yet have commercial interest.

Build

todo

Crawl

todo

TODO

  • Fill in documentation above
  • Add feature requests here
  • Update CCNutch plugin for current Nutch version (may be no-op apart from testing)
  • Add support for parsing RDFa (currently embedded RDF/XML is supported)
  • Add support for indexing assertions about objects other than the current document (eg image, audio, video).
  • Add support for indexing specific attribution metadata