Difference between revisions of "CcNutch"

From Creative Commons
Jump to: navigation, search
(minor changes)
 
(2 intermediate revisions by one other user not shown)
Line 1: Line 1:
[[Category:CcNutch]]
+
'''CcNutch''' is a Creative Commons plugin for the open source [http://nutch.org Nutch] search engine.  Module ccnutch in the cctools [[Source_Repository_Information|sourceforge repository]].
[[Category:opensource]]
 
[[Category:Technology]]
 
[[Category:Developer]]
 
 
 
Creative Commons plugin for the open source [http://nutch.org Nutch] search engine.  Module ccnutch in the cctools [[Source_Repository_Information|sourceforge repository]].
 
  
 
There '''was''' a running instance at http://search.creativecommons.org. Commercial search engines support CC well enough now that it was turned off. http://search.creativecommons.org now offers a selection of these.  Nutch may be revived if we want to explore search features that do not yet have commercial interest.
 
There '''was''' a running instance at http://search.creativecommons.org. Commercial search engines support CC well enough now that it was turned off. http://search.creativecommons.org now offers a selection of these.  Nutch may be revived if we want to explore search features that do not yet have commercial interest.
Line 10: Line 5:
 
==Build==
 
==Build==
  
todo
+
{{incomplete}}
 +
==Crawl==
  
==Crawl==
+
{{incomplete}}
  
todo
+
== Roadmap ==
  
==TODO==
+
=== Milestone 0 ===
  
 
* Fill in documentation above
 
* Fill in documentation above
* Add feature requests here
 
 
* Update CCNutch plugin for current Nutch version (may be no-op apart from testing)
 
* Update CCNutch plugin for current Nutch version (may be no-op apart from testing)
 
* Add support for parsing RDFa (currently embedded RDF/XML is supported)
 
* Add support for parsing RDFa (currently embedded RDF/XML is supported)
 +
 +
=== Milestone 1 ===
 
* Add support for indexing assertions about objects other than the current document (eg image, audio, video).
 
* Add support for indexing assertions about objects other than the current document (eg image, audio, video).
 
* Add support for indexing specific attribution metadata
 
* Add support for indexing specific attribution metadata
 +
 +
=== Milestone 2 ===
 +
 +
* Add feature requests here
 +
 +
=== Milestone 3 ===
 +
* deploy ccNutch on some infrastructure for testing
 +
 +
=== Milestone 4 ===
 +
 +
* Add feature requests here
 +
 +
[[Category:CcNutch]]
 +
[[Category:opensource]]
 +
[[Category:Technology]]
 +
[[Category:Developer]]

Latest revision as of 02:35, 24 July 2009

CcNutch is a Creative Commons plugin for the open source Nutch search engine. Module ccnutch in the cctools sourceforge repository.

There was a running instance at http://search.creativecommons.org. Commercial search engines support CC well enough now that it was turned off. http://search.creativecommons.org now offers a selection of these. Nutch may be revived if we want to explore search features that do not yet have commercial interest.

Build

Crawl

Roadmap

Milestone 0

  • Fill in documentation above
  • Update CCNutch plugin for current Nutch version (may be no-op apart from testing)
  • Add support for parsing RDFa (currently embedded RDF/XML is supported)

Milestone 1

  • Add support for indexing assertions about objects other than the current document (eg image, audio, video).
  • Add support for indexing specific attribution metadata

Milestone 2

  • Add feature requests here

Milestone 3

  • deploy ccNutch on some infrastructure for testing

Milestone 4

  • Add feature requests here