Difference between revisions of "CcNutch"
Jon Phillips
Revision as of 19:55, 3 May 2006
Creative Commons plugin for the open source Nutch search engine. Module cc-nutch-plugin in the cctools sourceforge repository.
Running instance at http://search.creativecommons.org/index.jsp
Build
todo
Crawl
todo
TODO
- fill in documentation above
- add feature requests here
- update CC Nutch plugin for current Nutch version (may be a no-op apart from testing)
- add support for parsing RDFa (currently embedded RDF/XML is supported)
- add support for indexing assertions about objects other than the current document (e.g. image, audio, video)
- add support for indexing specific attribution metadata
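
To illustrate the RDFa item above: RDFa-style license markup usually appears as a link carrying rel="license", rather than as an RDF/XML block embedded in the page. The sketch below is not the plugin's code (the plugin itself is a Nutch module, and nothing here uses Nutch APIs). It is a minimal standalone Python example, with the hypothetical names LicenseLinkParser, find_license_links and SAMPLE, showing only the simplest case of pulling license URLs out of rel="license" links. Full RDFa processing (prefixes, about/property attributes, assertions about other objects) is out of scope here.

```python
from html.parser import HTMLParser


class LicenseLinkParser(HTMLParser):
    """Collect href values of <a> and <link> elements whose rel includes 'license'.

    Hypothetical helper for illustration only; not part of the CC Nutch plugin.
    """

    def __init__(self):
        super().__init__()
        self.licenses = []

    def handle_starttag(self, tag, attrs):
        if tag not in ("a", "link"):
            return
        attr_map = dict(attrs)
        # rel is a space-separated token list, e.g. rel="license nofollow"
        rel_tokens = (attr_map.get("rel") or "").lower().split()
        href = attr_map.get("href")
        if "license" in rel_tokens and href:
            self.licenses.append(href)


def find_license_links(html):
    """Return the license URLs asserted via rel="license" in an HTML string."""
    parser = LicenseLinkParser()
    parser.feed(html)
    parser.close()
    return parser.licenses


# Made-up sample page fragment for demonstration.
SAMPLE = '''
<p>This work is licensed under a
<a rel="license" href="http://creativecommons.org/licenses/by/2.5/">Creative Commons license</a>.
<a href="http://example.org/">unrelated link</a></p>
'''

print(find_license_links(SAMPLE))
```

A real implementation would also need to decide which object each license statement refers to, which ties into the item above about assertions on objects other than the current document.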