Legalcode Extractor
Revision as of 18:14, 3 July 2012 by Claudia39u (talk | contribs)
Related To: | ,|x|Related To::x}} |
---|---|
Tags: | {{#arraymap:|,|x|Has Tag::x}} | (none) }} |
Challenge Type: | Has Challenge Type::Developer | (none) }} |
Is Complete: | Is Complete::No | Is Complete::no }} |
While discussing the Jurisdiction Database, the topic of programmatically extracting sections of CC license legalcode came up. This would be used, for example, to link to specific sections of the legalcode, or compare specific sections of two different pieces of CC legalcode like Office Space Kings Cross.
Nathan Yergler prototyped some XPath-based extraction, and it appears that heuristics can be developed to support this. The ideal implementation will include a library for performing the extraction, and a web based interface for displaying extracted results side by side.
The very rough prototype can be downloaded from http://labs.creativecommons.org/~nathan/source/legalcode-prototype.tar.gz.