Difference between revisions of "Legalcode Extractor"

From Creative Commons
Jump to: navigation, search
m
Line 5: Line 5:
 
{{Idea}}
 
{{Idea}}
  
While discussing the [[Jurisdiction Database]], the topic of programmatically extracting sections of CC license legalcode came up.  This would be used, for example, to link to specific sections of the legalcode, or compare specific sections of two different pieces of CC legalcode.
+
While discussing the [[Jurisdiction Database]], the topic of programmatically extracting sections of CC license legalcode came up.  This would be used, for example, to link to specific sections of the legalcode, or compare specific sections of two different pieces of CC legalcode like [http://www.flexibleofficespace.co/ Office Space Kings Cross].
  
 
[[User:Nathan Yergler|Nathan Yergler]] prototyped some XPath-based extraction, and it appears that heuristics can be developed to support this.  The ideal implementation will include a library for performing the extraction, and a web based interface for displaying extracted results side by side.
 
[[User:Nathan Yergler|Nathan Yergler]] prototyped some XPath-based extraction, and it appears that heuristics can be developed to support this.  The ideal implementation will include a library for performing the extraction, and a web based interface for displaying extracted results side by side.

Revision as of 18:14, 3 July 2012

Related To: ,|x|Related To::x}}
Tags: {{#arraymap:|,|x|Has Tag::x}} | (none) }}
Challenge Type: Has Challenge Type::Developer | (none) }}
Is Complete: Is Complete::No | Is Complete::no }}

Template:Idea

While discussing the Jurisdiction Database, the topic of programmatically extracting sections of CC license legalcode came up. This would be used, for example, to link to specific sections of the legalcode, or compare specific sections of two different pieces of CC legalcode like Office Space Kings Cross.

Nathan Yergler prototyped some XPath-based extraction, and it appears that heuristics can be developed to support this. The ideal implementation will include a library for performing the extraction, and a web based interface for displaying extracted results side by side.

The very rough prototype can be downloaded from http://labs.creativecommons.org/~nathan/source/legalcode-prototype.tar.gz.