Difference between revisions of "DiscoverEd Data"

From Creative Commons
Jump to: navigation, search
(Data Gathered)
Line 15: Line 15:
 
* Crawled pages (embedded [[RDFa]])
 
* Crawled pages (embedded [[RDFa]])
  
The aggregated information, along with source annotations, is stored in a triple store.  This will be available soon as a SPARQL endpoint.
+
You can read more details about our metadata specifications [http://wiki.creativecommons.org/CcLearn_Search_Metadata here]. The aggregated information, along with source annotations, is stored in a triple store.  This will be available soon as a SPARQL endpoint.
  
 
[[Category:Learn]]
 
[[Category:Learn]]
 
[[Category:Developer]]
 
[[Category:Developer]]
 
[[Category:DiscoverEd]]
 
[[Category:DiscoverEd]]

Revision as of 00:16, 11 June 2009

DiscoverEd is a project of ccLearn. You can find general information about the project on the ccLearn site.

This page documents ways in which developers may use the data gathered by the project for other purposes.

Data Gathered

The DiscoverEd project is a web-scale search of educational resources with a special emphasis on Open Educational Resources (OER). As such, it utilizes a web-wide index, promoting results which have been identified as OER. ccLearn is serving as an aggregation point for other organizations which have identified or produced OER.

Data is aggregated from several sources, including:

  • RSS and Atom feeds (title, description and subject information)
  • OAI-PMH repositories (OAI-DC metadata)
  • Crawled pages (embedded RDFa)

You can read more details about our metadata specifications here. The aggregated information, along with source annotations, is stored in a triple store. This will be available soon as a SPARQL endpoint.