Difference between revisions of "PuSH Feed Type"

From Creative Commons
Jump to: navigation, search
(Created page with "{{DiscoverEd Specification |contact=Asheesh Laroia |project=AgShare |status=Draft }} The people who run a DiscoverEd instance may wish to be updated nearly-immediately when there...")
 
Line 14: Line 14:
 
== Requirements ==
 
== Requirements ==
  
To be considered working, this specification would provide the following things.
+
A complete implementation of this specification would provide the following things.
  
* Instructions for any Atom/RSS feed to participate in the PuSH network.
+
* DiscoverEd can discover a PuSH ''hub'' mentioned in a feed.
* DiscoverEd can accept POST requests from a PuSH hub. These POST requests are how the hub notifies the Nutch
+
* DiscoverEd can register itself as a ''subscriber'' to that feed on that hub.
* DiscoverEd can subscribe to updates from a PuSH hub (whose address can be configured in the configuration file).
+
* When the hub pings DiscoverEd with an update to that feed, it re-aggregates data from that feed, does a crawl, and merges the index.
 +
 
 +
== Status ==
 +
 
 +
* This draft document has been written. That's all.

Revision as of 15:01, 7 September 2010

Contact Contact::Asheesh Laroia
Project ,|project_name|Project Driver::project_name}}
Status Status::Draft

The people who run a DiscoverEd instance may wish to be updated nearly-immediately when there are new resources published by a curator.

Right now, DiscoverEd instances aggregate feeds and crawl every once in a while, often manually at the behest of the search engine operator. PubSubHubBub provides a way for the DiscoverEd instance to subscribe feeds and receive automatic, nearly-instantaneous notification of new information in the feed.

This can be built on top of existing Atom/RSS feeds that curators already publish.

This feature was defined and developed during the fun DC meeting thing. (Nathan, did that meeting have a name?)

Requirements

A complete implementation of this specification would provide the following things.

  • DiscoverEd can discover a PuSH hub mentioned in a feed.
  • DiscoverEd can register itself as a subscriber to that feed on that hub.
  • When the hub pings DiscoverEd with an update to that feed, it re-aggregates data from that feed, does a crawl, and merges the index.

Status

  • This draft document has been written. That's all.