Diffbot is a robot that sees the web the way people do, and helps developers extract the important parts from any web page.

Real-time feed parsing in the cloud - Atom over PubSubHubbub and XMPP.
