Atom (standard)
From Wikipedia, the free encyclopedia
Filename extension | .atom, .xml |
---|---|
Internet media type | application/atom+xml |
Type of format | web syndication |
Extended from | XML |
The name Atom applies to a pair of related standards. The Atom Syndication Format is an XML language used for web feeds, while the Atom Publishing Protocol (AtomPub or APP) is a simple HTTP-based protocol for creating and updating web resources.
Web feeds allow software programs to check for updates published on a web site. To provide a web feed, a site owner may use specialized software (such as a content management system) that publishes a list (or "feed") of recent articles or content in a standardized, machine-readable format. The feed can then be downloaded by web sites that syndicate content from the feed, or by feed reader programs that allow Internet users to subscribe to feeds and view their content.
A feed contains entries, which may be headlines, full-text articles, excerpts, summaries, and/or links to content on a web site, along with various metadata.
The Atom format was developed as an alternative to RSS. Ben Trott, an advocate of the new format that became Atom, believed that RSS had limitations and flaws—such as lack of on-going innovation and its necessity to remain backward compatible— and that there were advantages to a fresh design.[1]
Proponents of the new format formed the IETF Atom Publishing Format and Protocol Workgroup. The Atom syndication format was published as an IETF proposed standard in RFC 4287, and the Atom Publishing Protocol was published as RFC 5023.
Contents |
[edit] Usage
Web feeds are used by the weblog community to share the latest entries' headlines or their full text, and even attached multimedia files.[2] These providers allow other websites to incorporate the weblog's "syndicated" headline or headline-and-short-summary feeds under various usage agreements. Atom and other web syndication formats are now used for many purposes, including journalism, marketing, bug-reports, or any other activity involving periodic updates or publications. Atom also provides a standardized way to export an entire blog, or parts of it, for backup or for importing into other blogging systems.
A program known as a feed reader or aggregator can check webpages on behalf of a user and display any updated articles that it finds. It is common to find web feeds on major Web sites, as well as many smaller ones. Some websites let people choose between RSS or Atom formatted web feeds; others offer only RSS or only Atom. In particular, many blog and wiki sites offer their web feeds in the Atom format.
Client-side readers and aggregators may be designed as standalone programs or as extensions to existing programs like web browsers. Browsers are moving toward integrated feed reader functions. Such programs are available for various operating systems.
Web-based feed readers and news aggregators require no software installation and make the user's "feeds" available on any computer with Web access. Some aggregators syndicate (combine) web feeds into new feeds, e.g., taking all football related items from several sports feeds and providing a new football feed. There are several search engines which provide search functionality over content published via these web feeds.
On Web pages, web feeds (Atom or RSS) are typically linked with the word "Subscribe" or with the unofficial web feed logo ().
[edit] Atom compared to RSS 2.0
This section may require cleanup to meet Wikipedia's quality standards. |
When Atom emerged as a format intended to rival or replace RSS, CNET described the motivation of its creators as follows: "Winer's opponents are seeking a new format that would clarify RSS ambiguities, consolidate its multiple versions, expand its capabilities, and fall under the auspices of a traditional standards organization." [3]
A brief description of some of the ways Atom 1.0 differs from RSS 2.0 has been given by Tim Bray, who played a major role in the creation of Atom:[4]
[edit] Content Model
RSS 2.0 may contain either plain text or escaped HTML as a payload, with no way to indicate which of the two is provided. Atom, on the other hand, provides a mechanism to explicitly and unambiguously label the type of content being provided by the entry, and allows for a broad variety of payload types including plain text, escaped HTML, XHTML, XML, Base64-encoded binary, and references to external content such as documents, video, audio streams, and so forth.
[edit] Date Formats
The RSS 2.0 specification relies on the use of RFC 822 formatted timestamps to communicate information about when items in the feed were created and last updated. The Atom working group chose instead to use timestamps formatted according to the rules specified by RFC 3339 (which is a subset of ISO 8601).
[edit] Internationalization
While the RSS vocabulary has a mechanism to indicate a language for the feed, there is no means by which a language for individual items or text elements can be specified. Atom, on the other hand, uses the standardized xml:lang attribute to make it possible to specify a language context for every piece of human readable content in the feed.
Atom also differs from RSS in that it supports the use of Internationalized Resource Identifiers, which allow links to resources and unique identifiers to contain characters outside the US ASCII character set.
[edit] Modularity
The elements of the RSS vocabulary are not generally reusable in other XML vocabularies. The Atom syntax was specifically designed to allow elements to be reused outside the context of an Atom feed document. For instance, it is not uncommon to find atom:link elements being used within RSS 2.0 feeds.
[edit] Barriers to adoption
- Despite the emergence of Atom as an IETF Proposed Standard and the decision by major companies such as Google to embrace Atom, use of the older and more widely known RSS 1.0 and RSS 2.0 formats has continued.
- RSS 2.0 support for enclosures led directly to the development of podcasting. While many podcasting applications, such as iTunes, support the use of Atom 1.0, RSS 2.0 remains the preferred format.[5]
- Many sites choose to publish their feeds in only a single format. For example CNN, the New York Times, and the BBC offer their web feeds only in RSS 2.0 format.
- News articles about web syndication feeds have increasingly used the term "RSS" to refer generically to any of the several variants of the RSS format such as RSS 2.0 and RSS 1.0 as well as the Atom format.[6][7]
- Each of the various web syndication feed formats has attracted large groups of supporters who remain satisfied by the specification and capabilities of their respective formats.
- Sites that publish Atom will often publish RSS as well.
[edit] Development History
[edit] Background
Before the creation of Atom the primary method of web content syndication was the RSS family of formats.
Members of the community who felt there were significant deficiencies with this family of formats were unable to make changes directly to RSS 2.0 because the official specification document stated that it was purposely frozen to ensure its stability.[8]
[edit] Initial Work
In June 2003, Sam Ruby set up a wiki to discuss what makes "a well-formed log entry".[9] This initial posting acted as a rallying point.[10] People quickly started using the wiki to discuss a new syndication format to address the shortcomings of RSS. It also became clear that the new format could form the basis of a more robust replacement for blog editing protocols such as the Blogger API and LiveJournal XML-RPC Client/Server Protocol as well.
The project aimed to develop a web syndication format that was:[11]
- "100% vendor neutral,"
- "implemented by everybody,"
- "freely extensible by anybody, and"
- "cleanly and thoroughly specified."
In short order, a project road map[11] was built. The effort quickly attracted more than 150 supporters, including David Sifry of Technorati, Mena Trott of Six Apart, Brad Fitzpatrick of LiveJournal, Jason Shellen of Blogger, Jeremy Zawodny of Yahoo, Timothy Appnel of the O'Reilly Network, Glenn Otis Brown of Creative Commons and Lawrence Lessig. Other notables supporting Atom include Mark Pilgrim, Tim Bray, Aaron Swartz, Joi Ito, and Jack Park.[12] Also, Dave Winer, the key figure behind RSS 2.0, gave tentative support to the new endeavor.[13]
After this point, discussion became chaotic, due to the lack of a decision-making process. The project also lacked a name, tentatively using "Pie," "Echo," "Necho," and "PEAW"[14] before settling on Atom. After releasing a project snapshot known as Atom 0.2 in early July 2003, discussion was shifted off the wiki.
[edit] Atom 0.3 and Adoption by Google
The discussion then moved to a newly set up mailing list. The next and final snapshot during this phase was Atom 0.3, released in December 2003. This version gained widespread adoption in syndication tools, and in particular it was added to several Google-related services, such as Blogger, Google News, and Gmail. Google's Data APIs (Beta) GData are based on Atom 1.0 and RSS 2.0.
[edit] Atom 1.0 and IETF Standardization
In 2004, discussions began about moving the project to a standards body such as the World Wide Web Consortium or the Internet Engineering Task Force (IETF). The group eventually chose the IETF and the Atompub working group[15] was formally set up in June 2004, finally giving the project a charter and process. The Atompub working group is co-chaired by Tim Bray (the co-editor of the XML specification) and Paul Hoffman. Initial development was focused on the syndication format.
The Atom Syndication Format was issued as a Proposed Standard in IETF RFC 4287 in December 2005. The co-editors were Mark Nottingham and Robert Sayre. This document is known as atompub-format in IETF's terminology. The Atom Publishing Protocol was issued as a Proposed Standard in IETF RFC 5023 in October 2007. Two other drafts have not been standardized.[16]
[edit] Example of an Atom 1.0 Feed
An example of a document in the Atom Syndication Format:
<?xml version="1.0" encoding="utf-8"?> <feed xmlns="http://www.w3.org/2005/Atom"> <title>Example Feed</title> <subtitle>A subtitle.</subtitle> <link href="http://example.org/feed/" rel="self"/> <link href="http://example.org/"/> <updated>2003-12-13T18:30:02Z</updated> <author> <name>John Doe</name> <email>johndoe@example.com</email> </author> <id>urn:uuid:60a76c80-d399-11d9-b91C-0003939e0af6</id> <entry> <title>Atom-Powered Robots Run Amok</title> <link href="http://example.org/2003/12/13/atom03"/> <id>urn:uuid:1225c695-cfb8-4ebb-aaaa-80da344efa6a</id> <updated>2003-12-13T18:30:02Z</updated> <summary>Some text.</summary> </entry> </feed>
[edit] See also
- hAtom - microformat for marking up (X)HTML so that Atom feeds can be derived from it.
- Sam Ruby
- Tim Bray
- Dave Winer
- XML
- XSPF
- Web syndication
- Syndication format family tree
- List of content syndication markup languages
- Podcasting
- Channel Definition Format - an early feed format developed before Atom and RSS.
- Content Management Interoperability Services - Provides an extension to APP for Content Management
[edit] References
- ^ Trott, Benjamin (2003-06-29). "Why We Need Echo". Six Apart — News and Events. http://www.sixapart.com/about/news/2003/06/why_we_need_ech.html.
- ^ See podcasting, vodcasting, broadcasting, screencasting, vlogging, and MP3 blogs.
- ^ Festa, Paul (2003-08-04). "Dispute exposes bitter power struggle behind Web logs". news.cnet.com. http://news.cnet.com/Battle-of-the-blog/2009-1032_3-5059006.html. Retrieved on 2008-08-06. "The conflict centers on something called Really Simple Syndication (RSS), a technology widely used to syndicate blogs and other Web content. The dispute pits Harvard Law School fellow Dave Winer, the blogging pioneer who is the key gatekeeper of RSS, against advocates of a different format."
- ^ "RSS 2.0 and Atom 1.0 Compared". Atom Wiki. http://www.intertwingly.net/wiki/pie/Rss20AndAtom10Compared. Retrieved on 2007-12-04.
- ^ "Making a Podcast". Apple Inc. http://www.apple.com/itunes/store/podcaststechspecs.html. Retrieved on 2007-12-04.
- ^ Quain, John R. (2004-06-03). "Fine-Tuning Your Filter for Online Information". New York Times. http://www.nytimes.com/ref/technology/circuits/03basi.html.
- ^ Tedeschi, Bob (2006-01-29). "There's a Popular New Code for Deals: RSS". New York Times. http://travel2.nytimes.com/2006/01/29/travel/29prac.html.
- ^ "RSS 2.0 Specification — Roadmap". http://blogs.law.harvard.edu/tech/rss#roadmap.
- ^ Ruby, Sam (2003-06-16). "Anatomy of a Well Formed Log Entry". http://www.intertwingly.net/blog/1472.html.
- ^ Bray, Tim (2003-06-23). "I Like Pie". http://www.tbray.org/ongoing/When/200x/2003/06/23/SamsPie.
- ^ a b "Roadmap". Atom Wiki. http://www.intertwingly.net/wiki/pie/RoadMap. Retrieved on 2007-12-04.
- ^ "Roadmap — Supporters". Atom Wiki. http://www.intertwingly.net/wiki/pie/RoadMap#head-74b5d46318b48115b07ab1a2e77cb22df987c284. Retrieved on 2007-12-04.
- ^ Winer, Dave (2003-06-26). "Tentative endorsement of Echo". http://backend.userland.com/2003/06/26.
- ^ Schemaware for PEAW 0.2
- ^ "Atompub working group". http://www.ietf.org/html.charters/atompub-charter.html.
- ^ Internet Engineering Task Force. "Atompub Status Pages". http://tools.ietf.org/wg/atompub. Retrieved on 2007-12-04.
[edit] External links
Atom standard
- RFC 4287 – "The Atom Syndication Format"
- RFC 5023 - "The Atom Publishing Protocol"
- An HTML version of RFC 4287
- Comparison of RSS and Atom Web Feed Formats
- The Atom API - Xml.com column by Mark Pilgrim
- Getting to know the Atom Publishing Protocol - IBM Developer Works article by James Snell
- Atom Landscape Overview - Overview of Atom-related IETF standards and drafts
Atom advocacy / evangelism
- AtomEnabled.org - An unofficial website following the Atom project
- Planet Atom - A fusion of atom-related news
- Atom Syndication Format - Introduction - Overview
Atom history & motivation
- Anatomy of a Well Formed Log Entry - the weblog post that started it all
- Why we need Echo (June 2003) - Motivation and goals of the Atom project
Atom working group links
- Atom Wiki - The main place for work on Atom.
- Atom Protocol Mailing List - WG discussion of the publishing protocol
- Atom Syntax Mailing List - WG discussion of the syndication format
- IETF Atom Publishing Format and Protocol working group (atompub)
Atom Extension Standards
- IANA Registry of Link-Relations - Official registry of atom:link element "rel" attribute values
- RFC 4685 - Atom Threading Extensions
- RFC 4946 - Atom License Extension
- RFC 5005 - Feed Paging and Archiving
- hAtom - a microformat for marking up standard HTML such that an Atom feed can be derived from it.
|