%ents; ]>
Stream Management This specification defines an XMPP protocol extension for active management of an XML stream between two XMPP entities, including features for stanza acknowledgements, pings, and stream resumption. &LEGALNOTICE; 0198 Experimental Standards Track Standards XMPP Core None None NOT_YET_ASSIGNED &infiniti; &hildjj; &stpeter; 0.6 2009-03-19 psa
  • Incremented protocol number from 0 to 1.
  • Changed attribute names from c (?) and b (?) to u (unacknowledged) and h (handled).
  • Added stanzas attribute to specify maximum number of stanzas between acking requests.
  • More clearly defined error handling using <failed/> element plus stanza error conditions.
  • Defined error handling for <ping/> element by allowing stanza error conditions in <pong/> element.
  • More clearly specified maximum reconnect time.
  • Added detailed scenarios for basic acking and for outbound and inbound throttling.
0.5 2008-09-29 psa

Removed recommendation to use namespace prefixes; modified namespace to incorporate namespace versioning.

0.4 2008-09-08 jjh/jk/psa

Added support for session resumption; re-organized the document; changed name to stream management; changed provisional namespace.

0.3 2007-10-03 jk

Updates per devcon discussion.

0.2 2007-04-05 jk

Require c attribute on <r/> element. Describe minimal implementation. Switch to standard temporary namespace.

0.1 2006-11-21 psa

Initial published version.

0.0.3 2006-11-08 jk New version, using sequence numbers. 0.0.2 2004-12-11 jk Further clarification, allow acking many stanzas at once. 0.0.1 2004-08-09 jk First draft.

&xmppcore; defines the fundamental streaming XML technology used by XMPP (i.e., stream establishment and termination including authentication and encryption). However, the core XMPP specification does not provide tools for actively managing a "live" XML stream. In particular, the following management features might improve network reliability and the end-user experience (especially when connectivity is infrequent):

Detailed descriptions of these features are provided in the remainder of this specification.

This specification applies at the level of an XML stream between a client and a server or between a server and a peer server. By constrast, &xep0079;, &xep0184;, and &xep0199; cover acks and pings that are sent end-to-end over multiple streams; these facilities are useful in special scenarios but are unnecessary for checking of a direct stream between two XMPP entities. It is also expected that this protocol will revive interest in Advanced Message Processing (AMP), because single-hop acks are necessary for AMP delivery receipts to function properly.

The basic concept behind stream management is that the initiating entity (either a client or a server) and the receiving entity (a server) can exchange commands for active management of the stream. Instead of using XMPP IQ, message, or presence stanzas (which are relatively verbose), stream management uses a series of short XML elements at the root stream level.

The benefits to be gained from stream management include the following:

After negotiating use of TLS and authenticating via SASL, the receiving entity returns a new stream header to the intiating entity along with stream features, where the features include an <sm/> element qualified by the 'urn:xmpp:sm:1' namespace &VNOTE;.

The stream management feature MUST NOT be offered unless the initiating entity has been authenticated via SASL, &xep0078;, or &xep0220;.

]]>

The <sm/> element MAY include a 'stanzas' attribute whose value indicates the server's preferred maximum number of received stanzas between acks. For example, if the 'stanzas' attribute has a value of "10" then a server would prefer that a connected client request an ack after the client has sent at most ten stanzas to the server.

]]>

If the receiving entity offers stream resumption in addition to acks and pings, the <sm/> element MUST include an 'id' attribute (a unique identifier for the session) and SHOULD include a 'max' attribute that specifies the longest allowable time period for session resumption (in seconds).

]]>

Definition: The 'id' attribute defines a unique identifier for purposes of stream management (an "SM-ID"). The SM-ID MUST be generated by the receiving entity (server). The initiating entity MUST consider the SM-ID to be opaque and therefore MUST NOT assign any semantic meaning to the SM-ID. The receiving entity MAY encode any information it deems useful into the SM-ID, such as the full JID &LOCALFULL; of a connected client (e.g., the full JID plus a nonce value). Any characters allowed in an XML attribute are allowed. The SM-ID MUST NOT be reused for simultaneous or subsequent sessions (as long as the receiving entity is available). The SM-ID SHOULD NOT be longer than 4000 bytes.

]]>

To enable use of stream management, the client sends an <enable/> command to the server.

]]>

If the client wants to be allowed to resume the stream, it includes a boolean 'resume' attribute, which defaults to false &BOOLEANNOTE;. For information about resuming a previous session, see the Resumption section of this document.

Upon receiving the enable request, the receiving entity MUST reply with an <enabled/> element or an <failed/> element qualified by the 'urn:xmpp:sm:1' namespace. The <failed/> element indicates that there was a problem enabling the stream management "session". The <enabled/> element indicates successful enabling of the stream management session.

For client-to-server connections, the client SHOULD NOT attempt to enable stream management until after it has completed Resource Binding. The server MAY enforce this order and return a <failed/> element in response (see Error Handling).

]]>

If session resumption is allowed, the receiving entity MUST include a 'resume' attribute set to a value of "true" or "1".

]]>

The parties can then the use stream management features defined below.

After enabling the feature, the initiating or receiving entity can send ack elements at any time over the stream. An ack element is one of the following:

The <r/> and <a/> elements are qualified by the 'urn:xmpp:sm:1' namespace.

The following attributes are defined:

An <r/> element MUST contain a 'u' attribute and MAY contain an 'h' attribute.

An <a/> element MUST contain an 'h' attribute and MAY contain a 'u' attribute.

Therefore an ack element is used to indicate an unacknowledged sequence number (contains 'u'), to acknowledge a sequence number (contains 'h'), or to do both at once (contains 'u' and contains 'h').

Definition: Acknowledging a previously-received ack element indicates that the stanza(s) sent since then have been "handled" by the receiver. By "handled" we mean that the stanzas have been either processed directly by the receiver, delivered to a local entity such as another connected client on the same server, or routed to a remote entity at a different server. Acking a stanza does not necessarily indicate that the stanza has been successfully delivered to the intended recipient, only that it has been handled by the entity at the other end of an XML stream.

I'll send a friar with speed, to Mantua, with my letters to thy lord. ]]>

Note: Ack elements SHOULD be sent immediately after sending an XMPP stanza. In particular, if a request is received, an applications MAY wait a short period for something else to send before responding, so that the response can be sent with the other data.

When an <r/> element ("request") is received, the recipient MUST acknowledge it by sending an ack element (either <a/> or <r/>) to the sender. The sender does not have to wait for an ack to continue sending stanzas. The response MUST contain a value of 'h' that is greater than or equal to the 'u' value given in the request. The response SHOULD be sent as soon as possible, and MUST NOT be withheld for any condition other than a timeout. For example, a client with a slow connection might want to collect many stanzas over a period of time before acking, and a server might want to throttle incoming stanzas. Because acks indicate stanza acceptance, a server that is throttling stanzas MUST delay the response until the client is no longer being penalized.

When a recipient receives a sequence number for unacknowledged stanzas (via the 'u' attribute), the recipient SHOULD keep a record of this value as the last received sequence number for the current stream. Every time a new sequence number is received, the previous number SHOULD be discarded. If a stream ends and it is not resumed within the time specified in the acknowledgement feature element, the sequence number and any associated state MAY be discarded. Before the session state is discarded, implementations SHOULD take alternative action regarding any unacknowledged stanzas (e.g. stanzas sent after the latest sequence number reported by 'h'). A server implementation SHOULD treat unacknowledged stanzas in the same way that it would treat a stanza sent to an unavailable resource, by either returning an error to the sender or committing the stanza to offline storage. A user-oriented client implementation SHOULD inform the user of the failure via appropriate user-interface elements.

]]>

Either entity can also ping the other. This is useful for ensuring that the TCP connection is still up and working, and also for determining latency. The procedure is intended to replace the legacy behavior of sending whitespace. Pinging is done by sending a <ping/> element:

]]>

The peer then MUST reply immediately with a <pong/> element.

]]>

A server that is throttling stanzas (and thus withholding ack answers until later) MUST still immediately reply to pings and SHOULD include an appropriate error or status condition in the <pong/> element (preferably &constraint; for inbound throttling and &policy; for outbound throttling). However, if the connected entity continues to send stanzas or sends excessive pings then the server MAY terminate the stream as explained elsewhere.

It can happen that an XML stream is terminated unexpectedly (e.g., because of network outages). In this case, it is desirable to quickly resume the former stream rather than complete the tedious process of stream establishment, roster retrieval, and presence broadcast.

To request that the stream will be resumable, when enabling stream management the initiating entity MUST add a 'resume' attribute to the <enable/> element with a value of "true" or "1" &BOOLEANNOTE;.

]]>

If the receiving entity will allow the stream to be resumed, it MUST include a 'resume' attribute set to "true" or "1" on the <enabled/> element.

]]>

If the stream is terminated unexpectedly, the initiating entity would then open a TCP connection to the receiving entity. The order of events is envisioned to be as follows:

  1. Initiating entity sends initial stream header.
  2. Receiving entity sends response stream header.
  3. Receiving entity sends stream features.
  4. Initiating entity sends STARTTLS request.
  5. Receiving entity informs initiating entity to proceed with the TLS negotiation.
  6. The parties complete a TLS handshake. (Note: When performing session resumption and also utilizing TLS, it is RECOMMENDED to take advantage of TLS session resumption to further optimize the resumption of the XML stream.)
  7. Initiating entity sends new initial stream header.
  8. Receiving entity sends response stream header.
  9. Receiving entity sends stream features, requiring SASL negotiation and offering appropriate SASL mechanisms. (Note: If the server considers the information provided during TLS session resumption to be sufficient authentication, it MAY offer the SASL EXTERNAL mechanism; for details, refer to &sasltls;.)
  10. The parties complete SASL negotiation.
  11. Initiating entity sends new initial stream header.
  12. Receiving entity sends response stream header.
  13. Receiving entity sends stream features, offering the SM feature.
  14. Initiating entity requests resumption of the former stream.

To request resumption of the former stream, the initiating entity sends a <resume/> element qualified by the 'urn:xmpp:sm:1' namespace. This element MUST include a 'previd' attribute whose value is the SM-ID of the former stream and SHOULD include an 'h' attribute that identifies the last acknowledged sequence number sent over the former stream from the receiving entity to the initiating entity.

]]>

If the receiving entity can resume the former stream, it MUST return a <resumed/> element that includes a 'previd' attribute set to the SM-ID of the former stream. If the receiving entity does not support session resumption or does not recognize the 'previd' as an earlier session, it MUST return an <failed/> element, which SHOULD include an error condition of &feature; or ¬found; respectively.

The <resumed/> element MAY also include an 'h' attribute set to the last acknowledged sequence number sent over the former stream from the initiating entity to the receiving entity. If there is no known last acknowledged sequence number for the former stream, then the 'h' attribute MUST NOT be included.

If the former stream is resumed and the receiving entity still has the stream for the previously-identified session open at this time, the old stream SHOULD be terminated.

]]>

When a session is resumed, the parties SHOULD proceed as follows:

If an error occurs with regard to an <enable/> or <resume/> element, the receiving entity MUST return a <failed/> element. This element SHOULD contain an error condition, which MUST be one of the stanza error conditions defined in &rfc3920bis;.

An example follows.

]]>

In addition, the <pong/> element MAY contain an error condition.

Stream management errors SHOULD be considered recoverable; however, misuse of stream management (e.g., sending excessive pings) MAY result in termination of the stream.

The following scenarios illustrate several different uses of stream management. The examples are that of a client and a server, but stream management can also be used for server-to-server streams.

The Stream Management protocol can be used to improve reliability using acks without the ability to resume a session. In fact, a basic implementation might not even care about sequence numbers and therefore would do the following:

  • As an initiating entity, send <enable/> with no attributes, and ignore the attributes on the <enabled/> response.
  • As a receiving entity, ignore the attributes on the <enable/> element received, and respond via <enabled/> with no attributes.
  • When receiving an <r/> or <a/> element with a 'u' attribute, immediately respond via an <a/> element where the value of 'h' sent is the value of 'u' received.
  • Keep an integer X for this stream session, initially set to zero. When about to send a stanza, first put the stanza (paired with the current value of X) in an "unacknowleged" queue. Then send the stanza over the wire with <r u='[value of X]'/> to request acknowledgement of that outbond stanza, and increment X by 1. When receiving an <r/> or <a/> element with an 'h' attribute, all stanzas whose paired value (X at the time of queueing) is less than or equal to the value of 'h' can be removed from the unacknowledged queue.

This is enough of an implementation to minimally satisfy the peer, and allows basic tracking of each outbound stanza. If the stream connection is broken, the application has a queue of unacknowledged stanzas that it can choose to handle appropriately (e.g., warned a human user or silently sending on reconnect).

The following examples illustrate basic acking.

First, after authentication and resource binding, the client enables stream management.

]]>

The server then enables stream management.

]]>

The client then retrieves its roster and immediately sends an <r/> element to request acknowledgement.

]]>

The server returns the roster and immediately sends an <a/> element to acknowledge handling of the stanza.

]]>

The client then sends initial presence and immediately sends an <r/> element to request acknowledgement, incrementing the 'u' attribute by one.

]]>

The server handles the stanza by broadcasting the user's presence and immediately sends an <a/> element to acknowledge handling of the stanza.

]]>

The client then sends an outbound message and an <r/> element.

ciao! ]]>

The server handles the stanza by routing it to the remote contact and immediately sends an <a/> element to acknowledge handling of the stanza.

]]>

And so on.

Most XMPP servers include rate limiting functionality (often called "karma") to prevent connected clients from generating too much outbound traffic. A connected client might also have a "thin pipe" that prevents it from receiving too much inbound traffic. Both of these scenarios can be more successfully handled using stream management, because the server can inform the client to throttle the generation of outbound stanzas or determine that the delivery of inbound stanzas is throttled.

In the first scenario we assume that Romeo tries to send a file to Juliet via in-band bytestreams.

4k-of-base64-encoded-data-here ]]>

His server handles the stanza by routing it to the remote contact and immediately sends an <a/> element to acknowledge handling of the stanza.

]]>

After Romeo sends a number of 4k IBB blocks, the montague.net server starts to enforce rate limiting, so it does not immediately acknowledge handling of the stanzas.

4k-of-base64-encoded-data-here ]]>

Because the server has not responded after some period of time, Romeo pings the server.

]]>

As mentioned, even though Romeo's server is throttling his outbound stanzas (i.e., not handling them by routing them to the remote contact), it replies by sending a <pong/> element. In this case the server informs Romeo why it is not handling his stanzas by including a &policy; condition because Romeo is violating a service policy on the number of outbound stanzas.

]]>

Romeo's client now knows that the stream is still active but that his server is not processing stanzas because of a policy violation.

In the second scenario we assume that Juliet tries to send a file to Romeo via in-band bytestreams, but that Romeo is on a slow connection via a mobile device.

4k-of-base64-encoded-data-here ]]>

Romeo then sends an outbound IQ-result and also immediately sends an <r/> to request acknowledge that his server has handled the stanza.

]]>

His server handles the stanza by routing it to the remote contact and immediately sends an <a/> element to acknowledge handling of the stanza.

]]>

After Juliet sends a number of 4k IBB blocks, delivery from the montague.net to Remeo's mobile device starts to get backed up, so Romeo receives slow delivery of packets from the server. Therefore Romeo pings the server.

]]>

As mentioned, even though Romeo's server is throttling inbound stanzas intended for him because he is on a slow connection, it replies by sending a <pong/> element. In this case the server informs Romeo why stanza delivery is slowed down by including a &constraint; condition.

]]>

Romeo's client now knows that the stream is still active but that delivery of inbound stanzas is slow because of resource constraints.

To follow.

This XEP requires no interaction with &IANA;.

This specification defines the following XML namespace:

  • urn:xmpp:sm:1

Upon advancement of this specification from a status of Experimental to a status of Draft, the ®ISTRAR; shall add the foregoing namespaces to the registry located at &NAMESPACES;, as described in Section 4 of &xep0053;.

&NSVER;

Upon approval of this specification, the XMPP Registrar shall add 'urn:xmpp:sm:1:feature' to its registry of stream features.

]]>

Thanks to Dave Cridland, Philipp Hancke, Curtis King, Tobias Markmann, Pedro Melo, Robin Redeker, and Mickaël Rémond for their feedback.