%ents; ]>
Jingle This specification defines an XMPP protocol extension for initiating and managing peer-to-peer media sessions between two XMPP entities in a way that is interoperable with existing Internet standards. The protocol provides a pluggable model that enables the core session management semantics to be used for a wide variety of application types (e.g., voice chat, video chat, file sharing) and with a wide variety of transport methods (e.g., TCP, UDP, ICE, application-specific transports). &LEGALNOTICE; 0166 Proposed Standards Track Standards Council XMPP Core TO BE ASSIGNED &scottlu; &joebeda; &stpeter; &robmcqueen; &seanegan; &hildjj; 0.23 2008-01-11 psa

Removed content-accept after content-remove; removed errors for unsupported-content and unsupported-transports since they are handled via session-terminate; clarified handling of responder attribute.

0.22 2007-12-06 psa

Modified session flows for busy, unsupported application formats, and unsupported transport methods to enable separation between Jingle core and distinct modules for applications and transports; moved resource determination recommendations to XEP-208.

0.21 2007-11-27 psa

Further editorial review.

0.20 2007-11-15 psa

Editorial review and consistency check; moved voice chat scenarios to XEP-0167.

0.19 2007-11-13 psa

Added scenario for handling of busy state, including Jingle-specific error code and modified error flow (no longer an instance of decline).

0.18 2007-11-08 psa

Added scenarios for various session flows; clarified handling of content-add, content-modify, and content-remove actions; clarified rules for codec priority.

0.17 2007-06-20 psa

Added <unsupported-info/> error.

0.16 2007-06-06 psa

Clarified resource determination process and updated text to reflect modifications to XEP-0168.

0.15 2007-05-25 psa

Rewrote introduction and moved historical text to separate section.

0.14 2007-04-17 psa

Clarified session lifetime; defined reason attribute and associated registry; further specified conformance requirements.

0.13 2007-03-23 psa/ram

Simplified signalling process and state chart; Removed session-redirect action (use redirect error instead); removed content-decline action; removed transport-* actions (except transport-info for ICE negotiation); removed description-* actions; simplified syntax to allow only one transport per content type; corrected syntax of creator attribute to be either initiator or responder (not JIDs); added profile attribute to content element in order to specify RTP profile in use.

0.12 2006-12-21 psa/ram

Added creator attribute to content element for prevention of race condition; modified spec to use provisional namespace before advancement to Draft (per XEP-0053).

0.11 2006-10-31 psa

Completed clarifications and corrections throughout; added section on Jingle Actions.

0.10 2006-09-29 ram/psa

Made several corrections to the state machines and examples.

0.9 2006-09-08 ram/psa

Further cleaned up state machines and state-related actions.

0.8 2006-08-23 ram/psa

Changed channels to components in line with ICE; changed various action names for consistency; added session-extend and session-reduce actions to add and remove description/transport pairs; added description-modify action; added sender attribute to specify directionality.

0.7 2006-07-17 psa

Added implementation note about handling multiple content types.

0.6 2006-07-12 se/psa

Changed media type to content type.

0.5 2006-03-20 psa/jb

Further clarified state machine diagrams; specified that session accept must include agreed-upon media format and transport description; moved deployment notes to appropriate transport spec.

0.4 2006-03-01 psa/jb

Added glossary; clarified state machines; updated to reflect publication of XEP-0176 and XEP-0177.

0.3 2006-02-24 psa/jb

Provided more detail about modify scenarios; defined media-specific and transport-specific actions and adjusted state machine accordingly.

0.2 2006-02-13 psa/jb

Per agreement among the co-authors, moved transport definition to separate specification, simplified state machine, and made other associated changes to the text, examples, and schemas; also harmonized redirect, decline, and terminate processes.

0.1 2005-12-15 psa

Initial version.

0.0.10 2005-12-11 psa

More fully documented burst mode, connectivity checks, error cases, etc.

0.0.9 2005-12-08 psa

Restructured document flow; provided example of burst mode.

0.0.8 2005-12-05 psa/sl/jb

Distinguished between dribble mode and burst mode, including mode attribute, service discovery features, and implementation notes; provided detailed resource discovery examples; corrected state chart; specified session termination; specified error conditions; specified semantics of informational messages; began to define security considerations; added Joe Beda as co-author.

0.0.7 2005-11-08 psa

Added more detail to basic session flow; harmonized candidate negotiation process with ICE.

0.0.6 2005-10-27 psa

Added XMPP Registrar considerations; defined schema; completed slight syntax cleanup.

0.0.5 2005-10-21 psa/sl

Separated method description formats from signalling protocol.

0.0.4 2005-10-19 psa/sl

Harmonized basic session flow with Google Talk protocol; added Scott Ludwig as co-author.

0.0.3 2005-10-10 psa

Added more detail to basic session flow.

0.0.2 2005-10-07 psa/jjh

Protocol cleanup.

0.0.1 2005-10-06 psa/jjh

First draft.

The purpose of Jingle is to enable one-to-one, peer-to-peer media sessions between XMPP entities, where the negotiation occurs over the XMPP "channel" and the media is exchanged outside the XMPP channel using technologies such as the Real-time Transport Protocol (RTP; &rfc3550;), the User Datagram Protocol (UDP; &rfc0768;), and &ice;.

One target application for Jingle is simple voice chat (see &xep0167;). We stress the word "simple". The purpose of Jingle is not to build a full-fledged telephony application that supports call waiting, call forwarding, call transfer, hold music, IVR systems, find-me-follow-me functionality, conference calls, and the like. These features are of interest to some user populations, but adding support for them to the core Jingle layer would introduce unnecessary complexity into a technology that is designed for basic multimedia interaction.

The purpose of Jingle is not to supplant or replace technologies based on Session Initiation Protocol (SIP; &rfc3261;). Because dual-stack XMPP+SIP clients are difficult to build, Jingle was designed as a pure XMPP signalling protocol. However, Jingle is at the same time designed to interwork with SIP so that the millions of deployed XMPP clients can be added onto existing Voice over Internet Protocol (VoIP) networks, rather than limiting XMPP users to a separate and distinct network.

Jingle is designed in a modular way so that developers can easily add support for multimedia session types other than voice chat, such as video chat (see &xep0180;), application sharing, file sharing, collaborative editing, whiteboarding, and torrent broadcasting. The transport methods are also modular, so that Jingle implementations can use any appropriate media transport (including proprietary methods not standardized through the XMPP Standards Foundation).

This section provides a friendly introduction to Jingle.

In essence, Jingle enables two XMPP entities (e.g., romeo@montague.lit and juliet@capulet.lit) to set up, manage, and tear down a multimedia session. The negotiation takes place over XMPP, and the media transfer takes place outside of XMPP. The simplest session flow is as follows:

| | ack | |<----------------------------| | session-accept | |<----------------------------| | ack | |---------------------------->| | MEDIA SESSION | |<===========================>| | session-terminate | |<----------------------------| | ack | |---------------------------->| | | ]]>

Naturally, more complex scenarios are probable; such scenarios are described in other specifications, such as XEP-0167 for voice chat.

The simplest flow might happen as follows. The example is that of a voice chat (see XEP-0167) initiated by Romeo, where the transport is &xep0177;.

]]> ]]>

If the proposed session is acceptable, the responder definitively accepts it.

]]>

The initiator then acknowledges the session-accept action.

]]>

The parties then begin to exchange media. In this case they would exchange audio using the Speex codec at a clockrate of 8000 since that is the highest-priority codec for the responder (as determined by the XML order of the &PAYLOADTYPE; children).

The parties may continue the session as long as desired.

Eventually, one of the parties terminates the session.

]]>

The other party MUST then acknowledge termination of the session.

]]>

The protocol defined herein is designed to meet the following requirements:

  1. Make it possible to manage a wide variety of peer-to-peer sessions (including but not limited to voice and video) within XMPP.
  2. When a peer-to-peer connection cannot be negotiated, make it possible to fall back to relayed communications.
  3. Clearly separate the signalling channel (XMPP) from the data channel.
  4. Clearly separate the application formats (e.g., video) from the transport methods (e.g., RTP).
  5. Make it possible to add, modify, and remove both application types and transport methods in relation to an existing session.
  6. Make it relatively easy to implement support for the protocol in standard Jabber/XMPP clients.
  7. Where communication with non-XMPP entities is needed, push as much complexity as possible onto server-side gateways between the XMPP network and the non-XMPP network.

This document defines the signalling protocol only. Additional documents specify the following:

Term Definition
Application Format The data format of the content type being established, which formally declares one purpose of the session (e.g., "voice" or "video"). This is the 'what' of the session (i.e., the bits to be transferred), such as the acceptable codecs when establishing a voice conversation. In Jingle XML syntax the application format is the namespace of the &DESCRIPTION; element.
Component A numbered stream of data that needs to be transmitted between the endpoints for a given content type in the context of a given session. It is up to the transport to negotiate the details of each component. Depending on the content type, multiple components may be needed (e.g., two components might be needed, one to transmit an RTP stream and another to transmit RTCP timing information).
Content Type A pair formed by the combination of one application format and one transport method.
Session One or more content types negotiated between two entities. It is delimited in time by an initiate request and session ending events. During the lifetime of a session, content types can be added or removed. A session consists of at least one content type at a time.
Transport Method The method for establishing data stream(s) between entities. Possible transports might include ICE-TCP, Raw UDP, inband data, etc. This is the 'how' of the session. In Jingle XML syntax this is the namespace of the &TRANSPORT; element. The transport method defines how to transfer bits from one host to another. Each transport method must specify whether it is lossy (thus suitable for applications where some packet loss is tolerable) or reliable (thus suitable for applications where packet loss is not tolerable).

In diagrams, the following conventions are used:

  • Dashed lines (---) represent Jingle stanzas that are sent via the XMPP signalling channel.
  • Double-dashed lines (===) represent media packets that are sent via the non-XMPP media channel.

Jingle consists of three parts, each with its own syntax and semantics:

  1. Overall session management
  2. Application types (the "what")
  3. Transport methods (the "how")

This document defines the semantics and syntax for overall session management. It also provides pluggable "slots" for application formats and transport methods, which are specified in separate documents.

At the most basic level, the process for negotiating a Jingle session is as follows:

  1. One user (the "initator") sends to another user (the "responder") a session request with at least one content type.
  2. If the responder wants to proceed, it acknowledges the session initiation request by sending an IQ result.
  3. Both the initiator and responder start exchanging possible transport candidates as quickly as possible (these are sent in quick succession before further negotiation in order to minimize the time required before media data can flow).
  4. These candidates are checked for connectivity.
  5. As soon as the responder finds a candidate over which media can flow, the responder sends to the initiator a "session-accept" action.
  6. The parties start sending media over the negotiated candidate.

If the parties later discover a better candidate, they perform a "content-modify" negotiation and then switch to the better candidate. Naturally they can also modify various other parameters related to the session (e.g., adding video to a voice chat).

The state machine for overall session management (i.e., the state per Session ID) is as follows:

o | | session-initiate | | +-----------------------+ |/ | PENDING o---------------------+ | | | content-accept, | | | | content-modify, | | | | content-remove, | | | | session-info, | | | | transport-info | | | +------------------+ | | | | session-accept | | | ACTIVE o---------------------+ | | | content-accept, | | | | content-add, | | | | content-modify, | | | | content-remove, | | | | session-info, | | | | transport-info | | | +------------------+ | | | +-------------------------+ | | session-terminate | o ENDED

There are three overall session states:

  1. PENDING
  2. ACTIVE
  3. ENDED

The actions related to management of the overall Jingle session are as follows:

Action Description
content-accept Accept a content-add or content-modify action received from another party.
content-add Add one or more new content types to the session. The sender MUST specify only the added content-type(s), not the added content-type(s) plus the existing content-type(s). Therefore it is the responsibility of the recipient to maintain a local copy of the current content type definition. This action MUST NOT be sent while the session is in the PENDING state. When a party sends a content-add, it MUST ignore any actions received from the other party until it receives acknowledgement of the content-add. In the event that a session contains two unidirectional streams of the same type because a content-add was issued simultaneously by both parties, it is RECOMMENDED that participants close the duplicate stream in favour of that created by the session initiator, which should be made bidirectional with a 'content-modify' action by the responder.
content-modify Change an existing content type. The sender SHOULD specify only the aspects for which a modification is desired (e.g., if a party wishes to change only the profile then it would send an empty <content/> element with a modified value for the 'profile' attribute; if a party wishes to change only the transport, then it would send a <content/> element that contains only a <transport/> child; etc.). Therefore it is the responsibility of the recipient to maintain a local copy of the current content type definition. The recipient MUST NOT reply to a content-modify action with another content-modify action. If both parties send modify messages at the same time, the modify message from the session initiator MUST trump the modify message from the recipient and the initiator SHOULD return an &unexpected; error to the other party.
content-remove Remove one or more content types from the session. The sender MUST specify only the removed content-type(s), not the removed content-type(s) plus the remaining content-type(s). Therefore it is the responsibility of the recipient to maintain a local copy of the current content type definition. The recipient SHOULD NOT send a content-accept with the new content type definition after receiving a content-remove. A client MUST NOT return an error upon receipt of a 'content-remove' action for a content type that is received after a 'content-remove' action has been sent but before the action has been acknowledged by the peer. If the content-remove results in zero content types for the session, the entity that receives the content-remove SHOULD send a session-terminate action to the other party (since a session with no content types is void).
session-accept Definitively accept a session negotiation (implicitly this action also serves as a content-accept).
session-info Send session-level information / messages, such as (for Jingle audio) a ringing message.
session-initiate Request negotiation of a new Jingle session.
session-terminate End an existing session.
transport-info Exchange transport candidates; it is mainly used in XEP-0176 but may be used in other transport specifications.

This section defines the high-level flow of a Jingle session. More detailed descriptions are provided in the specifications for Jingle application formats and transport methods.

In order to initiate a Jingle session, the initiator must determine which of the responder's XMPP resources is best for the desired application format. Methods for doing so are out of scope for this specification. However, see the Determining Support section of this document (and associated specifications) for relevant information.

Once the initiator has discovered which of the responder's XMPP resources is ideal for the desired application format, it sends a session initiation request to the responder. This request is an IQ-set containing a &JINGLE; element qualified by the 'http://www.xmpp.org/extensions/xep-0166.html#ns' namespace &NSNOTE;, where the value of the 'action' attribute is "session-initiate" and where the &JINGLE; element contains one or more &CONTENT; elements. Each &CONTENT; element defines a content type to be transferred during the session, and each &CONTENT; element in turn contains one &DESCRIPTION; child element that specifies a desired application format and one &TRANSPORT; child element that specifies a potential transport method. If either party wishes to propose the use of multiple transport methods for the same application format, it must send multiple &CONTENT; elements.

Note: The syntax and semantics of the &DESCRIPTION; and &TRANSPORT; elements are out of scope for this specification, since they are defined in related specifications. The syntax and semantics of the &JINGLE; and &CONTENT; elements are specified in this document under Formal Definition.

Note: In order to expedite session establishment, the initiator MAY send transport candidates (e.g., for negotiation of the ICE transport) immediately after sending the "session-initiate" message and before receiving acknowledgement from the responder (i.e., the initiator MUST consider the session to be PENDING even before receiving acknowledgement). Given in-order delivery, the responder should receive such "transport-info" messages after receiving the "session-initiate" message (if not, it is appropriate for the responder to return <unknown-session/> errors since according to its state machine the session does not exist).

]]>

Unless one of the following errors occurs, the responder SHOULD acknowledge receipt of the initiation request.

]]>

However, after acknowledging the session initiation request, the responder may subsequently determine that it cannot proceed with negotiation of the session (e.g., because it does not support any of the offered application formats or transport methods, because a human user is busy or unable to accept the session, because a human user wishes to formally decline the session, etc.). In these cases, the responder SHOULD immediately acknowledge the session initiation request but then terminate the session with an appropriate reasoncode as described in the Termination section of this document.

If the responder acknowledges receipt of the initation request, both parties must consider the session to be in the PENDING state.

There are several reasons why the responder might immediately return an error instead of acknowledging receipt of the initiation request:

  • The initiator is unknown to the responder and the responder does not communicate with unknown entities.
  • The responder does not support Jingle.
  • The responder wishes to redirect the request to another address.
  • The responder does not have sufficient resources to participate in a session.
  • The initiation request was malformed.

If the initiator is unknown to the responder (e.g., via presence subscription as defined in &rfc3921; or stanza session negotiation as defined in &xep0155;) and the responder has a policy of not communicating via Jingle with unknown entities, it SHOULD return a &unavailable; error.

]]>

If the responder does not support Jingle, it MUST return a &unavailable; error.

]]>

If the responder wishes to redirect the request to another address, it SHOULD return a &redirect; error.

voicemail@capulet.lit ]]>

If the responder does not have sufficient resources to participate in a session, it SHOULD return a &constraint; error.

]]>

If the initiation request was malformed, the responder MUST return a &badrequest; error.

]]>

In general, negotiation will be necessary before the parties can agree on an acceptable set of application formats and transport methods. There are many potential parameter combinations, as defined in the relevant specifications for various application formats and transport methods.

The allowable negotiations (including content-level and transport-level negotiations) are as follows:

  • Adding a content type via the content-add action (not allowed in the PENDING state).
  • Modifying a content type via the content-modify action.
  • Removing a content type via the content-remove action.
  • Exchanging transport methods via the transport-info action.

If (after negotiation of transport methods and application formats as well as checking of transport candidates) the responder determines that it will be able to establish a connection, it sends a definitive acceptance to the initiator.

Note: In the session-accept stanza, the &JINGLE; element MUST contain one or more <content/> elements, each of which MUST contain one <description/> element and one <transport/> element. The &JINGLE; element SHOULD possess a 'responder' attribute that explicitly specifies the full JID of the responding entity; after sending acknowledgement of the session-accept, the initiator SHOULD send all future commmunications about this Jingle session to the JID provided in the 'responder' attribute and note the new JID in the user interface.

The initiator then acknowledges the responder's definitive acceptance, after which the parties can exchange media over the negotiated connection.

If one of the parties cannot find a suitable transport method or candidate, it SHOULD terminate the session as described below.

Once a session is in the ACTIVE state, it may be modified via a content-add, content-modify, content-remove, or transport-info action. Examples of such modifications are shown in the specifications for various application formats and transport methods.

In order to gracefully end the session (which MAY be done at any point after acknowledging receipt of the initiation request, including immediately thereafter in order to decline the request), either the responder or the initiator MUST send a "terminate" action to the other party.

The party that terminates the session SHOULD include a 'reasoncode' attribute that specifies why the session is being terminated. Examples follow.

One reason for terminating the session is that the terminating party is busy; in this case, the recommended reasoncode is "busy".

]]>

Another reason for terminating the session is that the terminating party wishes to formally decline the session; in this case, the recommended reasoncode is "decline".

]]>

Another reason for terminating the session is that the terminating party does not support any of the offered application formats; in this case, the recommended reasoncode is "unsupported-applications".

]]>

Another reason for terminating the session is that the terminating party does not support any of the offered transport methods; in this case, the recommended reasoncode is "unsupported-transports".

]]>

Upon receiving an action of "session-terminate", the other party MUST then acknowledge termination of the session:

]]>

Note: As soon as an entity sends a "session-terminate" action, it MUST consider the session to be in the ENDED state (even before receiving acknowledgement from the other party). If the terminating entity receives additional Jingle-related IQ-sets from the other party after sending the "session-terminate" action, it MUST reply with an <unknown-session/> error.

Unfortunately, not all sessions end gracefully. In applications of Jingle that also involve the exchange of presence information, receipt of &UNAVAILABLE; from the other party MAY be considered a session-ending event. However, in this case there is nothing for the party to acknowledge.

At any point after initiation of a Jingle session, either entity MAY send an informational message to the other party, for example to change a transport method, inform the other party that a device is ringing or that a scheduled event has occurred or will occur, etc.

An informational message MUST be an IQ-set containing a &JINGLE; element whose 'action' attribute is set to a value of "session-info" or "transport-info"; the &JINGLE; element MUST further contain a payload child element (specific to the session or to a transport method) that specifies the information being communicated. If the party that receives an informational message does not understand the payload, it MUST return a &feature; error with a Jingle-specific error condition of <unsupported-info/>.

]]>

If either party receives an empty "session-info" message for an active session, it MUST send an empty IQ result; this way, an empty "session-info" message may be used as a "ping" to determine session vitality.

Informational messages are specific to a particular application type or transport method and therefore are described in specifications other than this one.

The &JINGLE; element MAY be empty or contain one or more &CONTENT; elements (for which see Content Element).

The attributes of the &JINGLE; element are as follows.

Attribute Definition Inclusion
action A Jingle action as listed in this document (e.g., "session-terminate"). REQUIRED
initiator The full JID of the entity that has initiated the session flow (which may be different from the 'from' address on the IQ-set). REQUIRED
reasoncode A machine-readable purpose for the action being sent (e.g., "connectivity-error" for a session-terminate action). OPTIONAL
reasontext A human-readable purpose for the action being sent (e.g., "Sorry, gotta go!" for a session-terminate action). OPTIONAL
responder The full JID of the entity that has replied to the initiation, which may be different from the 'to' address on the IQ-set. RECOMMENDED
sid A random session identifier generated by the initiator, which effectively maps to the local-part of a SIP "Call-ID" parameter; this SHOULD match the XML Nmtoken production See <http://www.w3.org/TR/2000/WD-xml-2e-20000814#NT-Nmtoken> so that XML character escaping is not needed for characters such as '&'. REQUIRED

The attributes of the &CONTENT; element are as follows.

Attribute Definition Inclusion
creator Which party originally generated the content type (used to prevent race conditions regarding modifications). REQUIRED
name A unique name or identifier for the content type (this identifier is opaque and does not have semantic meaning). REQUIRED
profile The profile in use (e.g., "RTP/AVP" in the context of the Real-time Transport Protocol). RECOMMENDED
senders Which parties in the session will be generating content; the allowable values are "initiator", "responder", or "both" (where "both" is the default). RECOMMENDED

The Jingle-specific error conditions are as follows. These condition elements are qualified by the 'http://www.xmpp.org/extensions/xep-0166.html#ns-errors' namespace &NSNOTE;.

Jingle Condition XMPP Condition Description
<out-of-order/> &unexpected; The request cannot occur at this point in the state machine (e.g., session-initiate after session-accept).
<unknown-session/> &badrequest; The 'sid' attribute specifies a session that is unknown to the recipient (e.g., no longer live according to the recipient's state machine because the recipient previously terminated the session).
<unsupported-info/> &feature; The recipient does not support the informational payload of a session-info message.

If an entity supports Jingle, it MUST advertise that fact by returning a feature of "http://www.xmpp.org/extensions/xep-0166.html#ns" &NSNOTE; in response to a &xep0030; information request. The response MUST also include features for the application formats and transport methods supported by the responding entity, as described in the relevant specifications.

]]> ... ... ]]>

Naturally, support MAY also be determined via the dynamic, presence-based profile of Service Discovery defined in XEP-0115.

A document that specifies a Jingle application format (e.g., audio via RTP) MUST define:

  1. How successful application format negotiation occurs for encapsulation into Jingle.
  2. A &DESCRIPTION; element and associated semantics for representing the application format.
  3. If and how the application format can be mapped to the Session Description Protocol, including the appropriate SDP media type (see Section 8.2.1 of RFC 4566).
  4. Whether the media for the application format should be sent over a reliable transport method or a lossy transport method (or, if both, which is preferred).
  5. Exactly how the media is to be sent and received over a reliable or lossy transport.

A document that specifies a Jingle transport method (e.g., Raw UDP) MUST define:

  1. How successful transport negotiation occurs for encapsulation into Jingle.
  2. A &TRANSPORT; element and associated semantics for representing the transport method.
  3. Whether the transport is reliable or lossy.
  4. If and how the transport handles components as defined herein (e.g., for the Real Time Control Protocol).

Jingle sessions may be resource-intensive. Therefore, it is possible to launch a denial-of-service attack against an entity by burdening it with too many Jingle sessions. Care must be taken to accept sessions only from known entities and only if the entity's device is able to process such sessions.

Jingle communications may be enabled through gateways to non-XMPP networks, whose security characteristics may be quite different from those of XMPP networks. (For example, on some SIP networks authentication is optional and "from" addresses can be easily forged.) Care must be taken in communicating through such gateways.

Mere negotiation of a Jingle session may expose sensitive information about the parties (e.g., IP addresses). Care must be taken in communicating such information, and end-to-end encryption should be used if the parties do not trust the intermediate servers or gateways.

This document requires no interaction with &IANA;.

Until this specification advances to a status of Draft, its associated namespaces shall be:

  • http://www.xmpp.org/extensions/xep-0166.html#ns
  • http://www.xmpp.org/extensions/xep-0166.html#ns-errors

Upon advancement of this specification, the ®ISTRAR; shall issue permanent namespaces in accordance with the process defined in Section 4 of &xep0053;.

The following namespaces are requested, and are thought to be unique per the XMPP Registrar's requirements:

  • urn:xmpp:jingle
  • urn:xmpp:jingle:errors

The XMPP Registrar shall maintain a registry of Jingle application formats. All application format registrations shall be defined in separate specifications (not in this document). Application types defined within the XEP series MUST be registered with the XMPP Registrar, resulting in protocol URNs of the form "urn:xmpp:jingle:app:name" (where "name" is the registered name of the application format).

®PROCESS; the name of the application format a natural-language summary of the application format whether the media can be sent over a "reliable" transport, a "lossy" transport, or "both" the document in which this application format is specified ]]>

The XMPP Registrar shall maintain a registry of Jingle transport methods. All transport method registrations shall be defined in separate specifications (not in this document). Transport methods defined within the XEP series MUST be registered with the XMPP Registrar, resulting in protocol URNs of the form "urn:xmpp:jingle:transport:name" (where "name" is the registered name of the transport method).

®PROCESS; the name of the transport method a natural-language summary of the transport method whether the transport method can be "reliable", "lossy", or "both" the document in which this transport method is specified ]]>

The XMPP Registrar shall maintain a registry of reason codes related to Jingle actions.

®PROCESS; the value of the 'reasoncode' attribute a natural-language summary of the reason code the document in which this reason code is specified ]]>

The following submission registers reasoncodes currently in use. Refer to the registry itself for a complete and current list of reasoncodes.

busy the party is busy XEP-0166 decline the party wishes to formally decline the session XEP-0166 connectivity-error the action is related to connectivity problems XEP-0166 general-error the action is related to a non-specific application error XEP-0166 media-error the action is related to media processing problems XEP-0166 no-error the action is generated during the normal course of state management XEP-0166 unsupported-applications the party supports none of the offered application formats XEP-0166 unsupported-transports the party supports none of the offered transport methods XEP-0166 ]]>
]]> ]]>

Until Jingle was developed, there existed no widely-adopted standard for initiating and managing peer-to-peer interactions between XMPP entities. Although several large service providers and Jabber client teams had written and implemented their own proprietary XMPP extensions for peer-to-peer signalling (usually only for voice), those technologies were not open and did not always take into account requirements to interoperate with SIP-based technologies. The only existing open protocol was &xep0111;, which made it possible to initiate and manage peer-to-peer sessions, but which did not provide enough of the key signalling semantics to be easily implemented in Jabber/XMPP clients. It is true that TINS made it relatively easy to implement an XMPP-to-SIP gateway; however, in line with the long-time Jabber philosophy of "simple clients, complex servers", it would be better to force complexity onto the server-side gateway and to keep the client as simple as possible.

The result was an unfortunate fragmentation within the XMPP community regarding signalling protocols. Essentially, there were two possible approaches to solving the problem:

  1. Recommend that all client developers implement a dual-stack (XMPP + SIP) solution.
  2. Define a full-featured protocol for XMPP signalling.

Implementation experience indicates that a dual-stack approach may not be feasible on all the computing platforms for which Jabber clients have been written, or even desirable on platforms where it is feasible. For example, one large ISP decided to switch to a pure XMPP approach after having implemented and deployed a dual-stack client for several years. Therefore, it seemed reasonable to define an XMPP signalling protocol that could provide the necessary session management semantics while also making it relatively straightforward to interoperate with existing Internet standards.

As a result of feedback received on XEP-0111, the original authors of this document (Joe Hildebrand and Peter Saint-Andre) began to define such a signalling protocol, code-named Jingle. Upon communication with members of the Google Talk team, Google Talk is a messaging and voice chat service and client provided by Google; see <http://www.google.com/talk/>. it was discovered that the emerging Jingle approach was conceptually (and even syntactically) quite similar to the signalling protocol used in the Google Talk application. Therefore, in the interest of interoperability and adoption, we decided to harmonize the two approaches. The signalling protocol specified herein is, therefore, substantially equivalent to the original Google Talk protocol, with several adjustments based on feedback received from implementors as well as for publication by the XMPP Standards Foundation.

The authors would like to thank Rohan Mahy for his valuable input on early versions of this document. Thiago Camargo, Dafydd Harries, Antti Ijäs, Lauri Kaila, Justin Karneges, Jussi Laako, Anthony Minessale, Matt O'Gorman, Rob Taylor, Matt Tucker, Saku Vainio, Brian West, and others have also provided helpful input. Thanks also to those who have commented on the &SSIG; and (earlier) Jingle Before this specification was formally accepted by the XMPP Standards Foundation as an XMPP Extension Protocol, it was discussed on the semi-private <jingle@jabber.org> mailing list; although that list is no longer used since the standards@xmpp.org list is the preferred discussion venue, for historical purposes it is publicly archived at <http://mail.jabber.org/pipermail/jingle/>. mailing lists.