S/MIME Update discussions based upon introduction of Digital Signatures in XMPP. Initial published version as accepted for publication by the XMPP Council. Proto-XEP draft. Digital signatures may be used to provide a number of security services, including message
authentication, message integrity and non-repudiation. There are many use cases for Digital
Signatures in the Extensible Messaging and Presence Protocol (&xmpp;). XMPP can be described as a mean for exchanging structured information or stanzas between two
or more entities. To accomplish this exchange, a number of other entities may be involved. For
instance, communication of a stanza between two client entities will typically involve one or
more server entities. Entities may exchange stanzas through service entities, such as a chat
room service, to effect one-to-many communications. Any entity involved in the exchange of a stanza may have wish to include one or more digital
signatures for the benefit of any entity involved in the exchange: Digital signatures are provided to serve specific purposes. These purposes might include
authentication of a particular entity involved in the exchange and integrity of information
that entity provided. This document discusses considerations for the design of
general-purpose digital signature extension for XMPP. The document discusses use cases and requirements, as well as explores the
solution space. The document also discusses existing solutions in
this area. This document contains a numerous examples intended to aide in the discussion of design
issues. The examples are examples generally abbreviated and often use informal syntaxes. Directed one-to-one stanzas are stanzas which are exchanged between two entities, the
originator of the stanza and intended recipient of that stanza, without exchanging through
services which provide re-direction of stanzas (such as a groupchat service). The stanza may
be handled by one or more other entities. Examples of directed one-to-one stanzas include chat &MESSAGE; used in one-to-one chat
sessions and &IQ; stanzas (excepting those exchanged through services providing re-direction). The originator may wish to provide a signature for the benefit of the intended recipient.
The intended recipient could use this signature to authenticate the originator and to ensure
integrity of originator provided information. Entities handling the stanza may wish to provide a signature for the benefit of the
intended recipient. For instance, where a originator is a client and does not provide a
signature, the client's server may wish to provide a signature for the benefit of the
intended recipient. The intended recipient could use this signature to authenticate this
server and to ensure integrity of the information as forwarded by this server. Redirected one-to-one stanzas which are exchanged between two entities, the originator of
the stanza and intended recipient of that stanza, through a service which provides
re-direction of stanzas. The stanza may be handled by one or more other entities. A multi-user chat (MUC) 'private message' is an example of redirected one-to-one
stanza. The originator's server may wish to provide a signature for the benefit of the re-direction
service. The service could use this signature to authenticate the originator and to ensure
integrity of originator provided information. The originator may wish to provide a signature for the benefit of the intended recipient.
The intended recipient could use this signature to authenticate the originator and to ensure
integrity of originator provided information. However, this signature would by itself not
establish any relationship between the signer and 'from' address in the stanza as received,
nor does it establish this signature establish that the stanza was processed by the
re-direction service. As in the directed one-to-one
stanza, a originating client's server may wish to provide a signature for the
benefit of the intended recipient. The re-direction service may wish to provide a signature for the benefit of the intended
recipient. The intended recipient could use this signature to authenticate the service and
hence establish the service processed the stanza. The intended recipient could also use the
signature to ensure the integrity of the information as redirected by the service. Redirected one-to-many stanzas which are exchanged between two or more entities, the
originator of the stanza and a group of recipients, through a service which provides
re-direction of stanzas of a single stanza to a set of recipients. The stanza may be handled
by one or more other entities. A multi-user chat (MUC) message to all occupants is an example of redirected one-to-group
stanza. The originator's server may wish to provide a signature for the benefit of the re-direction
service. The service could use this signature to authenticate the originator and to ensure
integrity of originator provided information. The originator may wish to provide a signature for the benefit of each recipient in the
group. Each recipient could use this signature to authenticate the originator and to ensure
integrity of originator provided information. However, this signature would by itself not
establish any relationship between the signer and the 'from' address in the stanza as
received, nor does it establish this signature establish that the stanza was processed by
the re-direction service. As in the directed one-to-one
stanza, a originating client's server may wish to provide a signature for the
benefit of the each recipient. The re-direction service may wish to provide a signature for the benefit of each recipient
in the group. Each recipient could use this signature to authenticate the service and hence
establish the service processed the stanza. Each could also use the signature to ensure the
integrity of the information as redirected by the service. The presence can be viewed as a specialized "publish-subscribe" mechanism. Commonly the
publishing entity sends a &PRESENCE; stanza to a presence service and the presence service
than forwards the stanza to each subscriber. In basic user presence, the publishing entity
is the user's client and the presence service is presence service is the provided by this
client's server. In this case, the 'to' address is empty. The publisher may wish to sign the signature for the benefit of each subscriber. Each
subscriber could use this signature to authenticate the publisher and to ensure integrity of
publisher provided information. The presence service may wish to provide a signature for the benefit of each subscriber.
Each subscriber could use this signature to authenticate the service and hence establish the
service processed the stanza. Each could also use the signature to ensure the integrity of
the information as redirected by the service. A presence stanza may also directed to another entity, possibly through a re-direction
service. This use is similar to the directed one-to-one
and redirected one-to-one cases detailed above. For the purposes of this memo, the following requirements are stipulated for a general
solution: Some of above requirements may well be, if not outright mutually exclusive, in opposition to
each other. It is suspected that set of reasonable solutions meeting all of the above
requirements may be empty. To produce a reasonable solution, it is expected that some of the
above requirements be eliminated and hence limiting the solution to some subset of the
applications of digital signatures in XMPP. The &IETF; standardized a signing and encryption facility for XMPP known as &xmppe2e;. XMPP
E2E is based upon Secure/Multipurpose Internet Message Extensions (&SMIME;) and the
Cryptographic Message Syntax (&CMS;). As it's name implies, XMPP E2E is intended to be an
end-to-end solution. That is, it enables a sender to sign content sent to a specific
recipient. An advantage of the XMPP E2E approach is that it uses an encapsulating signature which
protects the signed content from alteration as it is exchanged over an XMPP network. A
disadvantage is that implementations which do not support XMPP E2E cannot make use of the
signed content. At the time of this writing, XMPP E2E has not been widely implemented. XMPP E2E appears to
have limited applicability. The &xep0285; (XMPP DSIG), like the XMPP E2E, uses an encapsulating
signature to protects the signed content from alteration as it is exchanged over an XMPP
network. XMPP DSIG avoids certain dependencies which are believed to have hindered
implementation of XMPP E2E. It is hoped that the XMPP DSIG will prove to be more viable
solution than XMPP E2E. Like XMPP E2E, XMPP DSIG does not support optimistic signing. At the time of this writing, XMPP DSIG was just introduced. Alternative approaches have been developed. For instance, the Cross Domain Collaborative
Information Environment (&CDCIE;) Client Chat Protocol (&CDCIE-CCP;), an XMPP-based
protocol, supports signing of XMPP stanzas utilizes XML digital signatures (&XMLDSIG;)
"enveloped" signatures over the whole stanza. An advantage of the CDCIE-CCP approach is that, because it uses an encapsulated signature,
implementations need not support CDCIE-CPP to make use of the stanza. The disadvantage is
that the signature always over the entire stanza. Alteration of the stanza, as is common
(often required) when exchanging stanzas over an XMPP network, will invalidate the
signature. While this approach has been implemented and deployed to some extent, the approach appears
to have applicability limited to the CDCIE. An encapsulating signature is a signature approach that encapsulates the signed content
within the signature syntax. An encapsulated signature is a signature approach where the
signature syntax in encapsulated within the structure of the signed content. XMPP E2E and
XMPP DSIG are examples of the former. CDCIE-CCP is an example of the latter. The following example illustrates, using pseudo language, an encapsulating signature over a
&MESSAGE; stanza. To transfer a signed &MESSAGE; using an encapsulating signature, one needs to send it
within &MESSAGE; stanza. The following example illustrates, using pseudo language, an encapsulated signature over a
&MESSAGE; stanza. Applicability of a simple (non-nesting) encapsulating signatures, such as in XMPP E2E and
XMPP DSIG, are generally limited to end-to-end use cases. That is, cases where the
originator of a stanza signs the stanza and send it through the XMPP network to its intended
recipient, and only the intended recipient is expected to make use of the signed content.
Entities between the signer and the intended recipient are expected to forward of the stanza
without regard to the encapsulating signature, and without themselves signing the stanza.
The approach does not require forwarding entities to support the signing extension. Simple encapsulating signatures have limited applicability in MUC and PubSub use cases. For
instance, an occupant can sign its submissions to the service for the benefit of the service
and the service can sign reflected stanzas to occupants. In providing non-anonymous chat
rooms, in addition to signing the reflected content, the service should include and sign the
stanza it received when it was signed. This allows the occupants verify the content the
service purports to have received, and to determine whether the reflected content is
consistent given this. The following example illustrates an encapsulating signature over a
groupchat &MESSAGE; stanza. The following examples illustrates the signed reflection of the above stanza. In encapsulated signature solutions, as in CDCIE-CCP, any entities can make use of the
signed content even if they do not support the signing extension. If the signature is over
the entire stanza, as in CDCIE-CCP, the signature is likely not to be valid when the stanza
is passed through multiple entities prior to verification. Hence, when the signature is over
the entire stanza, the encapsulating signature approach applicability is generally limited
to cases where there no entities between the signer and verifier. However, as discussed
below, encapsulated selective signatures are generally
more applicable. While an entity could provide a signature to be over the entire stanza, such signatures are
likely be invalidated as the stanza exchanged over the XMPP. This is because XMPP allows
and, in many cases, requires stanza to be modified as they are forwarded. For instance, a client with the JID "juliet@example.com/Balcony" might send the following
signed stanza: The example.com server is required, per &rfc3920;, to add a 'from' attribute to the
&MESSAGE; element before forwarding it to the example.net server. The example.net server is
required to replace the 'to' attribute with the full JID of the romeo@example.net client it
intends to forward the message to. These alternatations will "break" the signature. XMLDSIG provides for a facility to selective sign XML content. For instance, the client
could sign the &SUBJECT; and &BODY; element and their content. However, this by itself would
not cover key aspects of the stanza, such that it was a chat &MESSAGE; addressed to a
particular JID and sent from a particular JID. XMLDSIG allows for enveloping signatures,
that is a signature that signs a data object contained within the &SIGNATURE; element. The
solution could define an element, such as &XMPPprop; used below, for including properties of
the stanza in the signature. The signature in Example 1 does not provide any protection against replay attack. To
address replay attack, as well as other concerns, XMLDSIG defines the &SIGNATUREPROPERTIES;
element for including information items about the generation of the Signature, such as the
date/time the signature was generated. While one could have &SIGNATURE; which included a &REFERENCE; element for each of four
elements discussed above within its &SIGNEDINFO; element, this would require reference
validation for each &REFERENCE; (See 2.3 of XMLDSIG). To provide greater flexibility over
handling of absent references and broken digest values, a &MANIFEST; can be constructed and
only it signed. Putting all of the above together, the client might send the following signed stanza: The signature references needs to unambiguously identify content in stanza even in face of
subsequent modification of that stanza. Failure to unambiguously identify signed content
would also be problematic. In the above example, signed child elements of the stanza were identified by 'id'
attribute. As stanzas may be forwarded into any XMPP stream, such identifiers needs to
remain unique. Use of an extension attribute to identify elements may be problematic. In particular, the
XMPP specifications provide no assurance that this attribute would be forwarded with
element. While one could identify signed content by other means, such as &XPointer;, these
means would not unambiguously identify the signed content in the face of subsequent stanza
modification. The an 'id' attribute is could be used (or possibly 'xml:id'), it may be appropriate for
the XMPP entity inserting a child element into a stanza to provide an 'xml:id' attribute
regardless of what stanza content it might sign. Multiple entities can sign a stanza. A single entity may sign a stanza multiple times,
typically on different occasions. Each signer simply adds their &SIGNATURE; element to the stanza, typically as the last
element. A &SIGNATURE; may sign other signatures, or portions thereof. While a simple chat &MESSAGE; typically transits through only one or two XMPP servers and a
groupchat &MESSAGE; may typically transits one to three XMPP servers, a stanza might include
far more than four &SIGNATURE; elements. Some users design the ability to optimistic signing of stanzas. That is, to sign
all stanzas adhere to a configured criteria, such as all &MESSAGE; stanzas, they send. A key
aspect of optimistic signing is that receiving entities not supporting the signing
extension should be able to make use the message content (excluding the signature
information) while those receiving entities supporting the extension can make use of the
message content and the signature information. Optimistic signing is available in E-mail through the use of S/MIME detached signatures.
Use of S/MIME detached signatures can be problematic. Mail systems, especially restribution
services such as mailing lists, are notorious for changing the signed content and hence
breaking the signature. In XMPP, as stanzas are generally altered in transit and hence optimistic signing will be
fragile at best. Through use of selective signing and manifesting, issues may be mitigated
to some degree. It is doubtful that a solution exists that provides optimistic signing and
reliability verification. One possible optimistic signing solution is for stanzas to carry alternative sets of
content, an unsigned content alternative and a signed content alternative. The premise of
this approach is that an entity supporting the signing extension could make use of the
signed content alternative while an entity not supporting the signing extension could make
use of the unsigned content alternative. The approach has been suggested to as a mechanism
for support extension-unaware entities downstream of extension-unware groupchat (or like)
services use of the stanza content. The following example not only illustrate this approach, but highlights some of the
issues with this approach: But it should be obvious that the signed and unsigned contents are not proper
alternatives. The signed content presumedly is what the signer sent. The unsigned content
is presumedly a modified version of what the signer sent. The modifications are generally
important to the entity making use of the stanza. In the above example, note that the
to/from addresses of the signed content differ from the unsigned content. Note as well
that the unsigned content contains a >delay/< element indicating that the stanza was
delayed in transit. Such modifications are generally important to the proper processing of
the content by not only this entity, but entities to which the content might be forwarded
to. Dual content, even in absence of attacks, simply complicates such processing. Note that the &BODY; element values differ between the signed and unsigned content. While
it reasonable straight forward (though significant work) to determine that the signed and
unsigned content differ, it is extermely difficult to to determine whether the changes are
due to normal processing or an attack. Dual content adds significant blot. In simple cases, the approach effective doubles the
content. However, in some use cases, the appraoch may lead to multiple doublings of the
content. It must be noted that verifying entities downstream of a redistribution will need some
mechanism to determine who signed the stanza, determine what signer is an appropriate
signer, and to obtain the public key of that signer. While certain information can be
placed in key data, the question of whether the signer is an appropriate signer for
purported sender (e.g., a room subscriber) generally would require information from the
redistribution service, and this would generally require the redistribution service to
support an extension to make that information available to entities desiring to verify the
signature(s). If one accepts the premise that downstream verification of redistributed
stanzas, such as via a MUC service, cannot be performed without extension and cooperation
of the redistribution service, then it follows that dual content can be avoided by having
the MUC service also support the signing extension. Dual content approaches should be avoided. While a signer may provide a &KEYINFO; element within the &SIGNATURE;, doing so will
significantly increase the size of the &SIGNATURE; element. As implementations may enforce a
maximum stanza size as small as 10,000 bytes, use of &KEYINFO; in stanza signatures should
be limited. It is also noted there are cases where the signer may not want to expose the key
information to all entities involved in the exchange of stanza. There are a number of ways key information may be published, such as in user's vCard. Key
information can also be provided at request, such as by &IQ;. Care must be taken in the design of not only ensure it provides an effective digital
signature solution for XMPP, but is designed itself with security in mind. This section
discussions some security issues in providing a digital signature solution. The design should
consider a general digital signature issues as well issues specific to the technologies
used/involved, and particulars of the solution. Due to the nature of XML and XMPP, an effective general digital signing solution for XMPP is
likely to be quite complex. This document suggests nothing less. With complexity comes
significant security risk. To minimize this risk, the solutions should avoid reinvention of
needed technology, such as signature and key information syntaxes, by reusing well established
and understood technologies such as XMLDSIG. Solutions should also favor simple and widely
used features of such technologies over esoteric or rarely used features Designers of the solution should be mind full of security considerations discussed in XMLDSIG
(regardless of whether XMLDSIG is used in the solution) If XMLDSIG is used, a number of security considerations would be introduced into the
solution. Implementations need to take special care in processing XMLDSIG &SIGNATURE; elements
to avoid a wide range of attacks. For instance, an attacker could attempt to mount a Denial of
Service attack by sending a &SIGNATURE; purporting to sign arbitrary large and complex
content. Or an attacker could attempt to mount a Distributed Denial of Service sending a
message to a chatroom that containing &SIGNATURE; with multiple references to large content
hosted at the attack target in hopes that each room participant will repeated fetch it. A
&SIGNATURE; element might also contain circler references.