diff --git a/xep-0301.xml b/xep-0301.xml index d5746e43..658fed56 100644 --- a/xep-0301.xml +++ b/xep-0301.xml @@ -29,6 +29,12 @@ RealJabber.org and Rejhon Technologies Inc. http://www.realjabber.com + + 0.5 + 2012-07-22 + MDR +

Minor corrections and clarifications.

+
0.4 2012-07-20 @@ -76,6 +82,7 @@ +

This document defines a specification for real-time text transmitted in-band over an XMPP network.

Real-time text is text transmitted instantly while it is being typed or created. The recipient can immediately read the sender's text as it is written, without waiting. Text can be used conversationally, similar to a telephone conversation, where one listens while the other is speaking. It eliminates waiting times found in messaging, and is favored by deaf and hard of hearing individuals who prefer text conversation. For a visual animation of real-time text, see RealJabber.org RealJabber.org, the author's website covering this specification, including animation examples of what real time text looks like. <http://www.realjabber.org>..

Real-time text has been around for decades in various implementations:

@@ -85,12 +92,12 @@
  • TTY and text telephones for the deaf and hard of hearing.
  • For SIP, real-time text is sent using IETF RFC 4103 IETF RFC 4103: RTP Payload for Text Conversation. <http://tools.ietf.org/html/rfc4103>. with ITU-T T.140 ITU-T T.140: Protocol for multimedia application text conversation. <http://www.itu.int/rec/T-REC-T.140>. presentation coding.
  • In 2008, AOL AIM 6.8 gained the AOL Real-Time IM AOL AIM Real Time Text: <http://help.aol.com/help/microsites/microsite.do?cmd=displayKC&externalId=223568>. feature.
  • -
  • Deployment of Reach112 Reach112: European emergency service with real-time text. <http://www.reach112.eu>. in Europe, an accessible emergency service with real-time text.
  • +
  • A component within Total Conversation, used by Reach112 Reach112: European emergency service with real-time text. <http://www.reach112.eu>. in Europe, an accessible emergency service with real-time text.
  • -

    Real-time text is suitable for smooth and rapid mainstream communication in text, as an all-inclusive technology to complement instant messaging. It can also allow immediate conversation in situations where speech cannot be used (e.g. quiet environments, privacy, deaf and hard of hearing). Real-time text is also beneficial in emergency situations, due to its immediacy. This document defines a specification for real-time text transmitted in-band over an XMPP network.

    +

    Real-time text is suitable for smooth and rapid mainstream communication in text, as an all-inclusive technology to complement instant messaging. It can also allow immediate conversation in situations where speech cannot be used (e.g. quiet environments, privacy, deaf and hard of hearing). Real-time text is also beneficial in emergency situations, due to its immediacy.

    @@ -98,14 +105,14 @@
  • Allow transmission of real-time text with a low latency.
  • Balance low latencies versus system, network and server limitations.
  • Support message editing in real-time, including text insertions and deletions.
  • -
  • Support transmission of the original intervals between key presses, to preserve look-and-feel of typing independently of transmission intervals.
  • +
  • Support transmission and reproduction of the original intervals between key presses, to preserve look-and-feel of typing independently of transmission intervals.
    1. Reliable real-time text delivery.
    2. Be backwards compatible with XMPP clients that do not support real-time text.
    3. -
    4. Minimize reliance on network traversal protocols and/or out-of-band transmission protocols.
    5. +
    6. Minimize reliance on network traversal mechanisms and/or out-of-band transmission protocols.
    7. Compatible with multi-user chat (MUC) and simultaneous logins.
    @@ -119,7 +126,7 @@
      -
    1. Allow XMPP to follow the ITU-T Rec. F.703 ITU-T Rec. F.703: Multimedia conversational services. <http://www.itu.int/rec/T-REC-F.703>. Total Conversation accessibility standard for simultaneous voice, video, and real-time text.
    2. +
    3. Allow XMPP to follow the ITU-T Rec. F.703 ITU-T Rec. F.703: Multimedia conversational services. <http://www.itu.int/rec/T-REC-F.703>. Total Conversation standard for simultaneous voice, video, and real-time text.
    4. Be a candidate technology for use with Next Generation 9-1-1 / 1-1-2 emergency services.
    5. Be suitable for transcription services and (when coupled with voice at user's choice) for TTY/text telephone alternatives, relay services, and captioned telephone systems.
    6. Be an accessible enhancement for mobile phone text messaging and mainstream instant messaging.
    7. @@ -128,7 +135,7 @@

      real-time – A conversational latency of less than 1 second, as defined by ITU-T Rec. F.700 ITU-T Rec. F.700: Framework Recommendation for multimedia services <http://www.itu.int/rec/T-REC-F.700>., section 2.1.2.1.

      -

      real-time text – Text transmitted in real-time while it is being typed or created.

      +

      real-time text – Text transmitted instantly while it is being typed or created.

      real-time message – Recipient's real-time view of the sender's message still being typed or created.

      action element – An XML element that represents a single real-time message edit, such as text insertion or deletion.

      RTT – Acronym for real-time text.

      @@ -136,7 +143,7 @@

      Real-time text is transmitted via an <rtt/> child element of a <message/> stanza. The <rtt/> element is transmitted at regular intervals by the sender client while a message is being composed, to allow the recipient to see the sender type the message, without waiting for the full message sent in a <body/> element.

      -

      This is a basic example of a real-time message "Hello, my Juliet!” transmitted live while it is being typed, before a final message delivery in a <body/> element (to remain Backwards Compatible):

      +

      This is a basic example of a real-time message "Hello, my Juliet!” transmitted in real-time while it is being typed, before a final message delivery in a <body/> element (to remain Backwards Compatible):

      Example 1: Introductory Example

      @@ -229,8 +236,8 @@ -

      For the best balance between interoperability and usability, the default transmission interval of <rtt/> elements for a continuously-changing message SHOULD be approximately 0.7 second. This interval meets ITU-T Rec. F.700 for real-time conversation. If a different transmission interval needs to be used, the interval SHOULD be between 0.3 second and 1 second.

      -

      A longer interval will lead to a less optimal user experience. Conversely, a much shorter interval may more frequently trigger throttling or flooding protection algorithms in public XMPP servers, leading to dropped <message/> elements and/or Congestion Considerations.

      +

      For the best balance between interoperability and usability, the default transmission interval of <rtt/> elements for a continuously-changing message SHOULD be approximately 0.7 second. This interval meets ITU-T Rec. F.700 Section A.3.2.1 for good quality real-time conversation. If a different transmission interval needs to be used, the interval SHOULD be between 0.3 second and 1 second.

      +

      A longer interval will lead to a less optimal user experience in most network conditions. Conversely, a much shorter interval may more frequently trigger throttling or flooding protection algorithms in public XMPP servers, leading to dropped <message/> elements and/or Congestion Considerations.

      To provide fluid real-time text, one or more of the following methods can be used:

      • Preserving Key Press Intervals for natural typing display, independently of the transmission interval.
      • @@ -240,7 +247,7 @@

        The <rtt/> element MAY contain one or more action elements representing real-time text operations, including text being appended, inserted, or deleted.

        -

        Many chat clients allow a sender to edit their message before sending (via a Send button, or pressing Enter). The inclusion of real-time text functionality, in existing client software, needs to preserve the sender's existing expectation of being able to edit their messages. In a chat session with real-time text, the recipient can watch the sender compose and edit their message before it is delivered.

        +

        Many chat clients allow a sender to edit their message before sending (via a Send button, or pressing Enter). The inclusion of real-time text functionality, in existing client software, needs to preserve the sender's existing expectation of being able to edit their messages. In a chat session with real-time text, the recipient can watch the sender compose and edit their message before it is completed.

        This is a short summary of action elements that operate on a real-time message. For detailed information, see List of Action Elements.

        @@ -378,7 +385,7 @@

        Recipients MUST keep track of separate real-time messages per sender, including maintaining independent seq values. Recipients MAY also use additional methods to distinguish Simultaneous Logins, including using the full JID and/or <thread/>. 

        -

        Loss of sync occurs if the seq attribute does not increment as expected when Staying In Sync. In this case:

        +

        Loss of sync occurs if the seq attribute do not increment as expected when Staying In Sync. In this case:

        • Recipients MUST freeze the current real-time message; and
        • Recipients MUST ignore action elements within the current and subsequent <rtt/> elements; and
        • @@ -396,8 +403,8 @@
          • When the recipient starts sending messages from a different full JID (e.g. switched systems);
          • When the recipient sends a presence update (e.g. from offline to online);
          • -
          • When the sender resumes composing after an extended pause (e.g. recovery from Stale Messages handling);

          • -
          • When the conversation is unlocked (e.g. section 5.1 of &xmppim;);

          • +
          • When the sender resumes composing after an extended pause (e.g. recovery from Stale Messages handling);
          • +
          • When the conversation is unlocked (e.g. section 5.1 of &xmppim;);

          A message reset is done using the <rtt/> attribute event value of 'reset' (see RTT Attributes).

          @@ -455,7 +462,7 @@

          Activation of real-time text in a chat session (immediate or user-initiated) can be done by:

          • Immediately transmitting real-time text (if allowed after Determining Support); or
          • -
          • Signaling first (by transmitting a single <rtt event='init'/> and waiting for response).
          • +
          • Signaling first (by transmitting a single <rtt event='init'/> and waiting for response).

          Recipient clients can respond to incoming real-time text with an appropriate response, such as:

            @@ -463,6 +470,7 @@
          • Accepting after recipient confirmation (by also activating in response, after a user confirmation prompt); or
          • Deny (by transmitting <rtt event='cancel'/> which is used in Deactivation Methods); or
          • Ignoring (by discarding incoming <rtt/> as a last resort, without using Deactivation Methods).
          • +
          • Other appropriate responses (e.g. only display incoming real-time text during Multi-User Chat).

          If Determining Support allows, then it is not necessary for senders or recipients to transmit <rtt event='init'/> first, as any incoming RTT Element (other than <rtt event='cancel'/>) signals the start of incoming real time text. However, it allows signaling of activation before the sender begins typing.

          In the absence of Determining Support, sender clients can send a single <rtt event='init'/> element to attempt to activate real-time text. It is inappropriate for sender clients to send any further <rtt/> elements unless support is confirmed by discovery, or when the recipient client responds with incoming <rtt/> elements during the same chat session.

          @@ -471,16 +479,16 @@

          Real-time text can be deactivated by any of:

            -
          • Sending a signal (using <rtt event='cancel'/> upon deactivation, deny, or end of chat session); or
          • +
          • Sending a signal (using <rtt event='cancel'/> upon deactivation, deny, or end of chat session); or
          • Simply ending the chat session (without transmitting any further <rtt/> elements).

          Recipient clients can respond to deactivation with appropriate responses, such as:

            -
          • Deactivating in response (stop transmitting <rtt/> elements); and
          • -
          • Handling any unfinished incoming real-time message (clearing and/or saving it); and
          • -
          • Inform the recipient user that real-time text has ended (or denied/cancelled, if no real-time text was received).
          • +
          • Discontinue transmission of <rtt/> elements as well (not applicable to Multi-User Chat); and
          • +
          • Handle the sender's unfinished incoming real-time message (such as clearing it and/or saving it); and
          • +
          • Inform the recipient user that sender ended real-time text (or denied/cancelled, if no real-time text was received).
          -

          Sending an <rtt event='cancel'/> is useful in situations where the user closes a chat window, and ends the chat session. It is useful when the user wants to deactivate real-time text, while still continuing the chat session. After deactivation, either party may reactivate real-time text again in accordance to Activation Methods.

          +

          Sending an <rtt event='cancel'/> is useful in situations where the user closes a chat window, and ends the chat session. It is useful when the user wants to deactivate real-time text, while still continuing the chat session. After deactivation, any client can reactivate real-time text again in accordance to Activation Methods.

          @@ -578,7 +586,9 @@

          The in-band nature of this real-time text standard allows one-to-many situations. Thus, real-time text is appropriate for use with &xep0045; (MUC), as well as concurrent simultaneous logins.

          -

          For simplicity, clients can implement real-time text only for one-on-one chat, and not for MUC. However, it can be appropriate to support <rtt/> elements in MUC, even if not all participants support real-time text. Participants that enable real-time text during group chat need to keep track of multiple concurrent real-time messages on a per-participant basis. Participants, with real-time text, will see real-time text coming from each participant that has real-time text enabled. Participants that turn off real-time text for themselves, can simply ignore incoming <rtt/> and not transmit outgoing <rtt/>. Participant clients without real-time text (whether unsupported or turned off) will simply see group chat function normally on a line-by-line basis, since it is Backwards Compatible. For the same participant logged in multiple times in the same room, see Simultaneous Logins. Participant clients in MUC receiving an <rtt event=’cancel’/> should not automatically turn off outgoing real-time text (otherwise, one participant could deny real-time text between other willing participants). To minimize on-screen clutter of multiple idle real-time messages, clients can hide idle messages, clear old Stale Messages, and/or prioritize the display of the most useful real-time messages. Prominent visibility of real-time text can be assigned to recent typists and/or moderators (e.g. classroom teacher, convention speaker). In situations of simultaneous typing by a large number of participants, see Congestion Considerations.

          +

          For simplicity, clients can implement real-time text only for one-on-one chat, and not for MUC. However, it can be appropriate to support <rtt/> elements in MUC, even if not all participants support real-time text. Participants that enable real-time text during group chat need to keep track of multiple concurrent real-time messages on a per-participant basis. Participants, with real-time text, will see real-time text coming from each participant that has real-time text enabled. Participant clients without real-time text (whether unsupported or turned off) will simply see group chat function normally on a line-by-line basis, since it is Backwards Compatible.

          +

          Participants that turn off real-time text for themselves, can simply ignore incoming <rtt/> and not transmit outgoing <rtt/>. Participant clients in MUC receiving an incoming <rtt event=’cancel’/> needs to keep outgoing transmission unaffected during Deactivation Methods (otherwise, one participant could deny real-time text between other willing participants).

          +

          To minimize on-screen clutter of multiple idle real-time messages, clients can hide idle messages, clear old Stale Messages, and/or prioritize the display of the most useful real-time messages. Prominent visibility of real-time text can be assigned to recent typists and/or moderators (e.g. classroom teacher, convention speaker). For the same participant logged in multiple times in the same room, see Simultaneous Logins. In situations of simultaneous typing by a large number of participants, see Congestion Considerations.

          In simultaneous login situations, transmitting of <rtt/> works in one-to-many situations without any special software support. For many-to-one situations where there is incoming <rtt/> from more than one simultaneous login, Keeping Real-Time Text Synchronized will pause the real-time message upon conflicting <rtt/>, and resume during the next Message Reset, presumably from the active login. This provides a seamless system-switching experience. A good implementation of Message Reset will improve user experience, regardless of whether or not the client follows Best Practices For Resource Locking (XEP-0296). Clients can choose to distinguish the <rtt/> streams (via full JID and/or via <thread/>) and keep multiple concurrent real-time messages similar in manner to Multi-User Chat, with the Stale Messages being timed-out.

          @@ -590,7 +600,7 @@

          Senders that resume composing a message (i.e. continues a partially-composed message hours later) can transmit a Message Reset, which allows recipients to redisplay the real-time message.

          -

          With real-time text, frequent screen updates can occur. Screen updates are a potential performance bottleneck, since fast typists type many key presses per second. Optimizing screen updates becomes especially important for slower platforms. Real-time messages need to be displayed efficiently in a flicker-free manner. The real-time message might be implemented as a separate window or separate display element.

          +

          With real-time text, frequent screen updates can occur. Screen updates are a potential performance bottleneck, since fast typists type many key presses per second. Optimizing screen updates becomes especially important for slower platforms. Real-time messages need to be redisplayed efficiently in a flicker-free manner. The real-time message might be implemented as a separate window or separate display element.

          Battery life considerations are closely related to performance, as the addition of real-time text may impact battery life. If Preserving Key Press Intervals are supported, then the implementation of Element <w/> – Wait Interval needs to be implemented in a battery-efficient manner. The Transmission Interval may vary dynamically to optimize for battery life and wireless reception. For devices where screen updates are an unavoidable inefficient bottleneck, see Low-Bandwidth and Low-Precision Text Smoothing to reduce the number of screen updates per second.

          @@ -941,8 +951,8 @@

          There are other real-time text formats with interoperability considerations relating to the session setup level, the media transport level, and presentation level. Interoperability specifications between multiple real-time text formats can be found in the Interoperability section of Real-Time Text Taskforce - Real-Time Text Taskforce, a foundation for real-time text standardization - <http://www.realtimetext.org>..

          -

          It is appropriate for implementers to choose the most appropriate real-time text standard for the session control standard in use during a particular session. Clients that use XMPP (e.g. Google Talk) would utilize this XEP-0301 specification. Clients that use SIP can utilize IETF RFC 4103, IETF RFC 5194 IETF RFC 5194: Framework for Real-Time Text over IP Using the Session Initiation Protocol (SIP). <http://tools.ietf.org/html/rfc5194>. and ITU-T T.140. Clients that run on multiple networks, may need to utilize multiple real-time text standards. To interoperate between incompatible real-time text standards, gateway servers can transcode between different real-time text standards, along with other media such as audio and video. See Total Conversation – Combination with Audio and Video.

          + Real-Time Text Taskforce, a foundation for real-time text standardization <http://www.realtimetext.org>..

          +

          It is appropriate for implementers to choose the most appropriate real-time text standard for the session control standard in use during a particular session. Clients that use XMPP (e.g. Google Talk) would utilize this XEP-0301 specification. Clients that use SIP can utilize IETF RFC 4103, IETF RFC 5194 IETF RFC 5194: Framework for Real-Time Text over IP Using the Session Initiation Protocol (SIP). <http://tools.ietf.org/html/rfc5194>. and ITU-T T.140. Clients that run on multiple networks, may need to utilize multiple real-time text standards. To interoperate between incompatible real-time text standards, gateway servers can transcode between different real-time text standards, along with other media such as audio and video. This can include TTY and textphones. Also, see Total Conversation – Combination with Audio and Video.

          In the SIP environment, real-time text is specified in IETF RFC 4103 and ITU-T T.140. SIP is a popular real-time session control protocol, and there are many implementations of real-time text controlled by SIP. This includes some emergency service organizations (e.g. Reach 112).

          Interoperability considerations include addressing translation, media negotiation and translation, and media transcoding. Transcoding is straightforward between this specification and T.140/RFC4103, except for editing in the middle of messages. Text insertions or deletions, occurring far back in the message, can cause a large number of erase operations in T.140 that consume time and bandwidth. T.140 specifies the use of ISO 6429 control codes for presentation characteristics, such as text color, that are not supported by this specification. During transcoding, these control codes needs to be filtered off in order to not disturb the presentation of text.

          @@ -955,7 +965,7 @@
          -

          It is important for implementers of real-time text to educate users about real-time text. Users of real-time text needs to be aware that their typing is now visible in real-time to everyone in the current chat conversation. This may have security implications if users copy & paste private information into their chat entry buffer (e.g. a shopping invoice) before editing out the private parts of the pasted text (e.g. a credit card number) before they send the message. With real-time message editing, recipients can watch all text changes that occur in the sender's text, before the sender sends the final message. Implementation behaviors and improved education can be added to reduce privacy issues. Examples include introduction upon first activation of feature, special handling for copy and pastes (i.e. preventing them, or prompting for confirmation), recipient confirmation of real-time text via Activating and Deactivating Real-Time Text, etc.

          +

          It is important for implementers of real-time text to educate users about real-time text. Users of real-time text needs to be aware that their typing is now visible in real-time to everyone in the current chat conversation. This may have security implications if users copy & paste private information into their chat entry buffer (e.g. a shopping invoice) before editing out the private parts of the pasted text (e.g. a credit card number) before they send the message. With real-time message editing, recipients can watch all text changes that occur in the sender's text, before the sender sends the final message. Implementation behaviors and improved education can be added to reduce privacy issues. Examples include showing an introduction upon first activation of feature, special handling for copy and pastes (i.e. preventing them, or prompting for confirmation), recipient confirmation of real-time text via Activating and Deactivating Real-Time Text, etc.

          Real-time text (<rtt/> elements) transmit the content contained within messages. Therefore, a client that encrypts <body/>, also needs to also encrypt <rtt/> as well:

          @@ -1056,7 +1066,7 @@ ]]>

          -

          The author would like to thank Real-Time Text Taskforce (R3TF) at www.realtimetext.org for their contribution to the technology documented in this specification. Mark Rejhon leads the Jabber/XMPP Taskgroup at R3TF. Members of R3TF who have contributed to this document include Gunnar Helstrom (Omnitor), Paul E. Jones (Cisco), Gregg Vanderheiden (Trace R&D Center, University of Wisconsin), Barry Dingle (Interopability Leader, R3TF), and Arnoud van Wijk (Founder, R3TF). Others contributors include Bernard Aboba (Microsoft), Darren Sturman (Teligent Telecom), Christian Vogler (Gallaudet University), Norm Williams (Gallaudet University), and several members from the XMPP Standards Mailing List, including Kevin Smith (XSF), Peter Saint Andre (XSF), and many others.

          +

          The author would like to thank Real-Time Text Taskforce (R3TF) at www.realtimetext.org for their contribution to the technology documented in this specification. Mark Rejhon leads the Jabber/XMPP Taskgroup at R3TF. Members of R3TF who have contributed to this document include Gunnar Hellstrom (Omnitor), Paul E. Jones (Cisco), Gregg Vanderheiden (Trace R&D Center, University of Wisconsin), Barry Dingle (Interopability Leader, R3TF), and Arnoud van Wijk (Founder, R3TF). Others contributors include Bernard Aboba (Microsoft), Darren Sturman (Teligent Telecom), Christian Vogler (Gallaudet University), Norm Williams (Gallaudet University), and several members from the XMPP Standards Mailing List, including Kevin Smith (XSF), Peter Saint Andre (XSF), and many others.

          “Natural Typing”, the technique of Preserving Key Press Intervals, is acknowledged as an invention by Mark Rejhon, who is deaf. This technology is provided to XMPP.org as part of this specification in compliance of the XSF's Intellectual Property Rights Policy at http://xmpp.org/extensions/ipr-policy.shtml.