From 8e4f9a5d8206beaaaf8ac1a0af06bb8730e71853 Mon Sep 17 00:00:00 2001
From: Peter Saint-Andre Removed the streamBase64 transfer mechanism and replaced it with In-band bytestreams (IBB). Created new examples, avoiding re-use of known public examples. Extended the descriptions of the different transfer mechanisms. Ability to state capabilities of the client in the request. Changed IQ stanza type from 'get' to 'set' for all HTTP methods. Updated format of encoding table, and made it into a definition list. Language corrections. Added more information about chunking. Added implementation notes regarding bandwidth and stanza size limitations in XMPP Servers. Added support for XEP-0131, Stanza Headers and Internet Metadata.
- Many documents have been written about how to transport XMPP datagrams using HTTP. The motivation behind such solutions has often been to be able to use XMPP in
+ Many documents have been written on how to transport XMPP datagrams using HTTP. The motivation behind such solutions has often been to be able to use XMPP in
scripting languages such as Java Script running in web browsers.
- But very little has been written up to this point about the reverse: How to transport HTTP methods and HTTP responses over an XMPP-based peer-to-peer network.
+ But up to this point very little has been written about the reverse: How to transport HTTP methods and HTTP responses over an XMPP-based peer-to-peer network.
Here, the motivation is as follows: There are multitudes of applications and APIs written that are based on HTTP over TCP as the basic communication transport protocol.
As these are moving closer and closer to the users, problems arise when the users want to protect their data and services using firewalls. Even though there are methods
today to open up firewalls manually or automatically permit communication with such devices and applications, you still open up the application for everybody. This
@@ -80,7 +128,7 @@ TODO: Inform:
Furthermore, with the advent of semantic web technologies and its use in web 3.0 and Internet of Things applications, such applications move even more rapidly into
- the privte spheres of the users, where security and privacy is of paramount importance, the need to use more secure transport protocols than HTTP over TCP is necessary.
+ the private spheres of the users, where security and privacy is of paramount importance, it is necessary to use more secure transport protocols than HTTP over TCP.
There are many different types of HTTP-based communication that one would like to be able to transport over XMPP. A non-exhaustive list can include:
@@ -109,13 +157,17 @@ TODO: Inform:
Here HTTP Client authentication to resources on the HTTP Server is made by a third party, an XMPP Server.
This document presupposes the server already has a web server (HTTP Server) implementation, and that it hosts content through it, content which can be both
- dynamic (i.e. generated) or static (for example files) in nature. Content, which it wants to
+ dynamic (i.e. generated) or static (e.g. files) in nature. Content, which it wants to
publish to XMPP clients as well as HTTP clients. It also presupposes that the client is aware of HTTP semantics and MIME encoding.
- All HTTP communication is done usig the Request/Response paradigm. Each HTTP Request is made sending an iq-stanza
- containing a req element to the server. Each iq-stanza sent is of type get (not to be confused by the HTTP Method
- GET).
+ All HTTP communication is done using the Request/Response paradigm. Each HTTP Request is made sending an iq-stanza
+ containing a req element to the server. Each iq-stanza sent is of type set.
When the server responds, it does so by sending an iq-stanza response (type result) back to the client containing a resp
@@ -189,67 +240,125 @@ TODO: Inform:
Requests or responses containing data must also consider how this data should be encoded within the XML telegram. Normally in HTTP, content and headers are separated
- by a blank line, and the transfer of the content is made in the same stream. Specific HTTP headers are used to define how the content is transfered and encoded within
- the stream (Content-Type, Content-Length, Content-Encoding, Content-Transfer-Encoding). This approach is not possible if the response is to be inbedded in an XML telegram,
+ by a blank line, and the transfer of the content is made in the same stream. Specific HTTP headers are used to define how the content is transferred and encoded within
+ the stream (Content-Type, Content-Length, Content-Encoding, Content-Transfer-Encoding). This approach is not possible if the response is to be embedded in an XML telegram,
since it can interfere with the encoding of the encompassing XML.
To solve this, this document specifies additional data transfer mechanisms that are compatible with the XMPP protocol. The normal HTTP-based content transfer headers will
still be transported, but do not affect the content encoding used in the XMPP transport. The following content encoding methods are available:
- text: Normal text content. The text is encoded as text within XML, using the same encoding used by the XML stream. XML escape characters (<, > and &)
- are escaped using the normal <, > and & character escape sequences.
-
- xml: Xml content embedded in the XML telegram. Note however, that any processing instructions or XML version statements must be avoided, since it may cause
- the XML stream to become invalid XML. If this is a problem, normal text encoding can be used as an alternative. The advantage of xml
- instead of text or base64 encodings is when used in conjuncion with EXI compression &xep0322;. EXI compression has the ability to
- compress XML efficiently. Text will not be compressed, unless response exists in internal string tables. Base-64 encoded data will be compressed so that the 33%
- size gain induced by the encoding is recaptured.
-
- base64: Base-64 encoded binary content. Can be used to easily embed binary content in the telegram.
-
- chunkedBase64: Chunked Base-64 encoded binary content. The content is not embedded in the telegram. Instead it is sent in chunks, using separate
- chunk messages to the client. Chunked transport can be used by the server when it doesn't know the size of the final result. Chunked transport is
- similar to the streamBase64 encoding with the (loose) distinction that a stream is indefinite, while chunked transport conveys finitely sized content.
-
- sipub: The sender might deem the content to be too large for sending embedded in the XMPP telegram. To circumnavigate this, the sender publishes
- the content as a file using &xep0137; (Publishing Stream Initiation Requests), instead of embedding the content directly. This might be the case for instance, when
- a client requests a video resource, without using a ranged request.
-
- streamBase64: This option may be used to encode indefinite streams, like live audio or video streams (HLS, SHOUTcast, Motion JPeg web cams, etc). It works
- as chunkedBase64 with the distinction that the content is indefinite. This option is not available in requests, only in responses.
-
- jingle: For demanding multi-media streams alternative methods to transport streaming rather than embedded into the XMPP stream may be
- required. Even though the streamBase64 method may be sufficient to stream a low-resolution web cam in the home, or listen to a microphone
- or a radio station, it is probably badly suited for high-resolution video streams with multiple video angles and audio channels. If such content is accessed
- and streamed, the server can negotiate a different way to stream the content using &xep0166;.
-
+ Normal text content. The text is encoded as text within XML, using the same encoding used by the XML stream. XML escape characters (<, > and &)
+ are escaped using the normal <, > and & character escape sequences.
+
+ Xml content embedded in the XML telegram. Note however, that any processing instructions or XML version statements must be avoided, since it may cause
+ the XML stream to become invalid XML. If this is a problem, normal text encoding can be used as an alternative. The advantage of xml
+ instead of text or base64 encodings is when used in conjunction with EXI compression &xep0322;. EXI compression has the ability to
+ compress XML efficiently. Text will not be compressed, unless response exists in internal string tables. Base-64 encoded data will be compressed so that the 33%
+ size gain induced by the encoding is recaptured.
+
+ Base-64 encoded binary content. Can be used to easily embed binary content in the telegram.
+
+ Chunked Base-64 encoded binary content. The content is not embedded in the telegram. Instead it is sent in chunks, using separate
+ chunk messages to the client. Chunked transport can be used by the server when it doesn't know the size of the final result.
+ Streaming content, i.e. content of infinite length, must use ibb or jingle transport types to transfer content.
+ If the content consists of a file, sipub should be used.
+
+ Chunked encoding is perfect for dynamic responses of moderate sizes, for instance for API method responses. The server does not know when the response
+ is begun to be generated what the final size will be, but it will be most probably "manageable". Using the chunked transfer mechanism enables the
+ server to start sending the content, minimizing the need for buffers, and at the same time minimizing the number of messages that needs to be sent,
+ increasing throughput.
+
+ The client can limit the maximum chunk size to be used by the server, using the maxChunkSize attribute in the request. The chunk
+ size can be set to a value between 256 and 65536. If not provided in the request, the server chooses an appropriate value. Note that chunks can
+ be sent containing a smaller amount of bytes than the maximum chunk size provided in the request.
+
+ The sender might deem the content to be too large for sending embedded in the XMPP telegram. To circumnavigate this, the sender publishes
+ the content as a file using &xep0137; (Publishing Stream Initiation Requests), instead of embedding the content directly. This might be the case for instance, when
+ a client requests a video resource, without using a ranged request.
+
+ This transfer mechanism is of course the logical choice, if the content is already stored in a file on the server, and the size of the file
+ is sufficiently large to merit the overhead of sipub. Smaller files can simply be returned using the text, xml
+ or base64 mechanisms.
+
+ The client can disable the use of sipub by the server, by including a sipub='false' attribute in the request.
+ sipub is enabled by default. On constrained devices with limited support for different XEP's, this can be a way to avoid the
+ use of technologies not supported by the client.
+
+ This option may be used to encode indefinite streams, like live audio or video streams (HLS, SHOUTcast, Motion JPeg web cams, etc).
+ It uses &xep0047; to send the content over an in-band bytestream to the client. This option is not available in requests, only in responses.
+
+ Streams must not use any of the above mechanisms. Only ibb and jingle mechanisms can be used. If the content
+ represents multimedia jingle is preferrable, especially if different encodings are available.
+
+ The client can disable the use of ibb by the server, by including a ibb='false' attribute in the request.
+ ibb is enabled by default. On constrained devices with limited support for different XEP's, this can be a way to avoid the
+ use of technologies not supported by the client.
+
+ For demanding multi-media streams alternative methods to transport streaming rather than embedded into the XMPP stream may be
+ required. Even though the ibb method may be sufficient to stream a low-resolution web cam in the home, or listen to a microphone
+ or a radio station, it is probably badly suited for high-resolution video streams with multiple video angles and audio channels. If such content is accessed
+ and streamed, the server can negotiate a different way to stream the content using &xep0166;.
+
+ The client can disable the use of ingle by the server, by including a jingle='false' attribute in the request.
+ jingle is enabled by default. On constrained devices with limited support for different XEP's, this can be a way to avoid the
+ use of technologies not supported by the client.
+
- Note: Content endoded using chunkedBase64 or streamBase64 encoding methods can be terminated, either by the receptor going
- off-line, or by sending a close command to the sender.
+ Note: Content encoded using chunkedBase64 encoding method can be terminated, either by the receptor going off-line, or by
+ sending a close command to the sender. The transfer methods sipub, ibb and jingle have
+ their own mechanisms for aborting content transfer.
@@ -262,12 +371,14 @@ TODO: Inform:
-
+
+
- Note 2: The above Turtle example was taken from http://www.w3.org/TR/turtle/#sec-intro. -
@@ -343,12 +448,14 @@ TODO: Inform:
- Note: The above SPARQL example was taken from http://www.w3.org/TR/sparql11-query/#MultipleOptionals - in combination with http://www.w3.org/TR/sparql11-protocol/#select-longpost. -
-- Note 2: If using xml encoding of data, care has to be taken to avoid including the version and encoding information + Note: If using xml encoding of data, care has to be taken to avoid including the version and encoding information (<?xml version="1.0"?>) at the top of the document, otherwise the resulting XML will be invalid.
- Note: The example is taken from §2.1 RFC 5789. -
-- In the following sub-sections, the different data encoding formats are discussed with corresponding examples to illustrate how they work. + In the following sub-sections, the different data encoding formats are discussed, each with corresponding examples to illustrate how they work. The interesting part of these examples is the data element and its contents.
-Text responses is a simple way to return text responses (i.e. any MIME Type starting with text/). Since the text is embedded into XML, the characters <, > and & need to be escaped to <, > and & respectively. @@ -590,42 +704,35 @@ Host: clayster.com
The following example shows how a TURTLE response, which is text-based, is returned using the text encoding:
-- XML is a conveniant way to return XML embedded in the XMPP response. This can be suitable for MIME Types of the form .*/(.*[+])?xml + XML is a convenient way to return XML embedded in the XMPP response. This can be suitable for MIME Types of the form .*/(.*[+])?xml (using regular expression to match them), like text/xml, application/soap+xml or application/sparql-results+xml. Care has to be taken however, since not all XML constructs can be embedded as content to an XML element without invalidating it, like the xml version and encoding declaration (<?xml version="1.0"?> as an example). @@ -635,45 +742,38 @@ Host: clayster.com encode the XML escape characters <, > and &, or use another encoding, like base64.
- The advantage of xml instead of text or base64 encodings is when used in conjuncion with + The advantage of xml instead of text or base64 encodings is when used in conjunction with EXI compression. EXI compression has the ability to compress XML efficiently. Text will not be compressed, unless response exists in internal string tables. Base-64 encoded data will be compressed so that the 33% size gain induced by the encoding is recaptured.
-- Base-64 encoding is a simple way to encode content that is easilly embedded into XML. Apart from the advantage of being easy to encode, + Base-64 encoding is a simple way to encode content that is easily embedded into XML. Apart from the advantage of being easy to encode, it has the disadvantage to increase the size of the content by 33% (unless EXI compression is used at the same time), since it requires 4 bytes to encode 3 bytes of data. Care has to be taken not to send too large items using this encoding.
The following example shows an image is returned using the base64 encoding:
-In HTTP, Chunked Transfer Encoding is used when the sender does not know the size of the content being sent, and to avoid having its buffers overflow, sends the content in chunks with a definite size. @@ -726,17 +828,19 @@ Host: clayster.com predetermined. Chunks are naturally delimited and embedded in the XML stanza. The last chunk in a response must have the last attribute set to true.
-Note: Chunked encoding assumes the content to be finite. If content is infinite (i.e. for instance live streaming), - the streamBase64 transfer encoding must be used instead. If the sender is unsure if the content is finit or inifinite, - the streamBase64 must be used. + the ibb or jingle transfer encodings must be used instead. If the sender is unsure if the content is + finite or infinite, ibb or jingle must be used. +
++ Note 2: If the web server sends chunked data to the client it uses the HTTP header Transfer-Encoding: chunked, + and then sends the data in chunks but with chunk sizes inserted so the receiving end can decode the incoming data. Note that this data will + be included in the data sent in the XMPP chunks defined by this document. In this case, data will be chunked twice: First by the web server, + and then by the HTTP over XMPP transport layer. When received by the client, it is first reassembled by the HTTP over XMPP layer on the client, + and then by the HTTP client who will read the original chunk size elements inserted into the content. More information about HTTP chunking, + can be found in RDF2616 §3.6.1.
Often content being sent can be represented by a file, virtual or real, especially if the content actually represents a file and is not dynamically generated. In these instances, instead of embedding the contents in the response, @@ -769,17 +881,19 @@ Host: clayster.com XEP 0137: Publishing Stream Initiation Requests. This is done using the sipub element.
-Some web servers provide streaming content, i.e. content where packets are sent according to a timely fashion. Examples are video and audio streams like HLS (HTTP Live Streams), SHOUTcast, ICEcast, Motion JPeg, etc. In all these examples, content is infinite, @@ -804,31 +918,26 @@ Host: clayster.com example.
- Such content must use the streamBase64 encoding scheme, if used. The streamBase64 is similar to - the chunkedBase64 encoding, except it is imlicitly understood that the stream can be indefinite and needs an - explicit way to close the stream. Closing the stream is done by sending a close command to the sender of the - stream. The close command can also be used to close a chunked stream, and senders of chunked data should - support terminating the output if a close command on the stream is received. + Such content must use the ibb transfer mechanism, if used (or the jingle transfer machanism). + The ibb transfer mechanism uses In-Band Bytestreams + to transfer data from the server to the client. It starts by sending an a ibb element containing a sid + attribute identifying the stream. Then the server sends an ibb:open IQ-stanza to the client according to + XEP-0047. The client can choose to reject, negotiate or acceopt the request + whereby the transfer is begun. When the client is satisified and wants to close the stream, it does so, also according to + XEP-0047. The sid value returned in the HTTP response + is the same sid value that is later used by the IBB messages that follow. In this way, the client can relate + the HTTP request and response, with the corresponding data transferred separately.
-- Note: All streams (or chunked responses) are automatically closed if one of the parties goes off-line. -
-- Note 2: A streamed response does not need to be indefinite. It can be finite. If it is finite, the last chunk - must have the last attribute set to true. -
-- Note 3: The close command can be sent using a message or iq stanza. Use iq stanzas if a - response is desired. The response will be the same close command sent. -
-For demanding multi-media streams alternative methods to transport streaming rather than embedded into the XMPP stream may be - required. Even though the streamBase64 method may be sufficient to stream a low-resolution web cam in the home, or listen to a microphone + required. Even though the ibb method may be sufficient to stream a low-resolution web cam in the home, or listen to a microphone or a radio station, it is probably badly suited for high-resolution video streams with multiple video angles and audio channels. If such content is accessed and streamed, the server can negotiate a different way to stream the content using XEP 0166: Jingle.
-By creating a new scheme for HTTP over XMPP transport, and implementing support for it in web browsers, XML HTTP request objects and web servers, - Web Applications previously requiring web hosting on the Internet will be able to be hosted privatly behind firewalls instead, by simply switching + Web Applications previously requiring web hosting on the Internet will be able to be hosted privately behind firewalls instead, by simply switching from the http:// scheme to an xmpp:// scheme. All relative URL's within the application, including URL's sent to the XHR object (Ajax) will automatically be directed to use the HTTP over XMPP transport instead.
@@ -985,7 +1111,7 @@ Host: clayster.com Here, the JID to use in the URL, must not include a resource. Only user name, the @ character and the domain. The / separator between the JID and the Path is actually part of the Part. -- When resolving an URL using the xmpp scheme, the browser needs to extract the JID of the server hosting the resource. If that JID + When resolving an URL using the xmpp:// scheme, the browser needs to extract the JID of the server hosting the resource. If that JID is already in the roster, the request can proceed as usual.
@@ -1036,7 +1162,7 @@ Host: clayster.com
- Only relative URL's are used within references (images, audio, video, links, objekts, etc.). If absolute URL's are used (including scheme), + Only relative URL's are used within references (images, audio, video, links, objects, etc.). If absolute URL's are used (including scheme), the browser might get the first page correctly, but will be unable to get the content with the absolute URL, unless the URL has the same scheme as the principal page.
@@ -1048,13 +1174,13 @@ Host: clayster.com- Any URL's sent to the XML HTTP Request Object directed to API's or resources hosted by the same application must also be relative, + Any URL's sent to the XML HTTP Request (XHR) Object directed to API's or resources hosted by the same application must also be relative, for the same reasons as above. The XHR Object supports relative URL's.
- If the above rules are met, which they should under normal conditions as well, typing in the xmpp:// URL in the browser (for instance when + If the above rules are met, which they should under normal conditions, typing in the xmpp:// URL in the browser (for instance when you're at the office) should display the application (hosted for example at home behind a firewall) in the same way as when you use http:// (or https://) when you have access to the server (for instance when you're home), as long as friendship exists between the browser JID and the server JID. @@ -1067,30 +1193,33 @@ Host: clayster.com are mostly HTTP over TCP based, even though there are bindings which are not based on this. The most common APIs today (REST) are however all based on HTTP over TCP. Being HTTP over TCP requires the web server hosting the web services either to be public or directly accessible by the client. But as the services move closer to end users (for instance a Thermostat publishing a REST API for control in your home), problems arise - when you try to access the web service outside of private network in which the API is available. As explained previously, the HTTP over XMPP solves this. + when you try to access the web service outside of private network in which the API is available. As explained previously, the use of HTTP over XMPP + solves this.
- The following example shows a simple SOAP method call, taken from - http://www.w3schools.com/soap/soap_example.asp: + The following example shows a simple SOAP method call: