<abstract>This document defines an XMPP protocol extension to enable entities to page through and otherwise manage the receipt of large result sets.</abstract>
<remark><p>Per a vote of the Jabber Council, advanced status to Draft.</p></remark>
</revision>
<revision>
<version>0.14</version>
<date>2006-09-13</date>
<initials>ip</initials>
<remark><p>Clarified terminology regarding pages and result sets.</p></remark>
</revision>
<revision>
<version>0.13</version>
<date>2006-09-07</date>
<initials>ip/psa</initials>
<remark><p>Reverted to v0.11 with slight wording changes.</p></remark>
</revision>
<revision>
<version>0.12</version>
<date>2006-09-06</date>
<initials>psa/vm</initials>
<remark><p>Added index attribute to before element; removed index element (use after or before instead).</p></remark>
</revision>
<revision>
<version>0.11</version>
<date>2006-08-25</date>
<initials>ip</initials>
<remark><p>Made count and index optional, changed protocol for getting count, more clarifications and examples.</p></remark>
</revision>
<revision>
<version>0.10</version>
<date>2006-08-24</date>
<initials>ip</initials>
<remark><p>Added before and first elements, specified how to return an empty page and how to request the last page, removed reverse order sets, added out-of-order page access.</p></remark>
</revision>
<revision>
<version>0.9</version>
<date>2006-08-23</date>
<initials>ip</initials>
<remark><p>Eliminated static result sets, justified expanded and clarified dynamic result sets, added page-not-found error, described when minimal state is necessary, added reverse order sets.</p></remark>
</revision>
<revision>
<version>0.8</version>
<date>2006-08-22</date>
<initials>psa</initials>
<remark><p>Added optional method for handling dynamic result sets.</p></remark>
</revision>
<revision>
<version>0.7</version>
<date>2006-08-10</date>
<initials>psa</initials>
<remark><p>Updated implementation note to clarify handling of result sets (static vs. dynamic).</p></remark>
</revision>
<revision>
<version>0.6</version>
<date>2006-07-12</date>
<initials>psa</initials>
<remark><p>Updated implementation note to clarify handling of result sets (static vs. dynamic).</p></remark>
</revision>
<revision>
<version>0.5</version>
<date>2006-05-02</date>
<initials>psa</initials>
<remark><p>Clarified error handling, determination of support in the context of using protocols, and security considerations.</p></remark>
</revision>
<revision>
<version>0.4</version>
<date>2006-04-24</date>
<initials>psa</initials>
<remark><p>Specified that an item count may be approximate; specified that an item count may be returned with a page of results.</p></remark>
</revision>
<revision>
<version>0.3</version>
<date>2006-04-21</date>
<initials>psa/vm</initials>
<remark><p>Added <end/> element to specify last result set; added service discovery information; added more examples.</p></remark>
<p>In &xep0055;, &xep0030;, &xep0060;, &xep0136;, and probably other future XMPP extensions, it is possible to receive large dynamic result sets in response to information requests (e.g., a user directory search on a common first name or a service discovery items request sent to a &xep0045; service). This XMPP protocol extension enables the following functionality for use by other XMPP protocols:</p>
<ol>
<li>Page forwards or backwards through a result set by retrieving the items in smaller subsets.</li>
<li>Discover the size of a result set without retrieving the items themselves.</li>
<li>Retrieve a page (subset) of items starting at any point in a result set.</li>
</ol>
</section1>
<section1 topic='Use Cases' anchor='usecases'>
<section2 topic='Limiting the Number of Items' anchor='limit'>
<p>In order to limit the number of items of a result set to be returned, the requesting entity specifies a request type of "set" and the maximum size of the desired subset (via the XML character data of the <max/> element):</p>
<example caption='Requesting a Limit to the Result Set'><![CDATA[
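<!-- Illustrative sketch only: the JIDs, the 'id' value, and the <nick/> search
     field are assumed for this example. The <max/> value caps the result at
     10 items. -->
<iq type='set' from='user@example.com/client' to='search.example.com' id='limit1'>
  <query xmlns='jabber:iq:search'>
    <nick>Pete</nick>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <max>10</max>
    </set>
  </query>
</iq>
]]></example>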
<section2 topic='Paging Forwards Through a Result Set' anchor='forwards'>
<p>An entity often needs to retrieve a page of items adjacent to a page it has already received, for example when retrieving a complete result set in order, page by page, or when a user 'scrolls' forwards one page.</p>
<p>The set of items that match a query MAY change over time, even during the time that a requesting entity pages through the result set (e.g., a set of chatrooms, since rooms can be created and destroyed at any time). The paging protocol outlined in this section is designed so that entities MAY provide the following features:</p>
<ul>
<li>Each page of the result set is up-to-date at the time it is sent (not just at the time the first page was sent).</li>
<li>No items will be omitted from pages not yet sent (even if, after earlier pages were sent, some of the items they contained were removed from the set).</li>
<li>When paging through the list in order, duplicate items are never received.</li>
<li>The responding entity maintains no state (or a single minimal state for all requesting entities containing the positions of all recently deleted items).</li>
<li>The responding entity can rapidly calculate which items should appear on a requested page (even for large result sets).</li>
</ul>
<p>Note: If a responding entity implements dynamic result sets, then requesting entities paging through the complete result set should be aware that the pages they receive may not correspond to the result set as it existed at any one point in time.</p>
<p>The request for the first page is the same as when <link url='#limit'>Limiting the Number of Items</link>:</p>
<example caption='Requesting the First Page of a Result Set'><![CDATA[
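<!-- Illustrative sketch only: the JIDs, the 'id' value, and the <nick/> search
     field are assumed. No <after/> or <before/> element is sent, so the page
     starts at the beginning of the result set. -->
<iq type='set' from='user@example.com/client' to='search.example.com' id='page1'>
  <query xmlns='jabber:iq:search'>
    <nick>Pete</nick>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <max>10</max>
    </set>
  </query>
</iq>
]]></example>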
<p>Responding entity support for paging through a result set is optional. If it does support paging (not just <link url='#limit'>Limiting the Number of Items</link>), then in each page it returns, the responding entity MUST include <first/> and <last/> elements that specify the unique ID (UID) for the first and last items in the page. If there is only one item in the page, then the first and last UIDs MUST be the same. If there are no items in the page, then the <first/> and <last/> elements MUST NOT be included.</p>
<p>The responding entity may generate these UIDs in any way, as long as the UIDs are unique in the context of all possible members of the full result set. Each UID MAY be based on part of the content of its associated item, as shown below, or on an internal table index. Another possible method is to serialize the XML of the item and then hash it to generate the UID. Note: The requesting entity MUST treat all UIDs as opaque.</p>
<p>The responding entity SHOULD also include the number of items in the full result set (which MAY be approximate) encapsulated in a <count/> element. The <first/> element SHOULD include an 'index' attribute. This integer specifies the position within the full set (which MAY be approximate) of the first item in the page. If that item is the first in the full set, then the index SHOULD be '0'. If the last item in the page is the last item in the full set, then the value of the <first/> element's 'index' attribute SHOULD be the specified count minus the number of items in the last page.</p>
<p>Note: The <count/> element and 'index' attribute enable important functionality for requesting entities (for example, a scroll-bar user-interface component). They MAY be omitted, but <em>only</em> if it would be either impossible or exceptionally resource intensive to calculate reasonably accurate values.</p>
<example caption='Returning the First Page of a Result Set'><![CDATA[
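<!-- Illustrative sketch only: the JIDs, item contents, and count are assumed.
     Each UID here is taken from the 'jid' attribute of its item, which is one
     of the generation methods the text permits. -->
<iq type='result' from='search.example.com' to='user@example.com/client' id='page1'>
  <query xmlns='jabber:iq:search'>
    <item jid='pete01@example.com'>
      <nick>Pete</nick>
    </item>
    <!-- 8 more items -->
    <item jid='pete10@example.com'>
      <nick>Pete</nick>
    </item>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <first index='0'>pete01@example.com</first>
      <last>pete10@example.com</last>
      <count>800</count>
    </set>
  </query>
</iq>
]]></example>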
<p>The requesting entity can then ask for the next page in the result set by including in its request the UID of the <em>last</em> item from the previous page (encapsulated in an <after/> element), along with the maximum number of items to return. Note: If no <after/> element is specified, then the UID defaults to "before the first item in the result set" (i.e., effectively an index of negative one).</p>
<example caption='Requesting the Second Page of a Result Set'><![CDATA[
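<!-- Illustrative sketch only: the JIDs, 'id' value, and UID are assumed.
     The <after/> element carries the UID of the last item of the previous
     page. -->
<iq type='set' from='user@example.com/client' to='search.example.com' id='page2'>
  <query xmlns='jabber:iq:search'>
    <nick>Pete</nick>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <max>10</max>
      <after>pete10@example.com</after>
    </set>
  </query>
</iq>
]]></example>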
<p>The <em>first</em> item in the page returned by the responding entity MUST be the item that immediately <em>follows</em> the item that the requesting entity indicated in the <after/> element:</p>
<example caption='Returning the Second Page of a Result Set'><![CDATA[
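<!-- Illustrative sketch only: the JIDs, item contents, and count are assumed.
     The first item returned immediately follows the item named in <after/>,
     so with ten items already received its index is 10. -->
<iq type='result' from='search.example.com' to='user@example.com/client' id='page2'>
  <query xmlns='jabber:iq:search'>
    <item jid='pete11@example.com'>
      <nick>Pete</nick>
    </item>
    <!-- 8 more items -->
    <item jid='pete20@example.com'>
      <nick>Pete</nick>
    </item>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <first index='10'>pete11@example.com</first>
      <last>pete20@example.com</last>
      <count>800</count>
    </set>
  </query>
</iq>
]]></example>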
<p>It may sometimes be necessary to return an empty <em>page</em> to the requesting entity. For example, with dynamic result sets the responding entity MAY delete some items from the full result set between requests. Another example occurs when the requesting entity specifies "0" for the maximum number of items to return (see <link url='#count'>Getting the Item Count</link>).</p>
<example caption='Returning an Empty Page'><![CDATA[
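<!-- Illustrative sketch only: the JIDs, 'id' value, and count are assumed.
     The page has no items, so <first/> and <last/> are omitted and only the
     count of the full result set is returned. -->
<iq type='result' from='search.example.com' to='user@example.com/client' id='page81'>
  <query xmlns='jabber:iq:search'>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <count>800</count>
    </set>
  </query>
</iq>
]]></example>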
<p>If there are no items whatsoever in the <em>full</em> result set, the responding entity MUST return a response that adheres to the definition of the wrapper protocol (e.g., "jabber:iq:search", "http://jabber.org/protocol/disco#items", or "http://jabber.org/protocol/pubsub"). For both <cite>XEP-0055</cite> and <cite>XEP-0030</cite>, that means the responding entity shall return an empty &QUERY; element; for <cite>XEP-0060</cite>, that means the responding entity shall return an empty <pubsub/> element; for <cite>XEP-0136</cite>, that means the responding entity shall return an empty <list/> or <store/> element.</p>
<section2 topic='Paging Backwards Through a Result Set' anchor='backwards'>
<p>The requesting entity MAY ask for the previous page in a result set by including in its request the UID of the <em>first</em> item from the page that has already been received (encapsulated in a <before/> element), along with the maximum number of items to return.</p>
<example caption='Requesting the Previous Page of a Result Set'><![CDATA[
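<!-- Illustrative sketch only: the JIDs, 'id' value, and UID are assumed.
     The <before/> element carries the UID of the first item of the page that
     was already received. -->
<iq type='set' from='user@example.com/client' to='search.example.com' id='back1'>
  <query xmlns='jabber:iq:search'>
    <nick>Pete</nick>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <max>10</max>
      <before>pete11@example.com</before>
    </set>
  </query>
</iq>
]]></example>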
<p>The <em>last</em> item in the page returned by the responding entity MUST be the item that immediately <em>precedes</em> the item that the requesting entity indicated it has already received:</p>
<example caption='Returning the Previous Page of a Result Set'><![CDATA[
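<!-- Illustrative sketch only: the JIDs, item contents, and count are assumed.
     The last item returned immediately precedes the item named in <before/>. -->
<iq type='result' from='search.example.com' to='user@example.com/client' id='back1'>
  <query xmlns='jabber:iq:search'>
    <item jid='pete01@example.com'>
      <nick>Pete</nick>
    </item>
    <!-- 8 more items -->
    <item jid='pete10@example.com'>
      <nick>Pete</nick>
    </item>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <first index='0'>pete01@example.com</first>
      <last>pete10@example.com</last>
      <count>800</count>
    </set>
  </query>
</iq>
]]></example>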
<p>The responding entity MUST reply with an 'item-not-found' error if <em>all</em> the following circumstances apply:</p>
<ol>
<li><p>The item specified by the requesting entity via the UID in the <after/> or <before/> element no longer exists (it was deleted after the previous page was sent).</p></li>
<li><p>The UID itself cannot be used to derive directly the next item within the set (e.g., the alphabetical or numerical order of the UIDs does not specify the order of the items).</p></li>
<li><p>The responding entity does not remember the position of the deleted item within the full list. (Even if the responding entity bothers to remember the position of each deleted item, it will typically be necessary to expire that 'state' after an implementation-specific period of time.)</p></li>
</ol>
<example caption='Returning a Page-Not-Found Error'><![CDATA[
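<!-- Illustrative sketch only: the JIDs, 'id' value, and UID are assumed, as
     is the choice to echo the original request payload in the error. The UID
     in <before/> names an item that no longer exists. -->
<iq type='error' from='search.example.com' to='user@example.com/client' id='back1'>
  <query xmlns='jabber:iq:search'>
    <nick>Pete</nick>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <max>10</max>
      <before>pete11@example.com</before>
    </set>
  </query>
  <error type='cancel'>
    <item-not-found xmlns='urn:ietf:params:xml:ns:xmpp-stanzas'/>
  </error>
</iq>
]]></example>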
<section2 topic='Requesting the Last Page in a Result Set' anchor='last'>
<p>The requesting entity MAY ask for the last page in a result set by including in its request an empty <before/> element, and the maximum number of items to return.</p>
<example caption='Requesting the Last Page of a Result Set'><![CDATA[
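<!-- Illustrative sketch only: the JIDs, 'id' value, and <nick/> search field
     are assumed. The empty <before/> element asks for the final page. -->
<iq type='set' from='user@example.com/client' to='search.example.com' id='last1'>
  <query xmlns='jabber:iq:search'>
    <nick>Pete</nick>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <max>10</max>
      <before/>
    </set>
  </query>
</iq>
]]></example>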
<section2 topic='Retrieving a Page Out of Order' anchor='jump'>
<p>The requesting entity MAY choose not to retrieve pages from the result set in order. (For example, when its user drags a user-interface slider to a radically new position within a very large result set.)</p>
<p>The requesting entity MAY request the page that <em>starts</em> at a particular index within the result set, but only if the UID before the start (or after the end) of the desired result set page is not known. It does that by including in its request the index of the <em>first</em> item to be returned (encapsulated in an <index/> element), as well as the maximum number of items to return. Note: For reasons mentioned in <link url='#forwards'>Paging Forwards Through a Result Set</link>, requesting entities SHOULD, where possible, specify pages using a UID instead of an index.</p>
<p>Note: If the responding entity omitted the <count/> element from previous responses for this result set, then the requesting entity SHOULD assume that the responding entity does not support page retrieval by index for this result set (see error below).</p>
<example caption='Requesting a Result Page by Index'><![CDATA[
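<!-- Illustrative sketch only: the JIDs, 'id' value, and index are assumed.
     The <index/> element gives the zero-based position of the first item
     wanted. -->
<iq type='set' from='user@example.com/client' to='search.example.com' id='index10'>
  <query xmlns='jabber:iq:search'>
    <nick>Pete</nick>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <max>10</max>
      <index>371</index>
    </set>
  </query>
</iq>
]]></example>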
<p>The responding entity SHOULD derive the first UID from the specified index (the method used MAY be approximate) before returning the requested result set page in the normal way. If the specified index was "0" then the responding entity SHOULD derive the UID that is the first in the full result set.</p>
<p>Note: The 'index' attribute of the <first/> element MUST be the same as the index specified in the request. If the index specified by the requesting entity is greater than or equal to the number of items in the full set then the responding entity MUST return an empty page (see <link url='#forwards'>Paging Forwards Through a Result Set</link>).</p>
<example caption='Returning a Result Page at an Index'><![CDATA[
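<!-- Illustrative sketch only: the JIDs, item contents, and count are assumed.
     The 'index' attribute of <first/> repeats the index from the request. -->
<iq type='result' from='search.example.com' to='user@example.com/client' id='index10'>
  <query xmlns='jabber:iq:search'>
    <item jid='pete372@example.com'>
      <nick>Pete</nick>
    </item>
    <!-- 8 more items -->
    <item jid='pete381@example.com'>
      <nick>Pete</nick>
    </item>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <first index='371'>pete372@example.com</first>
      <last>pete381@example.com</last>
      <count>800</count>
    </set>
  </query>
</iq>
]]></example>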
<p>If it would be either impossible or exceptionally resource intensive for the responding entity to derive the first UID from the specified index with reasonable accuracy then the responding entity MAY return a &e501; error.</p>
<example caption='Returning a Feature-not-Implemented Error'><![CDATA[
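<!-- Illustrative sketch only: the JIDs, 'id' value, and index are assumed, as
     is the choice to echo the original request payload in the error. -->
<iq type='error' from='search.example.com' to='user@example.com/client' id='index10'>
  <query xmlns='jabber:iq:search'>
    <nick>Pete</nick>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <max>10</max>
      <index>371</index>
    </set>
  </query>
  <error type='cancel'>
    <feature-not-implemented xmlns='urn:ietf:params:xml:ns:xmpp-stanzas'/>
  </error>
</iq>
]]></example>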
<section2 topic='Getting the Item Count' anchor='count'>
<p>In order to get the item count of a result set without retrieving the items themselves, the requesting entity simply specifies zero for the maximum size of the result set page:</p>
<example caption='Requesting the Item Count'><![CDATA[
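<!-- Illustrative sketch only: the JIDs, 'id' value, and <nick/> search field
     are assumed. A <max/> of zero asks for the count without any items. -->
<iq type='set' from='user@example.com/client' to='search.example.com' id='count1'>
  <query xmlns='jabber:iq:search'>
    <nick>Pete</nick>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <max>0</max>
    </set>
  </query>
</iq>
]]></example>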
<p>The responding entity then returns the item count, which MAY be approximate rather than precise if determining the exact number of items would be resource-intensive:</p>
<example caption='Returning the Item Count'><![CDATA[
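<!-- Illustrative sketch only: the JIDs, 'id' value, and count are assumed.
     No items and no <first/> or <last/> elements are returned. -->
<iq type='result' from='search.example.com' to='user@example.com/client' id='count1'>
  <query xmlns='jabber:iq:search'>
    <set xmlns='http://jabber.org/protocol/rsm'>
      <count>800</count>
    </set>
  </query>
</iq>
]]></example>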
<p>Note: The <count/> element MAY be omitted, but <em>only</em> if it would be either impossible or exceptionally resource intensive to calculate reasonably accurate values.</p>
<p>Note: If there are no items in the <em>full</em> result set then the responding entity MUST return a response that adheres to the definition of the wrapper protocol (see <link url='#forwards'>Paging Forwards Through a Result Set</link>).</p>
<p>The foregoing examples show the use of result set management in the context of <cite>Jabber Search</cite>. In the following examples we show the use of this protocol in the context of <cite>Service Discovery</cite>. <cite>XEP-0136</cite> ("Message Archiving") includes more examples. A future version of this document may also include examples from <cite>Publish-Subscribe</cite> and other XMPP protocol extensions.</p>
<p>In order for a requesting entity to determine if a responding entity supports result set management, it SHOULD send a <cite>Service Discovery</cite> information request to the responding entity:</p>
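<p>A minimal sketch of such an exchange follows. The JIDs and 'id' value are assumed for illustration, and a real service would normally advertise further features alongside the result set management namespace.</p>
<example caption='Requesting and Returning Service Discovery Information (Illustrative)'><![CDATA[
<iq type='get' from='user@example.com/client' to='search.example.com' id='disco1'>
  <query xmlns='http://jabber.org/protocol/disco#info'/>
</iq>

<iq type='result' from='search.example.com' to='user@example.com/client' id='disco1'>
  <query xmlns='http://jabber.org/protocol/disco#info'>
    <feature var='jabber:iq:search'/>
    <feature var='http://jabber.org/protocol/rsm'/>
  </query>
</iq>
]]></example>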
<p>An entity SHOULD NOT include the result set management extensions defined in this document in its requests if it does not have positive knowledge that the responding entity supports the protocol defined herein. If the responding entity does not understand result set management, it MUST ignore such extensions.</p>
<p>Note: Even if a responding entity understands the result set management protocol, its support for result set management in the context of any given using protocol is OPTIONAL (e.g., an implementation could support it in the context of the 'jabber:iq:search' namespace but not in the context of the 'http://jabber.org/protocol/disco#items' namespace). Currently the only way for a requesting entity to determine if a responding entity supports result set management in the context of a given using protocol is to include result set management extensions in its request. If the responding entity does not include result set management extensions in its response, then the requesting entity SHOULD NOT include such extensions in future requests wrapped by the using protocol namespace.</p>
<p>Security considerations are the responsibility of the using ("wrapper") protocol, such as <cite>XEP-0030</cite> for the 'http://jabber.org/protocol/disco#items' namespace, <cite>XEP-0055</cite> for the 'jabber:iq:search' namespace, and <cite>XEP-0136</cite> for the 'http://jabber.org/protocol/archive' namespace.</p>