%ents; ]>
Order-By This specification allows to change order of items retrieval in a Pubsub or MAM query &LEGALNOTICE; 0413 Experimental Standards Track Standards Council XMPP Core NOT_YET_ASSIGNED Jérôme Poisson goffi@goffi.org goffi@jabber.fr 0.2 2021-07-21 jp

Add a way to discover on which protocols Order-By applies

Remove references to SQL (except in implementation notes)

Specify that order-by operate on the whole item set and inside a RSM result set

Explicitly says that creation and modification dates are set by Pubsub service itself

Specify that Clark notation should be used for extensions

Add a full example with Pubsub and RSM

Add hint for SQL based implementations

removed XEP-0060 and XEP-0313 as dependencies, they are mentioned as use cases, but are not mandatory

better wording following feedback

Namespace bump

0.1.1 2019-08-20 edhelas

Editorial language fixes

0.1.0 2019-02-04 XEP Editor (jsc) Accepted by vote of Council on 2018-01-09. 0.0.1 2019-01-05 jp

First draft.

&xep0060; §6.5.7 allows to retrieve the "most recent items" and &xep0313; state in §3.1 that archives are ordered in "chronological order". While this order is straightforward in general use cases, it is sometimes desirable to use a different order, for instance while using &xep0277;: a spelling mistake correction should not bring an old blog post to the top of retrieved items.

This specification allows to explicitly change business logic to retrieve the items in a different order.

In XEP-0060, there is no such thing as "updated item". This XEP changes the business logic as follow:

Juliet wants to retrieve plays of her favorite writer, William Shakespeare. She wants to retrieve the 3 most recent ones by date of creation.

To do so, her client do a regular Pubsub request, but adds the <order> element as a children of the <pubsub> element with the "urn:xmpp:order-by:1" namespace, a by attribute equal to creation and a desc attribute equal to true.

]]>

The Pubsub service then returns the 3 most recently created plays, first one being the most recent.

Henry VIII Tempest Wintter's Tale ]]>

Juliet realizes that there is a spelling mistake, it's "Winter's Tale" and not "Wintter's Tale". She fixes it by overwritting the item:

Winter's Tale ]]>

To check that everything is alright, she requests again the last 3 items, but this time by date of modification. To do so, the client proceeds the same way as for date of creation, except that it uses the value modification for the by attribute.

]]>

The Pubsub service returns again the 3 plays but the "Winter Tales" item has been overwritten recently, while the 2 others have never been overwritten, so it returns the items in the following order, with the most recently modified item on top:

Winter's Tale Henry VIII Tempest ]]>

With &xep0313; the logic is the same, but the <order> element is added as a child of the <query> element:

]]>

This way, filters can be used with a specific ordering.

By default, ordering MUST be done in ascending order. This can be reversed by using the desc boolean attribute, which MAY have a value of either true or 1.

This section provides a full example of using Order-By with Pubsub and RSM. For readability, we'll use a node with 4 items that will have following IDs (in order of their creation) A, B, C and D. Items C has been overwritten after D creation, and item A has been overwritten even later. Thus, when ascending creation order is requested, items are in order A, B, C, D. When ascending modification order is requested, items are in order B, D, C, A.
Let's see how this work when Juliet wants to retrieve all items in ascending modification order with RSM using a page size of 2 items:

2 ]]> item B item D B D 4 ]]>

Now Juliet wants to get the second and last page to complete her collection. She does this as usual with RSM, by using the value advertised in <last> element in a <after> element.

NOTE: in this example the value used in <last> element is the item ID, but as specified in &xep0059;, an implementation MAY use whatever makes sense to it, the requesting client MUST treat this as an opaque value.

2 D ]]> item C item A C A 4 ]]>

Juliets wonders which are the 2 last items created. To discover this, she request again the node, but this time with a creation order field, and in descending order:

2 ]]> item D item C D C 4 ]]>

Now she knows that last created item is D, and the one created before is C.

Please note that items are in descending order in the whole result set but also inside the RSM page (thus the first item here is D), and that in this order, this request returns the first page, so index is 0 here.

If Juliet wanted to retrieve the second page of items by descending order of creation, she would do like this:

2 C ]]> item B item A B A 4 ]]>

This specification can be extended by further XEPs, proposing other kind of ordering in the 'by' attribute (e.g. ordering by filename for a file sharing service). But this is beyond the scope of this XEP, and a client should not assume that other ordering than "creation" and "modification" are available without further negotiation. Any new ordering specified in a other XEP SHOULD use the Clark notation to avoid any collision (i.e.: {some_namespace}some_ordering).

It is important to note the following points:

If a server supports the "order by" protocol, it MUST advertize it including the "urn:xmpp:order-by:1" discovery feature &NSNOTE; in response to a &xep0030; information request.
In addition to the general feature support, an entity MUST indicated on which protocols Order-By can be used, by using the notation urn:xmpp:order-by:1@other_namespace, i.e. a concatenation of:

So if Order-By is implemented for &xep0060;, the service MUST advertise urn:xmpp:order-by:1@http://jabber.org/protocol/pubsub. If Order-By is implemented for &xep0313;, it is urn:xmpp:order-by:1@urn:xmpp:mam:2.
In the following example, the server example.org advertizes Order-By support, and indicates that it is implemented for Pubsub and MAM:

]]> ]]>

Several ordering elements may be used, this allows to solve next levels of ordering in case of equality. In this case, the first ordering (i.e. the top most <order> element) is the main one, the second <order> element is used in case of equality, then the next one if a new equality happens and so on.

In case of equality, if no new <order> element is specified, the item order is not guaranteed and is up to the implementation (the implementation MUST keep this order consistent across requests though).

It may be difficult to find a correct value for <first> and <last> elements of RSM. Indeed, internal ID of items can't be suited for all orderings. For Pubsub service using a SQL database as backend, item ID (XMPP or internal) could be used with a window function such as row_number (supported by major database engines such as PostgreSQL, MariaDB/MySQL or SQLite) over the requested ordering. For instance, on a hypothetical table where items are requested by ascending creation then modification dates after the value ABC (which correspond to XMPP item ID in our case), a request similar to this could be used:

(SELECT cte_1.item_index FROM cte_1 WHERE cte_1.id = "ABC") ORDER BY cte_1.item_index ASC LIMIT 10; ]]>

In this example, row_number is decreased by 1 to match RSM index (row_number starts at 1 while RSM index starts at 0), thus the item_index column can be used directly to fill RSM metadata. A Common Table Expression has been used for better readability.

This document introduces no additional security considerations above and beyond those defined in the documents on which it depends.

This document requires no interaction with &IANA;.

This specification defines the following XML namespace:

  • 'urn:xmpp:order-by:1'
&NSVER;
]]>

Thanks to Philipp Hörist, Evgeny xramtsov, Jonas Schäfer¸ and Holger Weiß for their feedback.