File Repository and Sharing

Abstract: While a protocol has been described for initiating a file transfer from one user to another, there is not yet a defined way for users to designate a set of files as available for retrieval by other users of their choosing. This extension defines a common syntax for this purpose which is based on PubSub Collections.

Number: 0214
Status: Experimental
Type: Standards Track
SIG: Standards JIG
Approver: Council
Dependencies: XMPP Core, XEP-0001, XEP-0060, XEP-0082, XEP-0137, XEP-0248
Short name: NOT_YET_ASSIGNED
Author: Nicholas Parker (email: nickbp@gmail.com, JID: nickp@jabber.org)

0.2 2009-01-05 nbp

Non-functional rewording and refactoring. Now reflects the move of PubSub Collections into a separate extension.

0.1 2007-04-20 psa

Initial published version.

0.0.3 2007-03-18 nbp

Added support for Stream Initiation Requests. Clarified purpose of additional mirror types.

0.0.2 2007-01-26 nbp

Rewritten to use Pubsub.

0.0.1 2006-09-01 nbp

First draft.

This document defines a common format which allows a Jabber user to submit, find, and retrieve files within XMPP. The file listing itself is stored on a Publish-Subscribe (XEP-0060) server, allowing multiple users to manage the same listing. Other features specified by this extension include file metadata, revisions, and download mirrors.

Files provided in a listing MAY be retrieved through any suitable data transfer protocol. This extension standardizes the use of Publishing Stream Initiation Requests (XEP-0137) to establish the file transfer, and also allows the use of outside protocols such as HTTP or FTP.

The protocol defined herein provides the following functionality:

  1. Publication of a list of available files to a PubSub server, with support for hierarchical listings, file metadata, user privileges, and file versioning.
  2. Request that a file be sent from a Jabber contact directly to oneself using Stream Initiation.

File Listing: A Root-level PubSub Collection Node, containing information about files and/or subsections which a user or group of users have published.
Subsection: A Non-Root Collection Node which contains files and/or other subsections.
File: A PubSub Node, stored within a File Listing, which describes all revisions of a given file. The filename and (optional) description are provided here.
Revision: A PubSub Item which describes a given file revision. Other metadata which can vary between revisions, such as file size, checksum, or available mirrors, is provided here.
Mirror: A location which has a given Revision available for download. Additional information about a Mirror MAY be specified in instances where the protocol requires it. A list of example file transfer protocols is provided below, but others may also be deployed.

The following use cases describe tasks which are already covered by Publish-Subscribe (XEP-0060) in a more generic context. They are repeated here to demonstrate the functionality provided by this protocol and to convey the structure and syntax of the file listing. Because of this close relationship, many details of PubSub are omitted here for brevity. Consult Publish-Subscribe (XEP-0060) and PubSub Collection Nodes (XEP-0248) for the full specification of node and user management commands as well as their server responses.

Juliet wishes to make her sonnets available for retrieval by the public. She creates a Root Pubsub Collection Node which will contain her file listing:

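Example values: node configuration form (FORM_TYPE http://jabber.org/protocol/pubsub#node_config) with node type "collection"; metadata form (FORM_TYPE http://jabber.org/protocol/pubsub#meta-data) with title "Juliet's Sonnets" and description "Optional Description".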
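As an illustration, a node creation request carrying these values might look like the stanza below. It is a minimal sketch that follows the create-and-configure syntax of XEP-0060 and XEP-0248, not the original example. The JIDs, the stanza id, and the use of "juliets_sonnets" as the Root Node's ID are assumptions (the node name is borrowed from the collection value that appears in the later examples). Two forms are shown only because the original example carries two FORM_TYPE values; the sketch does not assert that every server accepts a meta-data form at creation time.

```xml
<!-- Sketch only: the JIDs, the id, and the node name are assumed for illustration. -->
<iq type='set'
    from='juliet@capulet.com/chamber'
    to='pubsub.shakespeare.lit'
    id='create1'>
  <pubsub xmlns='http://jabber.org/protocol/pubsub'>
    <create node='juliets_sonnets'/>
    <configure>
      <x xmlns='jabber:x:data' type='submit'>
        <field var='FORM_TYPE' type='hidden'>
          <value>http://jabber.org/protocol/pubsub#node_config</value>
        </field>
        <field var='pubsub#node_type'><value>collection</value></field>
      </x>
      <x xmlns='jabber:x:data' type='submit'>
        <field var='FORM_TYPE' type='hidden'>
          <value>http://jabber.org/protocol/pubsub#meta-data</value>
        </field>
        <field var='pubsub#title'><value>Juliet's Sonnets</value></field>
        <field var='pubsub#description'><value>Optional Description</value></field>
      </x>
    </configure>
  </pubsub>
</iq>
```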

Juliet also wishes to add a subsection for her sonnets about Romeo. She creates another PubSub Collection Node under the Root Node:

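Example values: node configuration form (FORM_TYPE http://jabber.org/protocol/pubsub#node_config) with parent collection "juliets_sonnets" and node type "collection"; metadata form (FORM_TYPE http://jabber.org/protocol/pubsub#meta-data) with title "Sonnets About Romeo" and description "Optional Description".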
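A subsection request could be sketched in the same way. The only structural difference from creating a Root Node is the `pubsub#collection` field from XEP-0248, which names the parent collection. The node ID "sonnets_about_romeo", the JIDs, and the stanza id are hypothetical.

```xml
<!-- Sketch only: the node ID, JIDs, and id are assumed for illustration. -->
<iq type='set'
    from='juliet@capulet.com/chamber'
    to='pubsub.shakespeare.lit'
    id='create2'>
  <pubsub xmlns='http://jabber.org/protocol/pubsub'>
    <create node='sonnets_about_romeo'/>
    <configure>
      <x xmlns='jabber:x:data' type='submit'>
        <field var='FORM_TYPE' type='hidden'>
          <value>http://jabber.org/protocol/pubsub#node_config</value>
        </field>
        <!-- Parent collection: places this node under the Root Node. -->
        <field var='pubsub#collection'><value>juliets_sonnets</value></field>
        <field var='pubsub#node_type'><value>collection</value></field>
      </x>
      <x xmlns='jabber:x:data' type='submit'>
        <field var='FORM_TYPE' type='hidden'>
          <value>http://jabber.org/protocol/pubsub#meta-data</value>
        </field>
        <field var='pubsub#title'><value>Sonnets About Romeo</value></field>
        <field var='pubsub#description'><value>Optional Description</value></field>
      </x>
    </configure>
  </pubsub>
</iq>
```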

Romeo wishes to view all of Juliet's shared sonnets. To do this, Romeo subscribes to the Root Collection Node:

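Example values: subscription options form (FORM_TYPE http://jabber.org/protocol/pubsub#subscribe_options) with subscription type "items" and subscription depth "all".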
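A subscription with these options might be expressed as follows. This sketch uses the subscribe-and-configure syntax of XEP-0060 together with the `pubsub#subscription_type` and `pubsub#subscription_depth` options from XEP-0248. The JIDs, the stanza id, and the Root Node ID "juliets_sonnets" are assumptions.

```xml
<!-- Sketch only: the JIDs, the id, and the node name are assumed for illustration. -->
<iq type='set'
    from='romeo@montague.net/orchard'
    to='pubsub.shakespeare.lit'
    id='sub1'>
  <pubsub xmlns='http://jabber.org/protocol/pubsub'>
    <subscribe node='juliets_sonnets' jid='romeo@montague.net'/>
    <options node='juliets_sonnets' jid='romeo@montague.net'>
      <x xmlns='jabber:x:data' type='submit'>
        <field var='FORM_TYPE' type='hidden'>
          <value>http://jabber.org/protocol/pubsub#subscribe_options</value>
        </field>
        <!-- "items": receive published Items rather than only node changes. -->
        <field var='pubsub#subscription_type'><value>items</value></field>
        <!-- "all": include every descendant of the collection. -->
        <field var='pubsub#subscription_depth'><value>all</value></field>
      </x>
    </options>
  </pubsub>
</iq>
```

With a depth of "all", Romeo would be notified about files in subsections as well as files stored directly in the Root Node.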

Juliet has just finished a new sonnet and wishes to announce its availability on her File Listing. She adds the sonnet as a new PubSub Node stored in her Collection Node, then inserts a first revision of her sonnet as an Item within that Node:

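Example values: node configuration with parent collection "juliets_sonnets"; metadata form (FORM_TYPE http://jabber.org/protocol/pubsub#meta-data) with filename "sonnet.txt" and description "Sonnet 42"; Item with size 5623, modified time 2006-12-13T18:30:02Z, checksum 59282c5db190bdc3b152c5b38363442bfda8ebdd, MIME type text/plain, description "My Latest Sonnet!", and mirror values "sonnet.txt", "/source/23A53F01/", and "/preview/90266EA1/".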

The Item ID is set to 1, signifying the first revision for this file. Subsequent revisions/items will have incremented ID values, like one would see in a versioning system such as CVS or SVN. Implementations MAY follow this convention, but are not required to do so. For example, a given implementation may instead mark revisions using version numbers ("Beta 1", "6.2", etc) or use other arbitrary strings. However, no two revisions of a given file may share the same ID.

Node IDs MAY take the form of "path/to/file.ext", rather than the randomized string "a6190c5d38e22452041d1c5798eff3f5" provided in the above use case. For example, Juliet's sonnet MAY instead use a Node ID of "juliets_sonnets/sonnet.txt", as long as this ID is unique within the PubSub server. Randomized strings are used in this document to illustrate that Node IDs SHOULD NOT be used for providing information about files.

The following metadata fields can appear in a file revision (Item); each field is OPTIONAL:

Size: The size, in bytes, of the file.
Modified: The last modified time of the revision, in the format described in XMPP Date and Time Profiles (XEP-0082). If a publisher prefers to make only a single revision available to clients, the publisher MAY instead update this value (and others, such as size or checksum) to announce that a new version of the file is available.
Checksum: A checksum of the revision, using the specified hash algorithm. Acceptable types are "sha512", "sha1", "md5", and "crc32".
Mime: The file's MIME type.
Description: Description text for the revision. For example, it could contain release notes.
Mirrors: A list of mirrors; their properties are defined below. If no downloads are available, this list MAY be left empty or omitted entirely.
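To make the shape of a revision concrete, the sketch below publishes an Item carrying these fields. The surrounding publish syntax is that of XEP-0060. The payload is hypothetical: the namespace `urn:example:file-listing` and the element names (`file`, `size`, `modified`, `checksum`, `mime`, `desc`, `mirrors`) are placeholders invented for illustration, not the names defined by this extension. The checksum type "sha1" is inferred from the 40-digit length of the value, and the JIDs and stanza id are assumed.

```xml
<!-- Sketch only: the payload namespace and element names are hypothetical placeholders. -->
<iq type='set'
    from='juliet@capulet.com/chamber'
    to='pubsub.shakespeare.lit'
    id='publish1'>
  <pubsub xmlns='http://jabber.org/protocol/pubsub'>
    <publish node='a6190c5d38e22452041d1c5798eff3f5'>
      <!-- Item ID 1 marks the first revision of the file. -->
      <item id='1'>
        <file xmlns='urn:example:file-listing'>
          <size>5623</size>
          <modified>2006-12-13T18:30:02Z</modified>
          <checksum type='sha1'>59282c5db190bdc3b152c5b38363442bfda8ebdd</checksum>
          <mime>text/plain</mime>
          <desc>My Latest Sonnet!</desc>
          <!-- Left empty here, which the text permits when no downloads are listed. -->
          <mirrors/>
        </file>
      </item>
    </publish>
  </pubsub>
</iq>
```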

Because Romeo is now subscribed, he receives notice of Juliet's addition:

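Notification values: node configuration form (FORM_TYPE http://jabber.org/protocol/pubsub#node_config) with parent collection "juliets_sonnets"; metadata form (FORM_TYPE http://jabber.org/protocol/pubsub#meta-data) with description "Sonnet 42" and filename "sonnet.txt"; Item with size 5623, modified time 2006-12-13T18:30:02Z, checksum 59282c5db190bdc3b152c5b38363442bfda8ebdd, MIME type text/plain, description "My Latest Sonnet!", and mirrors (... MIRRORS ...).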

The above examples give a listing of several possible file transfer protocols in example configurations. Only the sipub mirror type is REQUIRED; the other types are OPTIONAL. Here is a full listing of those protocols and their available settings:

Protocol          Description  Ref       Address   Port (default)  User      Pass
sipub (REQUIRED)  OPTIONAL     N/A       N/A       N/A             N/A       N/A
http (OPTIONAL)   OPTIONAL     REQUIRED  REQUIRED  OPTIONAL (80)   OPTIONAL  OPTIONAL
https (OPTIONAL)  OPTIONAL     REQUIRED  REQUIRED  OPTIONAL (443)  OPTIONAL  OPTIONAL
ftp (OPTIONAL)    OPTIONAL     REQUIRED  REQUIRED  OPTIONAL (21)   OPTIONAL  OPTIONAL
sftp (OPTIONAL)   OPTIONAL     REQUIRED  REQUIRED  OPTIONAL (22)   OPTIONAL  OPTIONAL
smb (OPTIONAL)    OPTIONAL     REQUIRED  REQUIRED  OPTIONAL (445)  OPTIONAL  OPTIONAL

The Description field is where an arbitrary description of the mirror MAY be placed. For example, if a File Listing is advertising mirrors which are located in different geographic locations, then this field may be used to specify those locations.

The Ref field is a unique address or identifier for retrieving the file from the mirror server. In the above examples, it is used as a path to the file.

The Address and Port fields describe the server where the file may be retrieved using the specified protocol. If a port is not provided, the default value (given in parentheses in the table) MAY be assumed.

The User and Pass fields are for providing credentials which, if given by the File Listing, SHOULD be used when requesting the file. For example, an sftp mirror MAY require that the user log in using specified credentials before the file may be retrieved.

Juliet has revised her sonnet and wishes to publish the new version, while still leaving the original copy available for retrieval. To do this, she inserts a new Item, representing her new revision, into the file's Node:

Example values: Item with size 6102, modified time 2007-01-13T18:30:02Z, checksums 6aaa20212a99548765b3b15f24f19aaa and 97cbc0e445435af94db5cc2133b94ab5faf1399a, MIME type text/plain, and description "A revised copy, fixed some spelling errors."

Juliet has uploaded a copy of her revised sonnet to a new mirror, and wishes to let her subscribers know about this secondary source. She is able to do this by modifying the revision in question to include a reference to her website, overwriting the existing mirrors in the Item with an updated list:


Juliet now wishes to allow others to contribute to her sonnet collection. She gives owner access for the entire Listing to Romeo, and publisher access to her nurse:

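Such a change could be made with the affiliation management syntax of XEP-0060, sketched below. The JIDs (including the nurse's), the stanza id, and the Root Node ID "juliets_sonnets" are assumptions. Whether affiliations set on a collection extend to its child nodes depends on the service and is not asserted here.

```xml
<!-- Sketch only: the JIDs, the id, and the node name are assumed for illustration. -->
<iq type='set'
    from='juliet@capulet.com/chamber'
    to='pubsub.shakespeare.lit'
    id='affil1'>
  <pubsub xmlns='http://jabber.org/protocol/pubsub#owner'>
    <affiliations node='juliets_sonnets'>
      <affiliation jid='romeo@montague.net' affiliation='owner'/>
      <affiliation jid='nurse@capulet.com' affiliation='publisher'/>
    </affiliations>
  </pubsub>
</iq>
```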

Romeo uses his owner access to remove the older revision of Juliet's sonnet:

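Removing a revision corresponds to retracting its Item, which in XEP-0060 syntax might look like the sketch below. The node ID is the randomized string mentioned earlier in this document and the Item ID 1 is the first revision; the JIDs and stanza id are assumed.

```xml
<!-- Sketch only: the JIDs and id are assumed for illustration. -->
<iq type='set'
    from='romeo@montague.net/orchard'
    to='pubsub.shakespeare.lit'
    id='retract1'>
  <pubsub xmlns='http://jabber.org/protocol/pubsub'>
    <retract node='a6190c5d38e22452041d1c5798eff3f5'>
      <!-- Item ID 1 is the original (older) revision. -->
      <item id='1'/>
    </retract>
  </pubsub>
</iq>
```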

Other deletion, modification, and user management operations are available as described in Publish-Subscribe (XEP-0060) and PubSub Collection Nodes (XEP-0248).

Romeo is interested in seeing what files Juliet has made available. To do this, Romeo sends a request to Juliet for repositories which she is associated with:


Juliet responds with a list of PubSub nodes where she has published files or which she believes would be interesting to Romeo. If no such locations exist, Juliet SHOULD respond with an empty list.


After browsing Juliet's repository, Romeo has chosen to download her sonnet. The most recent revision of this file contains a listing of available mirrors, and Romeo sees that one of them is an SI stream. Romeo sends an SI request to that mirror:

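A request of this kind might look like the following, using the `start` element that XEP-0137 defines for triggering a published stream initiation request. The sipub id "publish-0123", the JIDs, and the stanza id are hypothetical; in practice the id would come from the mirror entry in the revision.

```xml
<!-- Sketch only: the sipub id, the JIDs, and the stanza id are assumed for illustration. -->
<iq type='get'
    from='romeo@montague.net/orchard'
    to='juliet@capulet.com/chamber'
    id='sipub-request-0'>
  <start xmlns='http://jabber.org/protocol/sipub' id='publish-0123'/>
</iq>
```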

The rest of the negotiation and file transfer occurs as described in Publishing Stream Initiation Requests (XEP-0137).

Since PubSub is used for the File Listing, the access models described in Publish-Subscribe (XEP-0060) and PubSub Collection Nodes (XEP-0248) MUST be followed. Users MUST NOT be able to view or control information in the File Listing to which they do not have access.

If user access to files is restricted, the Mirror servers and the PubSub server MUST be able to synchronize these restrictions between them. See Security Considerations.

When restricted files are being distributed, mirrors need to know which users have sufficient privileges to access which files. If mirrors are not kept up to date on user privileges, unauthorized users could access files directly from those mirrors, thus bypassing any restrictions being set on the PubSub server.

No interaction with the Internet Assigned Numbers Authority (IANA) is required as a result of this XEP.

TODO

TODO