**Binary Network Protocol** # Goals * Backward, forward and sideward compatible * Low bandwidth * Encoder and decoder are easy to implement without libraries * Tools can create human-readable representations of messages # Design decisions ## Built on top of reliable stream-oriented protocols Messages are self-delimited. Their size can be inferred from their content and information agreed upon by server and client beforehand. A future version may remove redundant size information if it can be inferred from the underlying protocol. The protocol does not handle error correction, message ordering or retransmission of lost messages. If such features are required they are to be implemented by the next layer. This may be TCP. The only requirement for this protocol is that the initial message has to be decoded without error before any further messages can be decoded. ## Messages are made of fields To accomplish backward, forward and sideward compatibility, messages are made of fields. When a connection is established, the server sends a list of fields it can offer to the client, the client sends the subset of fields it understands back to the server. Further messages contain values for this subset of fields and may be asynchronous. This allows implementations to add fields without breaking compatibility with older versions or other implementations. A field can be replaced by letting the server offer both the original field and its replacement, and letting the client choose the preferred version. The order of the fields in the server message dictates the order of values in the messages. Server and client may agree beforehand on restrictions on this order. Clients may ignore fields if they understand them but don't support them in the order given by the server. This simplifies encoding and decoding for client and server as they can be implemented as a list of fixed optional steps. Fields are identified by UUIDs. This allows developers to generate IDs for new fields without coordination. UUIDs were chosen because they are a well established standard. Shorter random IDs could be used, but as field IDs are only transmitted in the initial messages, their size is not a concern. Human readable names or type information could be included in the initial messages, but these are not necessary for server and client. ## Fields are either of fixed size or of variable size This is a small restriction on the values of a field and allows decoders to skip fields they don't recognize when reading messages between two parties, as long as they know whether the field has a variable size or know its fixed size. ## A server may provide a JSON document with encoding information for tools As server and client exchange lists of fields they understand, it can be assumed that they know how to encode the fields in subsequent messages. A debugging tool reading messages between two parties needs additional information to decode messages. This information may be provided by the server as a JSON document. This document may contain a human readable name and type information for each field UUID understood by the server. A debugging tool may request this document. At a minimum, this document should assign each field whether it has a fixed or variable length, and its fixed length if it has a fixed length. ## Fields can have multiple types It may be possible for the value of a field to be interpreted in multiple ways. Types are identified by UUIDs to allow applications to define their own. Types can have type-specific parameters, which are represented as arbitrary JSON objects. A tool decoding a message may choose an interpretation it understands from the list to display the values. The list of interpretations of a field must include one of fixed size byte array or variable size byte array. # Specification ## Backus-Naur form ```BNF ::= ::= ::= ::= ::= ::= ::= | ::= | ::= ::= | ::= | ::= ::= ``` `` and `` are the network streams sent by server and client respectively. `` and `` are one byte each, reserved for future use, and must be set to zero when encoding messages, and ignored when decoding messages. `` is a LEB128 encoded integer representing the size of the following expression in bytes. `` is a 128 bit big-endian encoded UUID. `` must not contain any ``s not contained in ``. `` contains one `` per `` in ``. `` is an arbitrary field-specific string of bytes, following an encoding the developers of server and client agree upon beforehand. ## JSON schema A server may offer additional information to fields for tools as a JSON document. This document should be accessible through a HTTPS request from the server. This JSON document should follow the following schema. ```json { "type": "object", "properties": { "fields": { "type": "object", "additionalProperties": false, "patternProperties": { "[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}": { "type": "object", "properties": { "name": { "type": "string" }, "type": { "type": "object", "additionalProperties": false, "patternProperties": { "[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}": { "type": "object" } } } }, "required": [ "type" ] } } } }, "required": [ "fields" ] } ``` ## Predefined types ### Variable length field `1bc08826-7d62-459b-b8aa-ca09924b7bf8` Variable length fields have values of arbitrary byte strings prefixed by a LEB128 encoded size in bytes (not including the size bytes). Its parameters must be empty. ```json { "type": "object", "additionalProperties": false } ``` ### Fixed length field `6cc2b827-0ca4-43ea-901f-37c683f20397` The values of fixed length fields are a fixed number of arbitrary bytes. The number of bytes in a value is specified in the parameters of the type. The size may be 0, meaning its values take up no space in the messages. 0-sized values may be used to modify the behavior of other fields as defined in their type. ```json { "type": "object", "additionalProperties": false, "properties": { "size": "integer" }, "required": ["size"] } ``` # Suggested types This section is non-normative. ## Table `1ab68366-7ee6-4388-82f3-a13b2a2e1094` A variable number of rows, each row is itself a message consisting of a subset of the fields of the original message. Its parameter follows the same schema as the message schema described in Section [JSON schema]. Its value is encoded as LEB128 encoded size in bytes followed by an arbitrary number of `message`s. Its set of fields is the intersection of fields listed in its parameter and the fields listed in `request-uuids`. Fields used in a table should be included in the `fields` property of the root object, with a type of fixed length with a `size` of 0 to instruct decoders unaware of the table type to skip the fields. # Examples This section is non-normative. ## Positional audio stream A server may offer this JSON schema. ```json { "fields": { "6338d6ac-6527-4d5d-b952-bf462832fb39": { "name": "position", "type": { "6cc2b827-0ca4-43ea-901f-37c683f20397": { "size": 6 }, "4a60a467-d75e-47fa-a30e-cefdaf512bf4": {}, "cd8999ab-936b-4606-8b11-ea65ed54a39d": {} } }, "534dbd67-f936-4886-b3b8-d9feaa18b114": { "name": "audio-opus", "type": { "1bc08826-7d62-459b-b8aa-ca09924b7bf8": {}, "391a3041-4de7-44b6-a8ec-ea458de53074": {} } }, "028cd5c1-c22f-45a1-98d1-a08b7730e69d": { "name": "audio-mp3", "type": { "1bc08826-7d62-459b-b8aa-ca09924b7bf8": {}, "461d5855-4eea-4f9c-a8b7-e48c93c67432": {} } } } } ``` Where `4a60a467-d75e-47fa-a30e-cefdaf512bf4` may represent an array of 16 bit signed big-endian integers, `cd8999ab-936b-4606-8b11-ea65ed54a39d` may represent a position in space, `391a3041-4de7-44b6-a8ec-ea458de53074` an Opus encoded audio frame, and `461d5855-4eea-4f9c-a8b7-e48c93c67432` an mp3 encoded audio frame. Upon connecting, a server may send the following bytes: bytes in hexadecimal | explanation - | - 00 | version 00 | flags 30 | 48 bytes length 6338d6ac65274d5db952bf462832fb39 | uuid of position field 534dbd67f9364886b3b8d9feaa18b114 | uuid of audio-opus field 028cd5c1c22f45a198d1a08b7730e69d | uuid of audio-mp3 field The client may reply with the following bytes: bytes in hexadecimal | explanation - | - 00 | version 00 | flags 20 | 32 bytes length 6338d6ac65274d5db952bf462832fb39 | uuid of position field 534dbd67f9364886b3b8d9feaa18b114 | uuid of audio-opus field This establishes that the client is interested in the fields `position` and `audio-opus`. After server and client agreed on a set of fields, they may both send messages like the following: bytes in hexadecimal | explanation - | - 00 01 00 02 00 03 | 3D position (1, 2, 3) 05 | length of audio-opus field is 5 bytes 01 02 03 04 05 | bytes of audio-opus field 12 bytes. A minimal tool, that only understands the pre-defined types can request the JSON schema and display the message above as such: name (short id) | value - | - position (6338d) | 00 01 00 02 00 03 audio-opus (534db) | 01 02 03 04 05 A tool that understands integer arrays may display the position as `(1, 2, 3)`, whereas a tool that understands positions in space may further visualize them. A tool that understands audio formats may decode the audio and display a waveform or offer playback. Encoding such messages can be done with little code in most languages. Here is an example in Python: ```python class Connection: def __init__(self): # set when decoding initial message self.has_position = False self.has_audio_opus = False self.has_audio_mp3 = False def encode(self, position, audio): message = bytes() if self.has_position: message += bytes().join([v.to_bytes(2, "big") for v in position]) if self.has_audio_opus: message += encode_opus(audio) if self.has_audio_mp3: message += encode_mp3(audio) ``` # Comparison ## MessagePack In MessagePack, type information and keys are included with values, increasing the size of messages compared to this encoding. Including the keys in the messages offers backward, forward and sideward compatibility. To avoid name collisions between keys, namespaces as in JSON-LD may be used, increasing the size of messages further. The message in Section [Positional audio stream] without namespaces could be encoded in message pack with 32 bytes. 18 of those bytes are taken up by the field names. ``` 82 a8 70 6f 73 69 74 69 6f 6e 93 01 02 03 aa 61 75 64 69 6f 2d 6f 70 75 73 c4 05 01 02 03 04 05 ``` When encoding the message as an array, without keys, it takes up 12 bytes. ## Protocol Buffer Protocol Buffer transmits limited type information ("wire-types") and field numbers, but not keys, with values, increasing the size of messages compared to this encoding. Backward and forward compatibility is offered, however, to provide sideward compatibility, implementations need to agree on a way to allocate field numbers. With about 29 bit length, field numbers can be allocated randomly with low chance of collisions. The message in Section [Positional audio stream] without random field numbers could be encoded in Protocol Buffer with 12 bytes. ``` 0a 03 01 02 03 1a 05 01 02 03 04 05 ```