5. CDOC2 container format¶

5.1 Abstracted format¶

This section describes the CDOC2 format from an abstract point of view, presenting the data contents and data models used therein without referencing the specifics of the serialized format.

5.1.1 Basic principles¶

The basic principles outlined below will provide the user of this specification with points of reference for understanding the details of the specification.

The abstracted format consists of a header and an encrypted payload.
The abstracted format contains a single encrypted payload consisting of one or several encrypted files. File information, as well as the sizes and sequence of the files, in case there is more than one file, is also encrypted.
The payload is encrypted using a single symmetric key (Content Encryption Key; CEK), using AEAD (Authenticated Encryption with Additional Data) encryption.
The CEK is derived from the CDOC file master key (File Master Key; FMK). See section Key derivation.
The FMK can be encrypted in parallel using one or several key encryption keys (KEK), one per recipient. On KEK generation see section Descriptions of header elements and KEK computation.
The header describes the protection of the FMK (i.e. how the recipients can acquire the KEK required for decrypting the FMK).
Header integrity is ensured using a message authentication code computed using the message authentication key derived from the FMK (Header HMAC Key; HHK). See section Header authentication code.
To ensure format universality, no elements specific to the Estonian eID infrastructure have been used in the description. Thus, the recipient is described with reference to their public key rather than their certificate.
Decryption always follows the same pattern: 1) the recipient acquires a KEK, 2) the recipient decrypts the FMK, 3) the recipient derives the HHK and validates the header, 4) the recipient derives the CEK, 5) the recipient decrypts the payload.

5.1.2 Header structure¶

Header structure is described with the help of pseudocode that is based on no specific programming or schema language but should be intuitively understood.

The header consists of one or several structures describing a recipient. Each recipient structure contains complete information on how the specific recipient can access the FMK (for identification, access to personal encrypted materials, etc.).

A message authentication code is computed for the header using a key derived from the FMK. This is necessary for preventing the manipulation of the header by the senders, e.g. for the purpose of concealing some recipient. The header authentication code is computed for a header serialized in a specific manner (see section Serialized format).

Header = {
    Recipients              = :Recipient[](1..k)
    PayloadEncryptionMethod = :enum(CHACHA20-POLY1305)
}

A message authentication code is computed for the header (see section Header authentication code):

Checksum = {
    value = HMAC(HHK, Serialize(Header))
}

The recipient is described using the structure Recipient. The format of the structure allows for quick and unambiguous decisions on whether the reader can decrypt the payload using the specific instance of Recipient.

Recipient = {
    Capsule = Union(:ECCPublicKeyCapsule | :KeyServerCapsule | 
            :SymmetricKeyCapsule | :RSAPublicKeyCapsule )
    KeyLabel = :string
    EncryptedFMK = :byte[]
    FMKEncryptionMethod = :enum(XOR)
}

The Recipient structure consists of a capsule, a recipient key label, an encrypted FMK, and an FMK encryption method identifier.

Capsule – encryption method specific data that the recipient can use to decrypt the FMK.
EncryptedFMK – encrypted FMK.
FMKEncryptionMethod –FMK encryption method type.
KeyLabel – human-readable label of the private or secret key required for decrypting the FMK. This label is necessary for building a sensible user interface. The sender fills this field based on the key or the related certificate. No concrete method for achieving this is indicated in the specification as this is not relevant to cryptographic processing. KeyLabel is a UTF-8 string.

Successful processing of the Capsule structure returns a cryptographic key for decrypting the FMK using the method defined as FMKEncryptionMethod. See section 6.4 on the details of cryptographic operations. The following capsule types have been specified to ensure the support of a variety of encryption methods (CDOC2 encryption schemes).

ECCPublicKeyCapsule – the recipient is identified by ECC public key RecipientPublicKey (e.g. the public key of the first ID-card key pair). The KEK is derived using ECDH. Used in the SC.01 encryption method.
RSAPublicKeyCapsule – the recipient is identified by RSA public key RecipientPublicKey. The KEK is derived by decrypting the capsule using the RSA private key. Used in the SC.03 encryption method.
KeyServerCapsule – the recipient is identified by ECC or RSA public key RecipientPublicKey, used by the recipient for authentication on a Capsule Server. The Capsule Server returns an ECCPublicKeyCapsule or a RSAPublicKeyCapsule used as described above. Used in the SC.02 and SC.04 encryption methods.
SymmetricKeyCapsule – the recipient is identified by key label KeyLabel. The KEK is derived using HKDF from a symmetric key provided by the user. Used in the SC.05 encryption method.

This list may be expanded in future versions of the specification.

5.1.2.1 KeyLabel recommendations¶

Although not required by the specification, KeyLabel should however follow consistent formating rules and be structured in a machine-readable format for Client Application to show the User what decryption method is allowed and, in case of password and symmetric key encryption, a reminder of what password or key to use.

Dependent upon the encryption method the following formatting rules are used in the reference implementation:

1. For machine parsable text data url format was chosen, that starts with data:

data:[<mediatype>][;base64],<data>

The mediatype can be omitted and is application/x-www-form-urlencoded if not specified and fields are encoded as url parameters. Parameter names are case-insensitive. ;base64 encoding is optional and means then that Base64 encoded has been applied to the <data> part.

type and v fields are required, rest of fields are type specific.

Examples:

Smart-ID/Mobile-ID - PNO=ETSI:{ETSI indentifier} e.g. "ETSI:PNOEE-48010010101", where PNO means personal number issued by a national authority and {ETSI identifier} is replaced by the Recipient's identifier. Example: type=Smart-ID&PNO=ETSI%3APNOEE-48010010101
Password, with an integrated password manager - KM=bitwarden&VAULT=CDOC2&KEY_ID=HELLO.CDOC2&USER_DESC=hello, where KM means key manager and VAULT refers to the name of a secure vault, keyring or wallet inside the password manager. KEY_ID is the name given to the key in the vault.
Symmetric key - KM=bitwarden&VAULT=CDOC2&KEY_ID=HELLO.CDOC2&FILE=~/folder/secret.pem&USER_DESC=hello, where KM means key manager and VAULT refers to the name of a secure vault, keyring or wallet inside the password manager. KEY_ID is the name given to the key in the vault. FILE is the path to the symmetric key.
Certificate - FILE=~/folder/filename&CERT_HASH=XXYYXXYY, where FILE is the path to the certificate and CERT_HASH is a result of applying a digest algorithm.
ID-card and Digi-ID and Digi-ID E-RESIDENT - TYPE=ID-card&cn={cn}, TYPE means eID type. The current known values are: 'ID-CARD', 'Digi-ID E-RESIDENT', 'Digi-ID'. For these types the following fields and requirements are defined

Known fields are defined in Appendix: KeyLabel field specification v1.

Machine-readable KeyLabel examples:

data:v=1&type=ID-card&serial_number=PNOEE-38001085718&cn=J%C3%95EORG%2CJAAK-KRISTJAN%2C38001085718
data:application/x-www-form-urlencoded,v=1&type=ID-card&serial_number=PNOEE-38001085718&cn=J%C3%95EORG%2CJAAK-KRISTJAN%2C38001085718
data:application/x-www-form-urlencoded;base64,dj0xJnR5cGU9SUQtY2FyZCZzZXJpYWxfbnVtYmVyPVBOT0VFLTM4MDAxMDg1NzE4JmNuPUolQzMlOTVFT1JHJTJDSkFBSy1LUklTVEpBTiUyQzM4MDAxMDg1NzE4
data:;base64,dj0xJnR5cGU9SUQtY2FyZCZzZXJpYWxfbnVtYmVyPVBOT0VFLTM4MDAxMDg1NzE4JmNuPUolQzMlOTVFT1JHJTJDSkFBSy1LUklTVEpBTiUyQzM4MDAxMDg1NzE4

2. The second format for KeyLabel is free text format and it doesn't start with data:

This format is meant for password encryption and symmetric key encryption use cases when no integrated password manager is used.

Free text KeyLabel examples:

"131:40:16"
"kevade"
"poem"
"password manager, key: hello.cdoc2"

5.1.3 Capsule types¶

ECC public key capsule. The recipient is identified by ECC public key RecipientPublicKey.

ECCPublicKeyCapsule = {
    Curve              = :enum(secp384r1)
    RecipientPublicKey = :byte[]
    SenderPublicKey    = :byte[]
}

Curve – identifier of the elliptic curve employed.
RecipientPublicKey – recipient’s ECC public key, used by the recipient to establish the corresponding recipient record.
SenderPublicKey – sender’s public key used by the recipient to derive the KEK using ECDH.

RSA public key capsule. The recipient is identified by RSA public key RecipientPublicKey.

RSAPublicKeyCapsule = {
    RecipientPublicKey = :byte[]
    EncryptedKEK       = :byte[]
}

RecipientPublicKey - recipient’s RSA public key, used by the recipient to establish the corresponding recipient record.
EncryptedKEK - key encryption key encrypted with the receipient's public key.

Server capsule. The receipient is identified by ECC or RSA public key RecipientPublicKey.

KeyServerCapsule = {
    RecipientKey = Union(:EccKeyDetails | :RsaKeyDetails)
    KeyServerID         = :string
    TransactionID       = :string
}

RsaKeyDetails = {
    RecipientPublicKey  = :byte[]
}

EccKeyDetails = {
        Curve              = :enum(secp384r1)
        RecipientPublicKey = :byte[]
}

RecipientKey – information on the recipient key, used by the recipient for authentication on the Capsule Server.
KeyServerID – Capsule Server identifier. The recipient must be able to use this to establish the Capsule Server’s network address and connect to the server.
TransactionID – identifier of the capsule sent to the key exchange server by the sender for transmission to the recipient.

Symmetric key capsule. The recipient is identified by the label of the symmetric key held by the user, KeyLabel.

SymmetricKeyCapsule = {
    Salt   = :byte[]
}

Salt – random number generated by the sender, used by the sender as input for the HKDF-Extract function.

5.1.4 Format extension¶

To allow for format extension and ensure general forward compatibility, the union type field Capsule is included in the header structure Recipient. Each type of the union describes a specific type of recipient along with corresponding cryptographic primitives and key management tools. Types can be added to the format as necessary both in abstracted and concrete forms.

5.2 Serialized format¶

The specification describes the implementation of the abstracted format using the FlatBuffers format.

5.2.1 General description of the format¶

The format consists of an envelope, which is essentially made up of a serialized and concatenated header, a message authentication code, and a payload. The message authentication code and payload are serialized using simple serialization. For the sake of extensibility and the necessity of transmitting messages identical to the header from the point of view of processing logic via the capsule server, the header is described here with reference to the FlatBuffers format. Aside from the header extension mechanism, the envelope used in this format also defines another point of extension. This point of extension is provided by the version identifier, set as 2 (byte value) in the specification. The identifier must be changed in the descriptions of new versions.

5.2.2 Envelope¶

The envelope consists of the following data elements, presented sequentially as bytes. The designation of the start and end of the envelope is outside the scope of the specification, insofar as the main and natural use case of the format is one where a CDOC file contains a single envelope.

4 bytes: the string “CDOC” – format designator (prelude), UTF-8 encoded.
1 byte: version identifier, set as 2 in the specification.
4 bytes: length of the following header, big-endian order. Header length is a 32-bit signed integer, i.e. the maximum header size is 2 GB. For the sake of the simplicity of implementation, header size is limited to 1 MB (2²⁰ bytes).
x bytes, where x is as defined above: serialized FlatBuffers header.
32 bytes: header message authentication code (see section 6.5).
The rest of the bytes, until the end of the envelope: payload encrypted using the method and key specified in the header.

Table 1 presents an overview of the envelope structure. Table 1. Envelope structure

Field	“CDOC”	Version	Header length	Header	HMAC	Payload
Length	4	1	4	Header length	32	Until end of envelope
Start	1	5	6	10	10 + header length	10 + header length + 32

5.2.3 Header and HMAC¶

The technical description (schema) of the FlatBuffers format can be found in the reference implementation source code repository, under cdoc2-schema/. The schema is described in two files and reproduced as appendices to the specification.

src/main/fbs/header.fbs Description of the FlatBuffers header.
src/main/fbs/recipients.fbs Descriptions of recipient types; can be shared with schemas presented in other files.

The header, serialized following the FlatBuffers rule set, is written to the envelope, preceded a 4-byte length field as per the envelope description. The Header Message Authentication Code (HMAC) is computed as described in section Header authentication code and written, bytewise, immediately after the header. The message authentication code algorithm and, consequently, HMAC length are defined in this specification.

5.2.4 Payload¶

Lastly, the payload is written to a container composed following the CDOC format, immediately after the HMAC. The format presumes that the end-of-payload indicator is defined outside the format, e.g. as end-of-file.

Note that the end-of-payload indicator is purely optional: the true integrity of the payload is determined by whether the payload can be completely decrypted.

The composition of the payload plaintext is described in section Unencrypted payload.

The encryption of the payload is described in section Payload assembly and encryption.

5.2.5 Format composition procedure¶

This section makes reference to the reference implementation source code, using Java package names and identifiers. References to source code are styled as monotype. The following steps are needed to compose a CDOC2 container.

Compile the list of all recipients.
Generate FMK, HHK, and CEK.
Compose the header along with all corresponding cryptographic operations.
Compute the HMAC.
Generate the payload plaintext.
Encrypt the payload.
Generate the serialized envelope.
Securely delete the FMK, HHK, and CEK values used during the procedure.

Generation of the payload plaintext is described in section Unencrypted payload, container.Tar.archiveFiles().

Next, the cryptographic material used for the protection of the header and payload must be prepared. Generation and derivation of the corresponding keys (FMK, HHK, CEK) is described in section Key derivation, container.Envelope.prepare() and container.Envelope().

The list of all desired recipients must then be compiled and serialized, as the cryptographic methods used for ensuring the integrity of the container operate with an integral serialized header.

The requisite cryptographic procedures described in sections Descriptions of header elements and KEK computation and FMK encryption and decryption must be executed for each recipient.

The HMAC is then computed as per section Header authentication code.

Payload encryption is described in section Payload assembly and encryption, crypto.ChaChaCipher.encryptPayload() and crypto.ChaChaCipher.initChaChaOutputStream().

The detailed serialized envelope format is presented in section Envelope.

At the end of the encryption process, the employed cryptographic materials (symmetric keys, ephemeral private keys) must be securely deleted. Secure deletion significantly depends on the operational environment; in some cases (e.g. in JVM) it might not be possible. The developer must evaluate what options for secure deletion are provided by the employed programming language and operational environment.

5.2.6 Format parsing procedure¶

This section makes reference to the reference implementation source code, using Java package names and identifiers. References to source code are styled as monotype.

The container parsing reference implementation is found in the function container.Envelope.decrypt(). This functions serves as the primary entry point into the decryption logic and its main purpose is to take the encrypted container provided as input and write all the files contained therein to the selected folder.

The function does the following and all its alternative implementations must do the same while employing all relevant security checks.

Parsing the envelope and decoding the envelope header.
Decrypting/deriving the KEK.
Decrypting/deriving the container-specific keys, i.e. FMK, HHK, and CEK.
Checking the HMAC.
Decrypting the archive.
Extracting files from the encrypted archive and writes the files to the selected folder. In stream processing mode, decryption and extraction form a single operation.

The envelope parsing and header decryption reference implementation is found in the function container.Envelope.readFBSHeader().

The header must be parsed using the FlatBuffers library, utilizing the root type fbs.header.Header (in the reference implementation, this is implemented as the function fbs.header.Header.getRootAsHeader() generated from the FlatBuffers schema).

Parsing of the complete header is implemented as the reference implementation function container.Envelope.deserializeFBSHeader().

A recipient (Recipient) corresponding to the party processing the container must be found in the header and the KEK, FMK, and HHK must be derived or decrypted.

Recipient identification methods corresponding to each encryption method are described in section Descriptions of header elements and KEK computation. In case no recipient corresponding to the processing party is not found, the container cannot be decrypted. In this case, the algorithm must return a “container not meant for opening by the processor” error and terminate.

KEK computation is described in section Descriptions of header elements and KEK computation. Should an error occur during KEK computation (e.g. the point is not located on the ellipse curve), the algorithm must return an error and terminate. KEK computation functions are found in the class crypto.KekTools.

FMK decryption is described in section FMK encryption and decryption, crypto.Crypto.xor().

HHK derivation procedure is described in section Key derivation, crypto.Crypto.deriveHeaderHmacKey().

The HHK and the original serialized form of the header must be used to check the HMAC using container.Envelope.checkHmac().

After a successful message authentication code check, the payload may be decrypted. In case the HMAC check was unsuccessful, the algorithm must return an error and terminate.

Decryption requires deriving the CEK. The corresponding procedure is described in section Key derivation, Crypto.deriveContentEncryptionKey().

Payload decryption is carried out in three stages: decryption, cryptogram authentication, and unpacking the decrypted archive.

Decryption and cryptogram authentication are described in section Payload assembly and encryption.

Archive unpacking is described in section Requirements for payload unpacking.

5.3 Unencrypted payload¶

This section provides a more detailed description of the format and processing of the unencrypted payload.

Main features of the format:

The transmitted files are archived using the POSIX tar format.
The archived files are packed using ZLIB, standardized in IETF RFC 1950.

The container payload plaintext is formed as follows: the transmitted files (or file) are added to a POSIX tar archive which is then packed into the ZLIB format as a single bloc.

Implementation note: the DD4 client uses the relevant Qt wrapper functions to call the zlib library. Since these functions cannot be used in streaming mode, the specification recommends replacing the use of Qt wrappers with streaming mode zlib calls. This becomes especially crucial in storage cryptography where data volumes may be very high and encryption of the data in memory buffers in one piece may be unfeasible.

5.3.1 Requirements for POSIX tar archive assembly¶

Given the long history and large number of variations of the tar format, this subsection presents an overview of the requirements for archives created for CDOC2. The purpose of these requirements is to reduce compatibility issues between different client applications and/or operating systems and facilitate the save extraction of the files from the archive to the file system.

Standardized POSIX tar dialect is used. This format is also known as ‘POSIX 1003.1-2001’ or ‘PAX’.
All file names are UTF-8 encoded.
100B filenames supported by PAX extended header.
8 GiB files supported by PAX extended header.
Filenames are added to the archive without basenames.
Permission bits and other security attributes added to the archive are ignored (can be written but not read).
Only normal files (type 0) are added to the archive.
Files are processed as binary files.

5.3.2 Requirements for payload unpacking¶

The payload format is chosen to enable unpacking in streaming mode. This means the encrypted payload does not have to be loaded to memory in one piece. The payload can be decrypted, unpacked, and files written to the disk in plaintext sequentially.

When data is processed in streaming mode, the decrypted data will be used before encryption checksum verification. Unpacking must be done with account of the possibility that the payload could be faulty and not meet the rules set out in the specification or might have even been maliciously assembled by an attacker. As the sender of a CDOC2 container is unauthenticated, the possibility of the payload having been assembled by an attacker must always be accounted for, even if the encryption checksums match.

When processing data in streaming mode, errors encountered in processing the plaintext (packing or archival errors) cannot be handled before the entire payload has been processed and the cryptogram authenticated. If cryptogram authentication fails, this must be reported as an error. Errors encountered in plaintext processing can only be reported if cryptogram authentication was successful. In case of an error, all created files must be deleted.

Below, we have described two types of attacks that software based on this specification must be able to deploy countermeasures against.
The list of potential attacks is inconclusive. Thus, any file might contain a virus or malware and needs to be checked by antivirus software before use, but this type of attack is not specific to CDOC2 but is equally valid for the use of files received from any untrusted source and is hence not covered here in more detail.

Attack 1: The attacker may create a compressed payload that will unpack into a massive file. This may cause the application to crash when the recipient processes this payload in memory. It can cause disk space to run out when written to disk. The pragmatic solution is to set a maximum size limit for unpacked files and continuously monitor free memory or free disk space during unpacking. If the files being unpacked are larger than permitted or free memory or free disk space has decreased below the permitted limit, unpacking must be aborted, files written to the disk in the process deleted, and the error reported.

Attack 2: The attacker may manipulate the attributes of the files in the tar archive – file names, permission bits, security attributes and types. In case such a tar file is unpacked without additional checks, the attacker may be able to overwrite existing system files, add new files, create files invisible to normal users but necessary for certain attacks, etc.

Since the CDOC2 container is not meant to serve as a universal archive format but simply provide a means for the simultaneous encryption of multiple files while retaining original file names for the user’s convenience, a number of rules have been set out for the unpacking of tar files which will ensure protection from the forms of manipulation described above if enforced:

File creation must ignore permission bits, file owner and group identifiers and other security attributes found in the archive – all files must be created non-executable, owned by the user running the application, and readable and writable by this user.
Only normal files (type 0) must be created. If the archive contains a file of some other type, abort unpacking, delete files written to the disk before this point, and return an error message. A correctly implemented CDOC2 client application should not create files containing files of other types.
Validate file name safety before writing a file to the disk. If a file name containing unpermitted symbols is found, abort unpacking, delete files written to the disk before this point, and return an error message.

File name safety verification serves the following purposes:

Prevention of path traversal attacks and creation of files outside the folder selected by the user.
Prevention of the creation of files with names containing special symbols, inaccessible or difficult to access for the user.

Different operating systems have different requirements for file names. The pragmatic solution is to use a tried and tested method for validating file names received from untrusted sources. Using multiple validation mechanisms is advantageous.

Pathvalidate is a comprehensive Python library for file name validation – similar checks must also be used in other programming languages.

The SEI CERT coding standard describes an additional method for preventing path traversal.

List of requirements for file names used in the reference implementation container.FileNameValidator:

Cannot begin with a space or hyphen.
Cannot end with a space or period.
Cannot be any of the following: CON, PRN, AUX, NUL, COM[1-9], LPT[1-9].
Cannot contain any of the following symbols: <, >, :, \, /, |, ?, *.
Cannot contain control characters.
Cannot contain the Unicode character Right-To-Left Override (U+202E).