Digital Cinema Package
A Digital Cinema Package (DCP) is a collection of digital files used to store and convey digital cinema (DC) audio, image, and data streams.
The term was popularized by Digital Cinema Initiatives, LLC in its original recommendation[1] for packaging DC contents. However, the industry tends to apply the term to the structure more formally known as the composition.[2]) A DCP is a container format for compositions,[3] a hierarchical file structure that represents a title version. The DCP may carry a partial composition (e.g. not a complete set of files), a single complete composition, or multiple and complete compositions.[3]
The composition consists of a Composition Playlist (in XML format) that defines the playback sequence of a set of Track Files. Track Files carry the essence, which is wrapped using Material eXchange Format (MXF). Two track files at a minimum must be present in every composition (see SMPTE ST429-2 D-Cinema Packaging – DCP Constraints, or Cinepedia[4]): a track file carrying picture essence, and a track file carrying audio essence. The composition, consisting of a Composition Playlist (CPL) and associated track files, are distributed as a Digital Cinema Package (DCP). A composition is a complete representation of a title version, while the DCP need not carry a full composition. However, as already noted, it is commonplace in the industry to discuss the title in terms of a DCP, as that is the deliverable to the cinema.
The Picture Track File essence is compressed using JPEG 2000 and the Audio Track File carries a 24-bit linear PCM uncompressed multichannel WAV file. Encryption may optionally be applied to the essence of a track file to protect it from unauthorized use. The encryption used is AES 128-bit in CBC mode.
In practice, there are two versions of composition in use. The original version is called Interop DCP.[5] In 2009, a specification was published by SMPTE (SMPTE ST 429-2 Digital Cinema Packaging – DCP Constraints) for what is commonly referred to as SMPTE DCP. SMPTE DCP is similar but not backwards compatible with Interop DCP, resulting in an uphill effort to transition the industry from Interop DCP to SMPTE DCP.[6] SMPTE DCP requires significant constraints to ensure success in the field, as shown by ISDCF.[7] While legacy support for Interop DCP is necessary for commercial products, new productions are encouraged to be distributed in SMPTE DCP.
3D DCP[edit]
The DCP format is also used to store stereoscopic (3D) contents for 3D films. In this case, 48 frames exist for every second – 24 frames for the left eye, 24 frames for the right.
Depending on the projection system used, the left eye and right eye pictures are either shown alternately (double or triple flash systems) at 48 fps or, on 4k systems, both left and right eye pictures are shown simultaneously, one above the other, at 24 fps. In triple flash systems, active shutter glasses are required whereas optical filtering such as circular polarisation is used in conjunction with passive glasses on polarized systems.
Since the maximum bit rate is always 250 Mbit/s, this results in a net 125 Mbit/s for single frame, but the visual quality decrease is generally unnoticeable.
Creation[edit]
Most film producers and distributors rely on digital cinema encoding facilities to produce and quality control check a digital cinema package before release. Facilities follow strict guidelines set out in the DCI recommendations to ensure compatibility with all digital cinema equipment. For bigger studio release films, the facility will usually create a Digital Cinema Distribution Master (DCDM).
A DCDM is the post-production step prior to a DCP. The frames are in XYZ TIFF format and both sound and picture are not yet wrapped into MXF files. A DCP can be encoded directly from a DCDM. A DCDM is useful for archiving purposes and also facilities can share them for international re-versioning purposes. They can easily be turned into alternative version DCPs for foreign territories. For smaller release films, the facility will usually skip the creation of a DCDM and instead encode directly from the Digital Source Master (DSM) the original film supplied to the encoding facility. A DSM can be supplied in a multitude of formats and color spaces. For this reason, the encoding facility needs to have extensive knowledge in color space handling including, on occasion, the use of 3D LUTs to carefully match the look of the finished DCP to a celluloid film print. This can be a highly involved process in which the DCP and the film print are "butterflied" (shown side by side) in a highly calibrated cinema.
Less demanding DCPs are encoded from tape formats such as HDCAM SR. Quality control checks are always performed in calibrated cinemas and carefully checked for errors. QC checks are often attended by colorists, directors, sound mixers and other personnel to check for correct picture and sound reproduction in the finished DCP.
Accessibility[edit]
Hearing impaired audio[edit]
A Hearing Impaired (HI) audio track is designed for people who are hearing-impaired to better hear dialog.[11] Moviegoers can wear headphones which play this audio track synchronized with the film.[11] Hearing Impaired audio is stored in the DCP on Sound Track channel 7.[10]
Audio description[edit]
Audio description is narration for people who are blind or visually impaired. Audio description is stored in the DCP as "Visually Impaired-Native" (VI-N) audio on Sound Track channel 8.[11][10]
Sign Language Video[edit]
A Sign Language Video track can be included in a DCP to allow for display of sign language synchronized with the film.[12] Sign Language Video tracks are displayed to moviegoers in portrait orientation on a second screen device.[11] In September 2017, new Libras accessibility requirements took effect in Brazil mandating availability of Brazilian Sign Language for films shown in Brazilian movie theaters.
Sign Language Video tracks have no audible audio and are encoded in VP9 format with a maximum bit rate of 1 Mbps, in 480x640px resolution, and with a frame rate of 24 frames per second (even if the main film is a different frame rate).[13]
VP9 video is stored in Sound Track channel 15, identified by an MCA (Multichannel Audio) Tag Name of "Sign Language Video Stream".[9][10] VP9-compressed video is stored in an uncompressed PCM audio channel with a 48 kHz sample rate and a 24-bit bitrate, occupying a fixed bandwidth of 1.152 Mbps. Since VP9 uses a variable bitrate, video is stored in evenly-distributed 2-second chunks, decoded by the media block.[13] This method retains random access playback ability ("trick play") and is compatible with all existing digital cinema projection systems.[13] Sign Language Video tracks are unencrypted and not watermarked.[10]
The InterSociety Digital Cinema Forum (ISDCF) released an open-source encoder and decoder for Sign Language Video on GitHub.[14]
Watermarking[edit]
Forensic Marking (FM) refers to tracking information embedded via digital watermarking to the image ("Image Forensic Marking") and audio ("Audio Forensic Mark") channels via an embedder in the Media Block.[15] Watermarking does not stop unauthorized recordings or their distribution, though it may deter unauthorized copying by those aware of the watermarking process.[15] Watermarks are designed to be detectable in any copies, including unauthorized recordings (such as cams).[15] Information obtained from watermarks in unauthorized copies, along with logs generated by Media Blocks, can be examined as part of an investigation to identify the source of the recordings.[15]
Up to 30 minutes of a recording may be required to positively identify the watermark.[15] The DCI specification does not mandate a specific watermarking technology, and there are multiple such systems available.[15] A watermark is generated in real time (or faster) and is inserted every 5 minutes.[15] The watermark includes a time stamp and a unique ID associated with the Secure Processing Block (SPB).[15] Audio and image watermarking is required to be transparent to humans as well as to survive any tampering or format conversions, including multiple conversions between analog and digital formats, pitch shifting, scaling, or cropping.[15]
Watermarking must be applied to all encrypted channels (audio or image) and must not be applied to unencrypted channels.[15] Watermarking can be disabled for all channels ("no FM mark") or specific audio tracks only ("selective audio FM mark") via setting the associated value in the ForensicMarkFlagList element of the KDM.[15] In the case of specific unwatermarked audio tracks, all tracks numbered above the specified track number will have FM disabled.[15] D-Box and Sign Language Video tracks must have FM disabled.[10]
Delivery methods[edit]
The most common method uses a specialist hard disk (most commonly the CRU DX115) designed specifically for digital cinema servers to ingest from. These hard drives were originally designed for military use but have since been adopted by digital cinema for their hard wearing and reliable characteristics. The hard disk drives are usually formatted with the Linux ext2 or ext3 file system as D-Cinema servers are typically Linux-based and are required to have read support for these file systems. Usually the inode size is set to 128 bytes to avoid compatibility issues with some servers. NTFS and FAT32 are also occasionally used. Hard drive units are normally hired from a digital cinema encoding company, sometimes in quantities of thousands. Drives are commonly shipped in protective hard cases. The drives are delivered via express courier to the exhibition site. Other methods adopt a full digital delivery, using either dedicated satellite links or high-speed Internet connections.[16]
For example, theaters received Cats before opening day, Friday 20 December 2019. When the film received poor reviews after its world premiere on 16 December, Universal notified theaters on 20 December that an updated version of the film with "some improved visual effects" would be available for download on 22 December and on hard drive by 24 December.[17]