Misplaced Pages

Rec. 709

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
(Redirected from Rec.709) Standard for HDTV image encoding and signal characteristics

Rec. 709
record. Code 709
BT.709 primaries shown on the CIE 1931 x, y chromaticity diagram. All chromaticities of the BT.709 color gamut fall within the triangle that connects the primaries. This includes Illuminant D65, the white point.
StatusApproved
First publishedNovember 16, 1993; 31 years ago (1993-11-16)
Latest versionBT.709-6
June 17, 2015; 9 years ago (2015-06-17)
AuthorsITU-R
Base standardsRec.709, BT.709, ITU—709
DomainDigital image processing
Websitewww.itu.int/rec/R-REC-BT.709

ITU-R Recommendation 709, usually abbreviated Rec. 709, BT.709, or ITU-R 709, is a standard developed by the Radiocommunication Sector of the International Telecommunication Union (ITU-R) for image encoding and signal characteristics of high-definition television (HDTV). The standard specifies a scheme for digital encoding of colors as triplets of small integers, a widescreen format with 1080 active lines per picture and 1920 square pixels per line (a 16:9 aspect ratio), as well as several details of signal capture, transmission, and display. While directed to HDTV, some of its specifications (such as the color encoding) have also been adopted for other uses.

Technical details

The standard is freely available at the ITU website, and that document should be used as the authoritative reference. The essentials are summarized below.

Image format and definition

Recommendation ITU-R BT.709-6 defines a common image format (CIF) where picture characteristics are independent of the frame rate. The image is 1920x1080 pixels, for a total pixel count of 2,073,600 and a 16:9 aspect ratio.

Frame rates

BT.709-6 specifies the following possible frame rates and pixel scanning order. The options for the latter are progressively scanned frame (P), progressive segmented frames (PsF), and interlaced (I)

24/P, 24/PsF, 23.976/P, 23.976/PsF
These combinations match the frame rate used for theatrical motion pictures. The fractional rates are included for compatibility with the "pull-down" rates used with NTSC.
50/P, 25/P, 25/PsF, 50/I (25 fps)
These combinations are provided for compatibility with earlier "50 Hz" TV standards, such as PAL or SECAM. There are no fractional rates as PAL and SECAM did not have the pull-down issue of NTSC.
60/P, 59.94/P, 30/P, 30/PsF, 29.97/P, 29.97/PsF, 60/I (30 fps), 59.94/I (29.97 fps)
These combinations offer compatibility with earlier "60 Hz" TV standards, as NTSC. Here again, the fractional rates are for compatibility with legacy NTSC pull-down rates.

Cameras and monitors may use any of these modes. Video captured in progressive mode can be recorded, broadcast, or streamed in progressive or progressive segmented frame modes. Video captured using an interlaced mode must be distributed as interlace unless a de-interlace process is applied in post production.

In cases where a progressive captured image is distributed in segmented frame mode, segment/field frequency must be twice the frame rate. Thus 30/PsF has the same field rate as 60/I.

The RGB color space

BT.709 RGB color cube (image encoded with an ICC profile)

Colors in the BT.709 standard are basically described according to the RGB color model, namely as mixtures of three primaries, "red" (R), "green" (G) and "blue" (B). For BT.709, their coordinates in the CIE 1931 chromaticity diagram are

ITU-R BT.709 color primaries
White point Primaries
xW yW xR yR xG yG xB yB
0.3127 0.3290 0.64 0.33 0.30 0.60 0.15 0.06

In the BT.709 standard, a color value is conceptually represented by three numbers ( E R , E G , E B ) {\displaystyle (E_{R},E_{G},E_{B})} between 0 and 1, where 0 means the absence of the corresponding primary color and 1 means the maximum intensity that the device can represent. If these numbers are interpreted as Cartesian coordinates in a three-dimensional space, the representable colors correspond to points in an axis-aligned cube of side 1, with corner ( 0 , 0 , 0 ) {\displaystyle (0,0,0)} representing the color black and ( 1 , 1 , 1 ) {\displaystyle (1,1,1)} representing the maximum-brightness white. More generally, points along the cube's diagonal represent shades of grey. The white point coordinates above define this white color as being CIE illuminant D65 for 2° standard observing conditions.

Non-linear encoding

The coordinates ( E R , E G , E B ) {\displaystyle (E_{R},E_{G},E_{B})} are supposed to be proportional to the physical intensity of each primary, namely emitted or received light power per unit of area. For efficiency reasons, the standard specifies a non-linear transformation of each component signal, resulting in ( E R , E G , E B ) {\displaystyle (E'_{R},E'_{G},E'_{B})} . This optical electrical transfer transfer function, is defined as

V = { 4.500 L if  L < 0.018 1.099 L 0.45 0.099 if  L 0.018 {\displaystyle V={\begin{cases}4.500\,L&{\mbox{if }}L<0.018\\1.099\,L^{0.45}-0.099&{\mbox{if }}L\geq 0.018\end{cases}}}

where L {\displaystyle L} is the linear coordinate ( E R {\displaystyle E_{R}} , E G {\displaystyle E_{G}} , or E B {\displaystyle E_{B}} ), and V {\displaystyle V} is the corresponding non-linear value ( E R {\displaystyle E'_{R}} , E G {\displaystyle E'_{G}} , or E B {\displaystyle E'_{B}} ), both in the range [ 0 , 1 ] {\displaystyle \left} .

Non-linear decoding

In order to display the colors on a device, such as a HDTV monitor, the encoded values ( E R , E G , E B ) {\displaystyle (E'_{R},E'_{G},E'_{B})} should be converted back to physical intensities of the primaries. Mathematically, the inverse of the non-linear encoding above would be

L = { V / 4.5 if  V < 0.081 ( V + 0.099 1.099 ) 1 / 0.45 if  V 0.081 {\displaystyle L={\begin{cases}V/4.5&{\mbox{if }}V<0.081\\\left({\dfrac {V+0.099}{1.099}}\right)^{1/0.45}&{\mbox{if }}V\geq 0.081\end{cases}}}

However, the BT.709 standard does not specify this conversion (sometimes referred as the "display gamma"). In practice, it depends on various factors such as the capabilities of the monitor, the viewing conditions, and desired visual effects (such as contrast or saturation stretching). The standard response for HDTV monitors is covered in standards ITU-R BT.1886 and EBU Tech 3320.

The Y'C'BC'R color space

The BT.709 standard also defines an alternative representation of colors by three coordinates ( E Y , E C B , E C R ) {\displaystyle (E'_{Y},E'_{CB},E'_{CR})} which are linear combinations of the (non-linear) RGB coordinates ( E R , E G , E B ) {\displaystyle (E'_{R},E'_{G},E'_{B})} . Namely,

E Y = 0.2126 E R + 0.7152 E G + 0.0722 E B {\displaystyle E'_{Y}=0.2126\,E'_{R}+0.7152\,E'_{G}+0.0722\,E'_{B}}
E C B = E B E Y 1.8556 = 1 1.8556 ( 0.2126 E R 0.7152 E G + 0.9278 E B ) {\displaystyle E'_{CB}={\frac {E'_{B}-E'_{Y}}{1.8556}}={\frac {1}{1.8556}}(-0.2126\,E'_{R}-0.7152\,E'_{G}+0.9278\,E'_{B})}
E C R = E R E Y 1.5748 = 1 1.5748 ( + 0.7874 E R 0.7152 E G 0.0722 E B ) {\displaystyle E'_{CR}={\frac {E'_{R}-E'_{Y}}{1.5748}}={\frac {1}{1.5748}}(+0.7874\,E'_{R}-0.7152\,E'_{G}-0.0722\,E'_{B})}

The value E Y {\displaystyle E'_{Y}} is called "luminance" in the standard, and is roughly an approximation of the CIE Y coordinate (which is presumed to measure the perceptual brightness of the color) modified by the non-linear function above. However, since E Y {\displaystyle E'_{Y}} is computed from the non-linear RGB components, this equivalence is correct only for shades of gray. The other two coordinates indicate the "blueness" and "redness" of the color's hue.

According to these formulas, as E R {\displaystyle E'_{R}} , E G {\displaystyle E'_{G}} , and E B {\displaystyle E'_{B}} vary between 0 and 1, the luminance E Y {\displaystyle E'_{Y}} will vary between 0 and 1, while E C B {\displaystyle E'_{CB}} and E C R {\displaystyle E'_{CR}} will vary between 0.5 {\displaystyle -0.5} and + 0.5 {\displaystyle +0.5}

Quantization

For digital storage, transmission, and processing, the BT.709 standard specifies that the non-linear color coordinates E R {\displaystyle E'_{R}} , E G {\displaystyle E'_{G}} , E B {\displaystyle E'_{B}} , E Y {\displaystyle E'_{Y}} , E C B {\displaystyle E'_{CB}} , and E C R {\displaystyle E'_{CR}} shall be converted into integers D R {\displaystyle D'_{R}} , D G {\displaystyle D'_{G}} , D B {\displaystyle D'_{B}} , D Y {\displaystyle D'_{Y}} , D C B {\displaystyle D'_{CB}} , and D C R {\displaystyle D'_{CR}} with a fixed number n {\displaystyle n} of bits, either 8 or 10. This quantization shall be performed by simple scaling and rounding, so as to yield integers that span a proper subset of the n {\displaystyle n} -bit integers. Specifically,

D R = round ( ( 219 E R + 16 ) 2 n 8 ) {\displaystyle D'_{R}={\mbox{round}}((219\,E'_{R}+16)\,2^{n-8})}

and similarly for D G {\displaystyle D'_{G}} , D B {\displaystyle D'_{B}} , D Y {\displaystyle D'_{Y}} ; whereas

D C B = round ( ( 224 E C B + 128 ) 2 n 8 ) {\displaystyle D'_{CB}={\mbox{round}}((224\,E'_{CB}+128)\,2^{n-8})}

and similarly for D C R {\displaystyle D'_{CR}} . The round {\displaystyle {\mbox{round}}} function should round the argument to the nearest integer, with ties rounded up (that is, round ( 3.4999 ) = 3 {\displaystyle {\mbox{round}}(3.4999)=3} and round ( 3.5000 ) = 4 {\displaystyle {\mbox{round}}(3.5000)=4} .

These quantization formulas are the same as those defined in ITU-R BT.601. As implied by these formulas, the signals E R {\displaystyle E'_{R}} , E G {\displaystyle E'_{G}} , E B {\displaystyle E'_{B}} , and E Y {\displaystyle E'_{Y}} are mapped from the range [ 0 , 1 ] {\displaystyle } to 8-bit integers in ; while E C B {\displaystyle E'_{CB}} and E C R {\displaystyle E'_{CR}} are mapped from the range [ 0.5 , + 0.5 ] {\displaystyle } to integers in , with 0 mapped to 128. For n = 10 {\displaystyle n=10} bits, the quantized values range in and , respectively.

It follows that in 8-bit R'G'B' the color black is represented as (16,16,16) while white is (235, 235, 235). In 8-bit Y'C'BC'R, black is (16, 128, 128) and white is (235, 128, 128).

Quantized color coordinates outside the nominal ranges above are allowed, but typically they would be clamped for broadcast or for display (except for Superwhite and xvYCC). However, the 8-bit values 0 and 255 and the 10 bit values 0..3 and 1020..1023 are reserved for timing marks (SAV and EAV) and may not appear in color data.

History

The creation of a worldwide HDTV standard was approved in 1989 by the Comité consultatif international pour la radio (CCIR) as "Recommendation XA/11 MOD F". The first official version of the standard was approved in 1990 by the CCIR, under the name "Recommendation 709". The CCIR became the ITU-R in 1992, and released a new version of the standard (BT.709-1) in November 1993. These early versions still left many unanswered questions, and the lack of consensus toward a worldwide HDTV standard was evident. So much so, some early HDTV systems such as 1035i30 and 1152i25 were still a part of the standard as late as 2002 in BT.709-5.

The most recent version is BT.709-6 released in 2015.

The standard strictly determined the picture size but offered several options for the pixel scanning order and frame rate. This flexibility allows BT.709 to become the worldwide standard for HDTV. This allows manufacturers to create a single television set or display for all markets world-wide.

Justification for the non-linear encoding

The BT.709 standard calls the non-linear encoding of ( E R , E G , E B ) {\displaystyle (E_{R},E_{G},E_{B})} to ( E R , E G , E B ) {\displaystyle (E'_{R},E'_{G},E'_{B})} the optical electrical transfer function because it was meant to resemble the conversion of light intensity into analog electrical signals implemented by older non-digital cameras. It had long been known that a non-linear encoding of colors was more efficient than a linear one because human vision is more sensitive to brightness changes at low light levels. That conversion was commonly described as a power law V = L γ {\displaystyle V=L^{\gamma }} with exponent γ {\displaystyle \gamma } near 0.5 (hence the common names "gamma correction" or "camera gamma" for the encoding function). Indeed, the BT.709 encoding function is close to a power law with exponent near 1/2.35.

The BT.709 encoding function is not a simple power law because the latter has infinite slope at the origin, which emphasizes camera noise and is problematic for analog-to-digital converters. Thus the standard opted for a piecewise function that combines a simple linear function V = a L {\displaystyle V=a\,L} for low light levels and a shifted power law ( ( L + c ) / ( 1 + c ) ) γ {\displaystyle ((L+c)/(1+c))^{\gamma }} for larger values. Having chosen 0.45 as the exponent γ {\displaystyle \gamma } and 4.5 as the slope of the linear part, the conditions for the function to be continuous (without sudden jumps) and smooth (without sudden changes of slope) at the break point L = L 0 {\displaystyle L=L_{0}} are

{ 4.5 L 0 = ( 1 + c ) L 0 0.45 c 4.5 = 0.45 ( 1 + c ) L 0 0.55 {\displaystyle {\begin{cases}4.5\,L_{0}=(1+c)\,L_{0}^{0.45}-c\\4.5=0.45(1+c)\,L_{0}^{-0.55}\end{cases}}}

The solution of these equations is c 1.099296826809442... {\displaystyle c\approx 1.099296826809442...} and L 0 0.018053968510807... {\displaystyle L_{0}\approx 0.018053968510807...} These values were rounded to 0.099 and 0.018, respectively.

Standards conversion

Conversion between different standards of video frame rates and color encoding has always been a challenge for content producers distributing through regions with different standards and requirements. While BT.709 has eased the compatibility issue in terms of the consumer and television set manufacturer, broadcast facilities still use a particular frame rate based on region, such as 29.97 in North America, or 25 in Europe meaning that broadcast content still requires at least frame rate conversion.

Color gamuts

The BT.709 red and blue primaries are the same as the EBU Tech 3213 (PAL) primaries. The yG coordinate too is the same, while xG is halfway between EBU Tech 3213's xG and SMPTE C's xG.

The resulting BT.709 color space is almost identical to that of the BT.601-6 used by PAL and NTSC, and covers 35.9% of it. It also covers 33.24% of the CIE 1976 u’v’ space and 33.5% of the CIE 1931 x y diagram.

Converting standard definition

The vast legacy library of standard-definition programs and content presents further challenges. NTSC, PAL, and SECAM are all interlaced formats in a 4:3 aspect ratio, and at a relatively low resolution. Scaling them up to HD resolution with a 16:9 aspect ratio presents a number of challenges.

First is the potential for distracting motion artifacts due to interlaced video content. The solution is to either up-convert only to an interlaced BT.709 format at the same field rate, and scale the fields independently, or use motion processing to remove the inter-field motion and deinterlace, creating progressive frames. In the latter case, motion processing can introduce artifacts and can be slow to process.

Second is the issue of accommodating the SD 4:3 aspect ratio into the HD 16:9 frame. Cropping the top and/or bottom of the standard-definition frame may or may not work, depending on if the composition allows it and if there are graphics or titles that would be cut off. Alternately, pillar-boxing can show the entire 4:3 image by leaving black borders on the left and right. Sometimes this black is filled with a stretched and blurred form of the image.

In addition, the SMPTE C RGB primaries used in North American standard definition are different than those of BT.709 (SMPTE C is commonly referred to as NTSC, however it is a different set of primaries and a different white point than the 1953 NTSC.). The red and blue primaries for PAL and SECAM are the same as BT.709, with a change in the green primary. Converting the image precisely requires a LUT (lookup table) or a color managed workflow to convert the colors to the new colorspace. However, in practice this is often ignored, except in mpv, because even if the player is color managed (most of them are not, including VLC), it can see BT.709 or BT.2020 primaries only.

Luma coefficients

When encoding Y’CBCR video, BT.709 creates gamma-encoded luma (Y’) using matrix coefficients 0.2126, 0.7152, and 0.0722 (together they add to 1). BT.709-1 used slightly different 0.2125, 0.7154, 0.0721 (changed to standard ones in BT.709-2). Although worldwide agreement on a single R’G’B’ system was achieved with Rec. 709, adoption of different luma coefficients (as those are derived from primaries and white point) for Y’CBCR requires the use of different luma-chroma decoding for standard definition and high definition.

Conversion software and hardware

These problems can be handled with video processing software which can be slow, or hardware solutions which allow for realtime conversion, and often with quality improvements.

Film retransfer

A more ideal solution is to go back to original film elements for projects that originated on film. Due to the legacy issues of international distribution, many television programs that shot on film used a traditional negative cutting process, and then had a single film master that could be telecined for different formats. These projects can re-telecine their cut negative masters to a BT.709 master at a reasonable cost, and gain the benefit of the full resolution of film.

On the other hand, for projects that originated on film, but completed their online master using video online methods would need to re-telecine the individual needed film takes and then re-assemble, a significantly greater amount of labor and machine time is required in this case, versus a telecine for a conformed negative. In this case, to enjoy the benefits of the film original would entail much higher costs to conform the film originals to a new HD master.

Comparison to sRGB

sRGB was created after the early development of Rec.709. The creators of sRGB chose to use the same primaries and white point as Rec.709, but changed the tone response curve (sometimes referred to as gamma) to better suit the intended use in offices and brighter conditions than television viewing in a dark living room.

Rec. 709 and sRGB share the same primary chromaticities and white point chromaticity; however, sRGB is explicitly output (display) referred with an equivalent gamma of 2.2 (the actual function is also piecewise to avoid near black issues). Display P3 uses sRGB EOTF with its linear segment, a change of that segment from 709 is needed by either using parametric curve encoding of ICC v4 or by using slope limit.

See also

References

  1. ^ "BT.709 : Parameter values for the HDTV standards for production and international programme exchange". www.itu.net. n.d. Archived from the original on October 27, 2022. Retrieved April 19, 2021.
  2. ^ "BT.709: Basic parameter values for the HDTV standard for the studio and for international programme exchange". www.itu.net. Rec. 709-01. November 16, 1993. Archived from the original on December 26, 2022. Retrieved April 19, 2021.
  3. ^ "BT.709 : Parameter values for the HDTV standards for production and international programme exchange". www.itu.net. Rec. 709-06. June 7, 2015. Archived from the original on May 25, 2022. Retrieved April 19, 2021.
  4. Jeff Poskanzer (): PBM/PGM/PPM image format
  5. "Rec.709-6" (PDF). ITU-R. Archived (PDF) from the original on November 26, 2022. Retrieved December 10, 2020.
  6. ^ "ITU-R BT.709-5". ITU-R. Archived from the original on May 9, 2021. Retrieved December 10, 2020.
  7. ^ EBU Tech 3320 Version 4.1 USER REQUIREMENTS FOR VIDEO MONITORS IN TELEVISION PRODUCTION, p. 11,
  8. "BT.1886:10 Questions, 10 Answers" (PDF). Archived (PDF) from the original on February 13, 2021.
  9. "ChromaPure Video Calibration Software". www.chromapure.com. Archived from the original on July 24, 2021. Retrieved March 28, 2021.
  10. "ITU-R BT.2035 A reference viewing environment for evaluation of HDTV program material or completed programmes". Archived from the original on October 29, 2021. Retrieved February 7, 2021.
  11. ITU-R BT.1886 Reference electro-optical transfer function for flat panel displays used in HDTV studio production Archived February 12, 2021, at the Wayback Machine
  12. ITU-R Rec. BT.601-5, 1995
  13. "SpectraCal • View topic - YCbCr v RGB, Which color space is bible?". September 12, 2015. Archived from the original on September 12, 2015. Retrieved April 28, 2021.
  14. "BT.2020: Parameter values for ultra-high definition television systems for production and international programme exchange". www.itu.int. Archived from the original on January 21, 2021. Retrieved January 16, 2021.
  15. "Conclusions of the extraordinary meeting of study group 11 on high-definition television" (PDF). 1989. Archived (PDF) from the original on September 30, 2022. Retrieved March 22, 2021.
  16. "Gamma 2.2 vs Gamma 2.4 - How, Why and When (in DaVinci Resolve)?". Mixing Light. Archived from the original on May 26, 2022. Retrieved April 29, 2022.
  17. Roberts, A. BBC RD 1991/6 Methods of Measuring and Calculating Display Transfer Characteristics (PDF) (Report). BBC. p. 1. Archived (PDF) from the original on July 5, 2024. Retrieved September 3, 2024.
  18. "H.273: Coding-independent code points for video signal type identification". www.itu.int. Archived from the original on February 4, 2018. Retrieved January 16, 2021.
  19. ""Super Hi-Vision" as Next-Generation Television and Its Video Parameters". Information Display. Archived from the original on January 12, 2013. Retrieved January 1, 2013.
  20. Xu Yan; Li Yan; Li Guiling (May 2009). "A kind of nonlinear quantization method to extend the color gamut of DTV system". 2009 IEEE 13th International Symposium on Consumer Electronics. pp. 141–143. doi:10.1109/ISCE.2009.5156953. ISBN 978-1-4244-2975-2. S2CID 5922384. Archived from the original on February 12, 2021. Retrieved February 5, 2021.
  21. ^ "The Pointer's Gamut - The coverage of real surface colors by RGB color spaces and wide gamut - TFT Central". www.tftcentral.co.uk. February 19, 2014. Archived from the original on May 6, 2021. Retrieved February 5, 2021.
  22. Pascale, Danny. A Review of RGB Color Spaces (PDF). BableColor. Archived (PDF) from the original on October 25, 2021. Retrieved October 19, 2021.
  23. "Open source LUT based color management". Open Color IO. OCIO. Archived from the original on October 27, 2021. Retrieved October 19, 2021.
  24. RP 177:1993 - SMPTE Recommended Practice - Derivation of Basic Television Color Equations. 1993. pp. 1–4. doi:10.5594/SMPTE.RP177.1993. ISBN 978-1-61482-191-5. Archived from the original on June 15, 2018. Retrieved April 10, 2021.
  25. "Luminance, luma, and the migration to DTV". poynton.ca. February 6, 1998. Archived from the original on April 17, 2021. Retrieved May 7, 2021.
  26. "Blackmagic Teranex Converter". Blackmagic Designs. Archived from the original on December 9, 2020. Retrieved December 10, 2020.
  27. Poynton, Charles (2012). Digital Video and HD Algorithms and Interfaces. Burlington, Mass.: Elsevire/Morgan Kaufmann. p. 321. ISBN 978-0-12-391926-7.

External links

Color space
CAM
CIE
RGB
Y′UV
Other
Color systems
and standards
For the vision capacities of organisms or machines, see  Color vision.
Categories: