Revision as of 21:11, 20 June 2023 edit103.162.244.21 (talk) →HistoryTag: Reverted← Previous edit | Latest revision as of 14:50, 19 November 2024 edit undoKvng (talk | contribs)Extended confirmed users, New page reviewers108,221 editsm unpiped links using script | ||
(25 intermediate revisions by 20 users not shown) | |||
Line 10: | Line 10: | ||
| extension = .L16, .WAV, .AIFF, .AU, .PCM<ref name="rfc2586">{{cite journal|first1=Harald Tveit |last1=Alvestrand |last2=Salsman |first2=James |url=http://tools.ietf.org/html/rfc2586 |title=RFC 2586 – The Audio/L16 MIME content type |date=May 1999 |publisher=The Internet Society |doi=10.17487/RFC2586 |access-date=2010-03-16}}</ref> | | extension = .L16, .WAV, .AIFF, .AU, .PCM<ref name="rfc2586">{{cite journal|first1=Harald Tveit |last1=Alvestrand |last2=Salsman |first2=James |url=http://tools.ietf.org/html/rfc2586 |title=RFC 2586 – The Audio/L16 MIME content type |date=May 1999 |publisher=The Internet Society |doi=10.17487/RFC2586 |access-date=2010-03-16}}</ref> | ||
| mime = audio/L16, audio/L8,<ref name="rfc4856">{{cite journal|first=S. |last=Casner |url=http://tools.ietf.org/html/rfc4856#page-17 |title=RFC 4856 – Media Type Registration of Payload Formats in the RTP Profile for Audio and Video Conferences – Registration of Media Type audio/L8 |date=March 2007 |publisher=The IETF Trust |doi=10.17487/RFC4856 |access-date=2010-03-16}}</ref> audio/L20, audio/L24<ref name="rfc3190">{{cite journal |last1=Bormann |first1=C. |last2=Casner |first2=S. |last3=Kobayashi |first3=K. |last4=Ogawa |first4=A. | | mime = audio/L16, audio/L8,<ref name="rfc4856">{{cite journal|first=S. |last=Casner |url=http://tools.ietf.org/html/rfc4856#page-17 |title=RFC 4856 – Media Type Registration of Payload Formats in the RTP Profile for Audio and Video Conferences – Registration of Media Type audio/L8 |date=March 2007 |publisher=The IETF Trust |doi=10.17487/RFC4856 |access-date=2010-03-16}}</ref> audio/L20, audio/L24<ref name="rfc3190">{{cite journal |last1=Bormann |first1=C. |last2=Casner |first2=S. |last3=Kobayashi |first3=K. |last4=Ogawa |first4=A. | ||
|url=http://tools.ietf.org/html/rfc3190 |title=RFC 3190 – RTP Payload Format for 12-bit DAT Audio and 20- and 24-bit Linear Sampled Audio |date=January 2002 |publisher=The Internet Society |doi=10.17487/RFC3190 |access-date=2010-03-16}}</ref><ref>{{cite web |url=https://www.iana.org/assignments/media-types/audio/ |title=Audio Media Types |publisher=Internet Assigned Numbers Authority |access-date=2010-03-16}}</ref> | |url=http://tools.ietf.org/html/rfc3190 |title=RFC 3190 – RTP Payload Format for 12-bit DAT Audio and 20- and 24-bit Linear Sampled Audio |date=January 2002 |publisher=The Internet Society |doi=10.17487/RFC3190 |access-date=2010-03-16|doi-access=free }}</ref><ref>{{cite web |url=https://www.iana.org/assignments/media-types/audio/ |title=Audio Media Types |publisher=Internet Assigned Numbers Authority |access-date=2010-03-16}}</ref> | ||
| type code = "AIFF" for L16,<ref name="rfc2586" /> none<ref name="rfc3190" /> | | type code = "AIFF" for L16,<ref name="rfc2586" /> none<ref name="rfc3190" /> | ||
| uniform type = | | uniform type = | ||
Line 20: | Line 20: | ||
| type = Uncompressed ] | | type = Uncompressed ] | ||
| container for = | | container for = | ||
| contained by = ], ], ], ], ], ], ], and many others | | contained by = ], ], ], ], ], ], ], and many others | ||
| extended from = | | extended from = | ||
| extended to = | | extended to = | ||
Line 30: | Line 30: | ||
{{Modulation techniques}} | {{Modulation techniques}} | ||
'''Pulse-code modulation''' ('''PCM''') is a method used to ] represent |
'''Pulse-code modulation''' ('''PCM''') is a method used to ] represent ]s. It is the standard form of ] in computers, ]s, ] and other digital audio applications. In a PCM ], the ] of the analog signal is ] at uniform intervals, and each sample is ] to the nearest value within a range of digital steps. ], ], ] and ] are credited with its invention.<ref>{{Cite book |last=Noll |first=A. Michael |url=https://books.google.com/books?id=rpkuAgAAQBAJ&dq=pulse+code+modulation+claude+shannon&pg=PA50 |title=Highway of Dreams: A Critical View Along the Information Superhighway |date=1997 |publisher=Erlbaum |isbn=978-0-8058-2557-2 |edition=Revised |series=Telecommunications |location=Mahwah, NJ |pages=50 |language=en}}</ref><ref>{{Cite web |last=Leibson |first=Steven |date=2021-09-07 |title=A Brief History of the Single-Chip DSP, Part I |url=https://www.eejournal.com/article/a-brief-history-of-the-single-chip-dsp-part-i/ |access-date=2024-09-19 |website=EEJournal |language=en-US}}</ref><ref>{{Cite book |last=Barrett |first=G. Douglas |url=https://books.google.com/books?id=r9-SEAAAQBAJ&dq=Audio+Engineering+claude+shannon&pg=PA102 |title=Experimenting the Human: Art, Music, and the Contemporary Posthuman |publisher=] |year=2023 |isbn=978-0-226-82340-9 |location=Chicago London |pages=102 |language=en}}</ref> | ||
'''Linear pulse-code modulation''' ('''LPCM''') is a specific type of PCM in which the quantization levels are linearly uniform.<ref name="LOC_LPCM" /> This is in contrast to PCM encodings in which quantization levels vary as a function of amplitude (as with the ] or the ]). Though ''PCM'' is a more general term, it is often used to describe data encoded as LPCM. | '''Linear pulse-code modulation''' ('''LPCM''') is a specific type of PCM in which the quantization levels are linearly uniform.<ref name="LOC_LPCM" /> This is in contrast to PCM encodings in which quantization levels vary as a function of amplitude (as with the ] or the ]). Though ''PCM'' is a more general term, it is often used to describe data encoded as LPCM. | ||
A PCM stream has two basic properties that determine the stream's fidelity to the original analog signal: the ], which is the number of times per second that samples are taken; and the ], which determines the number of possible digital values that can be used to represent each sample. | A PCM stream has two basic properties that determine the stream's fidelity to the original analog signal: the ], which is the number of times per second that samples are taken; and the ], which determines the number of possible digital values that can be used to represent each sample. | ||
==History== | ==History== | ||
Early electrical communications started to ] signals in order to ] samples from multiple ] sources and to convey them over a single telegraph cable. The American inventor ] conceived telegraph ] (TDM) as early as 1853. Electrical engineer W. M. Miner, in 1903, used an electro-mechanical ] for time-division multiplexing multiple telegraph signals; he also applied this technology to ]. He obtained intelligible speech from channels sampled at a rate above 3500–4300 Hz; lower rates proved unsatisfactory. | |||
In 1920, the ] used telegraph signaling of characters punched in paper tape to send samples of images ] to 5 levels.<ref name="digicamhistory">{{cite web |url=http://www.digicamhistory.com/1906_1920.html |title=The Bartlane Transmission System |publisher=DigicamHistory.com |access-date=7 January 2010| archive-url = https://web.archive.org/web/20100210053055/http://www.digicamhistory.com/1906_1920.html| archive-date=February 10, 2010}}</ref> In 1926, Paul M. Rainey of ] patented a ] |
In 1920, the ] used telegraph signaling of characters punched in paper tape to send samples of images ] to 5 levels.<ref name="digicamhistory">{{cite web |url=http://www.digicamhistory.com/1906_1920.html |title=The Bartlane Transmission System |publisher=DigicamHistory.com |access-date=7 January 2010| archive-url = https://web.archive.org/web/20100210053055/http://www.digicamhistory.com/1906_1920.html| archive-date=February 10, 2010}}</ref> In 1926, Paul M. Rainey of ] patented a ] that transmitted its signal using 5-bit PCM, encoded by an opto-mechanical ].<ref>U.S. patent number 1,608,527; also see p. 8, ''Data conversion handbook'', Walter Allan Kester, ed., Newnes, 2005, {{ISBN|0-7506-7841-0}}.</ref> The machine did not go into production.<ref name=Vardalas>{{citation |publisher=] |title=Pulse Code Modulation: It all Started 75 Years Ago with Alec Reeves |url= https://insight.ieeeusa.org/articles/your-engineering-heritage-pulse-code-modulation-it-all-started-75-years-ago-with-alec-reeves/ |date= June 2013 |author=John Vardalas}}</ref> | ||
British engineer ], unaware of previous work, conceived the use of PCM for voice communication in 1937 while working for ] in France. He described the theory and its advantages, but no practical application resulted. Reeves filed for a French patent in 1938, and his US patent was granted in 1943.<ref>{{cite patent |country=US |number=2272070}}</ref> By this time Reeves had started working at the ].<ref name=Vardalas/> | British engineer ], unaware of previous work, conceived the use of PCM for voice communication in 1937 while working for ] in France. He described the theory and its advantages, but no practical application resulted. Reeves filed for a French patent in 1938, and his US patent was granted in 1943.<ref>{{cite patent |country=US |number=2272070}}</ref> By this time Reeves had started working at the ].<ref name=Vardalas/> | ||
The first transmission of ] by digital techniques, the ] encryption equipment, conveyed high-level ] during ]. In 1943 the ] researchers who designed the SIGSALY system became aware of the use of PCM binary coding as already proposed by Reeves. In 1949, for the Canadian Navy's ] system, ] built a working PCM radio system that was able to transmit digitized radar data over long distances.<ref>{{cite book |author=Porter, Arthur |title=So Many Hills to Climb |date=2004 |publisher=Beckham Publications Group |isbn=9780931761188}}{{page needed|date=September 2017}}</ref> | The first transmission of ] by digital techniques, the ] encryption equipment, conveyed high-level ] during ]. In 1943 the ] researchers who designed the SIGSALY system became aware of the use of PCM binary coding as already proposed by Reeves. In 1949, for the Canadian Navy's ] system, ] built a working PCM radio system that was able to transmit digitized radar data over long distances.<ref>{{cite book |author=Porter, Arthur |title=So Many Hills to Climb |date=2004 |publisher=Beckham Publications Group |isbn=9780931761188}}{{page needed|date=September 2017}}</ref> | ||
PCM in the late 1940s and early 1950s used a ] ] with a ] having encoding perforations.<ref>{{cite book |url=https://archive.org/details/bstj27-1-44 |author=Sears, R. W. |work=Bell Systems Technical Journal |volume=27 |title=Electron Beam Deflection Tube for Pulse Code Modulation |pages=44–57 |publisher=] |date=January 1948 |access-date=14 May 2017}}</ref> As in an ], the beam was swept horizontally at the sample rate while the vertical deflection was controlled by the input analog signal, causing the beam to pass through higher or lower portions of the perforated plate. The plate collected or passed the beam, producing current variations in binary code, one bit at a time. Rather than natural binary, the grid of Goodall's later tube was perforated to produce a glitch-free ] and produced all bits simultaneously by using a fan beam instead of a scanning beam.<ref>{{cite book |url=https://archive.org/details/bstj30-1-33 |author=Goodall, W. M. |work=Bell Systems Technical Journal |volume=30 |title=Television by Pulse Code Modulation |pages=33–49 |publisher=] |date=January 1951 |access-date=14 May 2017}}</ref> | PCM in the late 1940s and early 1950s used a ] ] with a ] having encoding perforations.<ref>{{cite book |url=https://archive.org/details/bstj27-1-44 |author=Sears, R. W. |work=Bell Systems Technical Journal |volume=27 |title=Electron Beam Deflection Tube for Pulse Code Modulation |pages=44–57 |publisher=] |date=January 1948 |access-date=14 May 2017}}</ref> As in an ], the beam was swept horizontally at the sample rate while the vertical deflection was controlled by the input analog signal, causing the beam to pass through higher or lower portions of the perforated plate. The plate collected or passed the beam, producing current variations in binary code, one bit at a time. Rather than natural binary, the grid of Goodall's later tube was perforated to produce a glitch-free ] and produced all bits simultaneously by using a fan beam instead of a scanning beam.<ref>{{cite book |url=https://archive.org/details/bstj30-1-33 |author=Goodall, W. M. |work=Bell Systems Technical Journal |volume=30 |title=Television by Pulse Code Modulation |pages=33–49 |publisher=] |date=January 1951 |access-date=14 May 2017}}</ref> | ||
Line 92: | Line 92: | ||
The ] system, introduced in 1961, uses two twisted-pair transmission lines to carry 24 PCM ] calls sampled at 8 kHz and 8-bit resolution. This development improved capacity and call quality compared to the previous ] schemes. | The ] system, introduced in 1961, uses two twisted-pair transmission lines to carry 24 PCM ] calls sampled at 8 kHz and 8-bit resolution. This development improved capacity and call quality compared to the previous ] schemes. | ||
In 1973, ] (ADPCM) was developed, by P. Cummiskey, ] and ].<ref>P. Cummiskey, N. S. Jayant, and J. L. Flanagan, "Adaptive quantization in differential PCM coding of speech," Bell Syst. Tech. J., vol. 52, pp. |
In 1973, ] (ADPCM) was developed, by P. Cummiskey, ] and ].<ref>P. Cummiskey, N. S. Jayant, and J. L. Flanagan, "Adaptive quantization in differential PCM coding of speech," Bell Syst. Tech. J., vol. 52, pp. 1105–1118, Sept. 1973.</ref> | ||
===Digital audio recordings=== | ===Digital audio recordings=== | ||
{{Main|Digital audio|Digital recording}} | {{Main|Digital audio|Digital recording}} | ||
In 1967, the first PCM recorder was developed by ]'s research facilities in Japan.<ref name="Fine">{{cite journal |author=Thomas Fine |year=2008 |title=The dawn of commercial digital recording |journal=] |volume=39 |issue=1 |pages=1–17 |url=http://www.aes.org/aeshc/pdf/fine_dawn-of-digital.pdf}}</ref> The 30 kHz 12-bit device used a ] (similar to ]) to extend the dynamic range, and stored the signals on a ]. In 1969, NHK expanded the system's capabilities to 2-channel ] and 32 kHz 13-bit resolution. In January 1971, using NHK's PCM recording system, engineers at ] recorded the first commercial digital recordings.<ref group=note>Among the first recordings was ''Uzu: The World Of Stomu Yamash'ta 2'' by ].</ref><ref name="Fine"/> | In 1967, the first PCM recorder was developed by ]'s research facilities in Japan.<ref name="Fine">{{cite journal |author=Thomas Fine |year=2008 |title=The dawn of commercial digital recording |journal=] |volume=39 |issue=1 |pages=1–17 |url=http://www.aes.org/aeshc/pdf/fine_dawn-of-digital.pdf}}</ref> The 30 kHz 12-bit device used a ] (similar to ]) to extend the dynamic range, and stored the signals on a ]. In 1969, NHK expanded the system's capabilities to 2-channel ] and 32 kHz 13-bit resolution. In January 1971, using NHK's PCM recording system, engineers at ] recorded the first commercial digital recordings.<ref group=note>Among the first recordings was ''Uzu: The World Of Stomu Yamash'ta 2'' by ].</ref><ref name="Fine"/> | ||
In 1972, Denon unveiled the first 8-channel digital recorder, the DN-023R, which used a 4-head open reel broadcast video tape recorder to record in 47.25 kHz, 13-bit PCM audio.<ref group=note>The first recording with this new system was recorded in ] during April 24–26, 1972.</ref> In 1977, Denon developed the portable PCM recording system, the DN-034R. Like the DN-023R, it recorded 8 channels at 47.25 kHz, but it used 14-bits "with ], making it equivalent to 15.5 bits."<ref name="Fine"/> | In 1972, Denon unveiled the first 8-channel digital recorder, the DN-023R, which used a 4-head open reel broadcast video tape recorder to record in 47.25 kHz, 13-bit PCM audio.<ref group=note>The first recording with this new system was recorded in ] during April 24–26, 1972.</ref> In 1977, Denon developed the portable PCM recording system, the DN-034R. Like the DN-023R, it recorded 8 channels at 47.25 kHz, but it used 14-bits "with ], making it equivalent to 15.5 bits."<ref name="Fine"/> | ||
Line 108: | Line 108: | ||
{{Main|Digital telephony}} | {{Main|Digital telephony}} | ||
The rapid development and wide adoption of PCM ] was enabled by ] (MOS) ] (SC) circuit technology, developed in the early 1970s.<ref name="Allstot">{{cite book |last1=Allstot |first1=David J. |chapter=Switched Capacitor Filters |editor-last1=Maloberti |editor-first1=Franco |editor-last2=Davies |editor-first2=Anthony C. |title=A Short History of Circuits and Systems: From Green, Mobile, Pervasive Networking to Big Data Computing |date=2016 |publisher=] |isbn=9788793609860 |pages=105–110 |chapter-url=https://ieee-cas.org/sites/default/files/a_short_history_of_circuits_and_systems-_ebook-_web.pdf}}</ref> This led to the development of PCM codec-filter chips in the late 1970s.<ref name="Allstot"/><ref name="Gibson26">{{cite book |last1=Floyd |first1=Michael D. |last2=Hillman |first2=Garth D. |chapter=Pulse-Code Modulation Codec-Filters |title=The Communications Handbook |edition=2nd |date=8 October 2018 |orig-year=1st pub. 2000 |pages=26-1, 26-2, 26-3 |publisher=] |isbn=9781420041163 |chapter-url=https://books.google.com/books?id=Tokk5bZxB0MC&pg=SA26-PA1}}</ref> The ] ] (complementary MOS) PCM codec-filter chip, developed by ] and W.C. Black in 1980,<ref name="Allstot"/> has since been the industry standard for digital telephony.<ref name="Allstot"/><ref name="Gibson26"/> By the 1990s, ]s such as the ] (PSTN) had been largely ] with ] (VLSI) CMOS PCM codec-filters, widely used in ]s for ], user-end ] and a wide range of ] applications such as the ] (ISDN), ] and ].<ref name="Gibson26"/> | The rapid development and wide adoption of PCM ] was enabled by ] (MOS) ] (SC) circuit technology, developed in the early 1970s.<ref name="Allstot">{{cite book |last1=Allstot |first1=David J. |chapter=Switched Capacitor Filters |editor-last1=Maloberti |editor-first1=Franco |editor-last2=Davies |editor-first2=Anthony C. |title=A Short History of Circuits and Systems: From Green, Mobile, Pervasive Networking to Big Data Computing |date=2016 |publisher=] |isbn=9788793609860 |pages=105–110 |chapter-url=https://ieee-cas.org/sites/default/files/a_short_history_of_circuits_and_systems-_ebook-_web.pdf |access-date=November 29, 2019 |archive-date=September 30, 2021 |archive-url=https://web.archive.org/web/20210930151716/https://ieee-cas.org/sites/default/files/a_short_history_of_circuits_and_systems-_ebook-_web.pdf |url-status=dead }}</ref> This led to the development of PCM codec-filter chips in the late 1970s.<ref name="Allstot"/><ref name="Gibson26">{{cite book |last1=Floyd |first1=Michael D. |last2=Hillman |first2=Garth D. |chapter=Pulse-Code Modulation Codec-Filters |title=The Communications Handbook |edition=2nd |date=8 October 2018 |orig-year=1st pub. 2000 |pages=26-1, 26-2, 26-3 |publisher=] |isbn=9781420041163 |chapter-url=https://books.google.com/books?id=Tokk5bZxB0MC&pg=SA26-PA1}}</ref> The ] ] (complementary MOS) PCM codec-filter chip, developed by ] and W.C. Black in 1980,<ref name="Allstot"/> has since been the industry standard for digital telephony.<ref name="Allstot"/><ref name="Gibson26"/> By the 1990s, ]s such as the ] (PSTN) had been largely ] with ] (VLSI) CMOS PCM codec-filters, widely used in ]s for ], user-end ] and a wide range of ] applications such as the ] (ISDN), ] and ].<ref name="Gibson26"/> | ||
==Implementations== | ==Implementations== | ||
Line 117: | Line 117: | ||
* ] (specified in 1985, upon which ] is based) is a particular format using LPCM. | * ] (specified in 1985, upon which ] is based) is a particular format using LPCM. | ||
* ]s with digital sound have an LPCM track on the digital channel. | * ]s with digital sound have an LPCM track on the digital channel. | ||
* On PCs, PCM and LPCM often refer to the format used in ] (defined in 1991) and ] audio container formats (defined in 1988). LPCM data may also be stored in other formats such as ], ] (header-less file) and various multimedia ]. | * On PCs, PCM and LPCM often refer to the format used in ] (defined in 1991) and ] audio container formats (defined in 1988). LPCM data may also be stored in other formats such as ], ] (header-less file) and various multimedia ]. | ||
* LPCM has been defined as a part of the ] (since 1995) and ] (since 2006) standards.<ref name="bd">{{citation |url=http://www.blu-raydisc.com/Assets/Downloadablefile/2b_bdrom_audiovisualapplication_0305-12955-15269.pdf |title=White paper Blu-ray Disc Format – 2.B Audio Visual Application Format Specifications for BD-ROM |author=Blu-ray Disc Association |date=March 2005 |access-date=2009-07-26}}</ref><ref>{{cite web |url=http://www.mpeg.org/MPEG/DVD/Book_B/Audio.html |title=DVD Technical Notes (DVD Video – "Book B") – Audio data specifications |date=1996-07-21 |access-date=2010-03-16}}</ref><ref>{{cite web |url=http://dvddemystified.com/dvdfaq.html#3.6.2 |title=DVD Frequently Asked Questions (and Answers) – Audio details of DVD-Video |author=Jim Taylor |access-date=2010-03-20}}</ref> It is also defined as a part of various digital video and audio storage formats (e.g. ] since 1995,<ref>{{cite web |url=http://seaspray.trinity-bris.ac.uk/~altwfaq/graphics/video/1394/1394formats.html |title=How DV works |archive-url=https://web.archive.org/web/20071206032412/http://seaspray.trinity-bris.ac.uk/~altwfaq/graphics/video/1394/1394formats.html |archive-date=2007-12-06 |access-date=2010-03-21}}</ref> ] since 2006<ref>{{cite web |url=http://www.avchd-info.org/format/index.html |title=AVCHD Information Website – AVCHD format specification overview |access-date=2010-03-21}}</ref>). | * LPCM has been defined as a part of the ] (since 1995) and ] (since 2006) standards.<ref name="bd">{{citation |url=http://www.blu-raydisc.com/Assets/Downloadablefile/2b_bdrom_audiovisualapplication_0305-12955-15269.pdf |title=White paper Blu-ray Disc Format – 2.B Audio Visual Application Format Specifications for BD-ROM |author=Blu-ray Disc Association |date=March 2005 |access-date=2009-07-26}}</ref><ref>{{cite web |url=http://www.mpeg.org/MPEG/DVD/Book_B/Audio.html |title=DVD Technical Notes (DVD Video – "Book B") – Audio data specifications |date=1996-07-21 |access-date=2010-03-16}}</ref><ref>{{cite web |url=http://dvddemystified.com/dvdfaq.html#3.6.2 |title=DVD Frequently Asked Questions (and Answers) – Audio details of DVD-Video |author=Jim Taylor |access-date=2010-03-20}}</ref> It is also defined as a part of various digital video and audio storage formats (e.g. ] since 1995,<ref>{{cite web |url=http://seaspray.trinity-bris.ac.uk/~altwfaq/graphics/video/1394/1394formats.html |title=How DV works |archive-url=https://web.archive.org/web/20071206032412/http://seaspray.trinity-bris.ac.uk/~altwfaq/graphics/video/1394/1394formats.html |archive-date=2007-12-06 |access-date=2010-03-21}}</ref> ] since 2006<ref>{{cite web |url=http://www.avchd-info.org/format/index.html |title=AVCHD Information Website – AVCHD format specification overview |access-date=2010-03-21}}</ref>). | ||
* LPCM is used by ] (defined in 2002), a single-cable digital audio/video connector interface for transmitting uncompressed digital data. | * LPCM is used by ] (defined in 2002), a single-cable digital audio/video connector interface for transmitting uncompressed digital data. | ||
* ] container format (defined in 2007) uses LPCM and also allows non-PCM bitstream storage: various compression formats contained in the RF64 file as data bursts (Dolby E, Dolby AC3, DTS, MPEG-1/MPEG-2 Audio) can be "disguised" as PCM linear.<ref>{{citation |url=http://tech.ebu.ch/docs/tech/tech3306-2009.pdf |title=EBU Tech 3306 – MBWF / RF64: An Extended File Format for Audio |date=July 2009 |author=EBU |access-date=2010-01-19}}</ref> | * ] container format (defined in 2007) uses LPCM and also allows non-PCM bitstream storage: various compression formats contained in the RF64 file as data bursts (Dolby E, Dolby AC3, DTS, MPEG-1/MPEG-2 Audio) can be "disguised" as PCM linear.<ref>{{citation |url=http://tech.ebu.ch/docs/tech/tech3306-2009.pdf |title=EBU Tech 3306 – MBWF / RF64: An Extended File Format for Audio |date=July 2009 |author=EBU |access-date=2010-01-19 |archive-date=November 22, 2009 |archive-url=https://web.archive.org/web/20091122155436/http://tech.ebu.ch/docs/tech/tech3306-2009.pdf |url-status=dead }}</ref> | ||
==Modulation== | ==Modulation== | ||
] | ] | ||
In the diagram, a ] (red curve) is sampled and quantized for PCM. The sine wave is sampled at regular intervals, shown as vertical lines. For each sample, one of the available values (on the y-axis) is chosen. The PCM process is commonly implemented on a single ] called an ] (ADC). This produces a fully discrete representation of the input signal (blue points) that can be easily encoded as digital data for storage or manipulation. Several PCM streams could also be multiplexed into a larger aggregate ], generally for transmission of multiple streams over a single physical link. One technique is called ] (TDM) and is widely used, notably in the modern public telephone system. | In the diagram, a ] (red curve) is sampled and quantized for PCM. The sine wave is sampled at regular intervals, shown as vertical lines. For each sample, one of the available values (on the y-axis) is chosen. The PCM process is commonly implemented on a single ] called an ] (ADC). This produces a fully discrete representation of the input signal (blue points) that can be easily encoded as digital data for storage or manipulation. Several PCM streams could also be multiplexed into a larger aggregate ], generally for transmission of multiple streams over a single physical link. One technique is called ] (TDM) and is widely used, notably in the modern public telephone system. | ||
Line 130: | Line 130: | ||
The electronics involved in producing an accurate analog signal from the discrete data are similar to those used for generating the digital signal. These devices are ]s (DACs). They produce a ] or ] (depending on type) that represents the value presented on their digital inputs. This output would then generally be filtered and amplified for use. | The electronics involved in producing an accurate analog signal from the discrete data are similar to those used for generating the digital signal. These devices are ]s (DACs). They produce a ] or ] (depending on type) that represents the value presented on their digital inputs. This output would then generally be filtered and amplified for use. | ||
To recover the original signal from the sampled data, a ''demodulator'' can apply the procedure of modulation in reverse. After each sampling period, the demodulator reads the next value and transitions the output signal to the new value. As a result of these transitions, the signal retains a significant amount of high-frequency energy due to imaging effects. To remove these undesirable frequencies, the demodulator passes the signal through a ] that suppresses energy outside the expected frequency range (greater than the ] <math>f_s / 2 </math>).<ref group=note>Some systems use ]ing to remove some of the aliasing, converting the signal from digital to analog at a higher sample rate such that the analog ] is much simpler. In some systems, no explicit filtering is done at all; as it |
To recover the original signal from the sampled data, a ''demodulator'' can apply the procedure of modulation in reverse. After each sampling period, the demodulator reads the next value and transitions the output signal to the new value. As a result of these transitions, the signal retains a significant amount of high-frequency energy due to imaging effects. To remove these undesirable frequencies, the demodulator passes the signal through a ] that suppresses energy outside the expected frequency range (greater than the ] <math>f_s / 2 </math>).<ref group=note>Some systems use ]ing to remove some of the aliasing, converting the signal from digital to analog at a higher sample rate such that the analog ] is much simpler. In some systems, no explicit filtering is done at all; as it is impossible for any system to reproduce a signal with infinite bandwidth, inherent losses in the system compensate for the artifacts — or the system simply does not require much precision.</ref> | ||
==Standard sampling precision and rates== | ==Standard sampling precision and rates== | ||
Line 137: | Line 137: | ||
LPCM encodes a single sound channel. Support for multichannel audio depends on file format and relies on synchronization of multiple LPCM streams.<ref name=LOC_LPCM/><ref>{{Cite web|publisher=Library of Congress |url=https://www.loc.gov/preservation/digital/formats/fdd/fdd000016.shtml |title=PCM, Pulse Code Modulated Audio |date=April 6, 2022 |access-date=2022-09-05}}</ref> While two channels (stereo) is the most common format, systems can support up to 8 audio channels (7.1 surround)<ref name="rfc4856"/><ref name="rfc3190"/> or more. | LPCM encodes a single sound channel. Support for multichannel audio depends on file format and relies on synchronization of multiple LPCM streams.<ref name=LOC_LPCM/><ref>{{Cite web|publisher=Library of Congress |url=https://www.loc.gov/preservation/digital/formats/fdd/fdd000016.shtml |title=PCM, Pulse Code Modulated Audio |date=April 6, 2022 |access-date=2022-09-05}}</ref> While two channels (stereo) is the most common format, systems can support up to 8 audio channels (7.1 surround)<ref name="rfc4856"/><ref name="rfc3190"/> or more. | ||
Common sampling frequencies are 48 ] as used with ] format videos, or 44.1 kHz as used in CDs. Sampling frequencies of 96 kHz or 192 kHz can be used on some equipment, but the benefits have been debated.<ref>{{Cite web|last=Christopher|first=Montgometry|title=24/192 Music Downloads, and why they do not make sense|url=http://people.xiph.org/~xiphmont/demo/neil-young.html|url-status=dead|archive-url=https://web.archive.org/web/20140906115306/http://people.xiph.org/~xiphmont/demo/neil-young.html|archive-date=2014-09-06|access-date=2013-03-16|publisher=Chris "Monty" Montgomery}}</ref> | Common sampling frequencies are 48 ] as used with ] format videos, or 44.1 kHz as used in CDs. Sampling frequencies of 96 kHz or 192 kHz can be used on some equipment, but the benefits have been debated.<ref>{{Cite web|last=Christopher|first=Montgometry|title=24/192 Music Downloads, and why they do not make sense|url=http://people.xiph.org/~xiphmont/demo/neil-young.html|url-status=dead|archive-url=https://web.archive.org/web/20140906115306/http://people.xiph.org/~xiphmont/demo/neil-young.html|archive-date=2014-09-06|access-date=2013-03-16|publisher=Chris "Monty" Montgomery}}</ref> | ||
==Limitations== | ==Limitations== | ||
The ] shows PCM devices can operate without introducing distortions within their designed frequency bands if they provide a sampling frequency at least twice that of the highest frequency contained in the input signal. For example, in ], the usable ] band ranges from approximately 300 ] to 3400 Hz.<ref>https://www.its.bldrdoc.gov/fs-1037/dir-039/_5829.htm{{fv|reason=This source says 4k|date=August 2020}}</ref> For effective reconstruction of the voice signal, telephony applications therefore typically use an 8000 Hz sampling frequency which is more than twice the highest usable voice frequency. | The ] shows PCM devices can operate without introducing distortions within their designed frequency bands if they provide a sampling frequency at least twice that of the highest frequency contained in the input signal. For example, in ], the usable ] band ranges from approximately 300 ] to 3400 Hz.<ref>https://www.its.bldrdoc.gov/fs-1037/dir-039/_5829.htm{{fv|reason=This source says 4k|date=August 2020}}</ref> For effective reconstruction of the voice signal, telephony applications therefore typically use an 8000 Hz sampling frequency which is more than twice the highest usable voice frequency. | ||
Regardless, there are potential sources of impairment implicit in any PCM system: | Regardless, there are potential sources of impairment implicit in any PCM system: | ||
Line 151: | Line 151: | ||
* Linear PCM (LPCM) is PCM with linear quantization.<ref name="LOC_LPCM" /> | * Linear PCM (LPCM) is PCM with linear quantization.<ref name="LOC_LPCM" /> | ||
* ] (DPCM) encodes the PCM values as differences between the current and the predicted value. An algorithm predicts the next sample based on the previous samples, and the encoder stores only the difference between this prediction and the actual value. If the prediction is reasonable, fewer bits can be used to represent the same information. For audio, this type of encoding reduces the number of bits required per sample by about 25% compared to PCM. | * ] (DPCM) encodes the PCM values as differences between the current and the predicted value. An algorithm predicts the next sample based on the previous samples, and the encoder stores only the difference between this prediction and the actual value. If the prediction is reasonable, fewer bits can be used to represent the same information. For audio, this type of encoding reduces the number of bits required per sample by about 25% compared to PCM. | ||
* ] (ADPCM) is a variant of DPCM that varies the size of the quantization step, to allow further reduction of the required bandwidth for a given ]. | * ] (ADPCM) is a variant of DPCM that varies the size of the quantization step, to allow further reduction of the required bandwidth for a given ]. | ||
* ] is a form of DPCM that uses one bit per sample to indicate whether the signal is increasing or decreasing compared to the previous sample. | * ] is a form of DPCM that uses one bit per sample to indicate whether the signal is increasing or decreasing compared to the previous sample. | ||
Line 165: | Line 165: | ||
{{See also|T-carrier|E-carrier}} | {{See also|T-carrier|E-carrier}} | ||
PCM can be either ] (RZ) or ] (NRZ). For a NRZ system to be synchronized using in-band information, there must not be long sequences of identical symbols, such as ones or zeroes. For binary PCM systems, the density of 1-symbols is called ''ones-density''.<ref>Stallings, William, , December 1984, Vol. 22, No. 12, ] ]</ref> | PCM can be either ] (RZ) or ] (NRZ). For a NRZ system to be synchronized using in-band information, there must not be long sequences of identical symbols, such as ones or zeroes. For binary PCM systems, the density of 1-symbols is called ''ones-density''.<ref>Stallings, William, , December 1984, Vol. 22, No. 12, ] ]</ref> | ||
Ones-density is often controlled using precoding techniques such as ] encoding, where the PCM code is expanded into a slightly longer code with a guaranteed bound on ones-density before modulation into the channel. In other cases, extra ]s are added into the stream, which guarantees at least occasional symbol transitions. | Ones-density is often controlled using precoding techniques such as ] encoding, where the PCM code is expanded into a slightly longer code with a guaranteed bound on ones-density before modulation into the channel. In other cases, extra ]s are added into the stream, which guarantees at least occasional symbol transitions. | ||
Another technique used to control ones-density is the use of a ] on the data, which will tend to turn the data stream into a stream that looks ], but where the data can be recovered exactly by a complementary descrambler. In this case, long runs of zeroes or ones are still possible on the output but are considered unlikely enough to allow reliable synchronization. | Another technique used to control ones-density is the use of a ] on the data, which will tend to turn the data stream into a stream that looks ], but where the data can be recovered exactly by a complementary descrambler. In this case, long runs of zeroes or ones are still possible on the output but are considered unlikely enough to allow reliable synchronization. | ||
In other cases, the long term DC value of the modulated signal is important, as building up a ] will tend to move communications circuits out of their operating range. In this case, special measures are taken to keep a count of the cumulative DC bias and to modify the codes if necessary to make the DC bias always tend back to zero. | In other cases, the long term DC value of the modulated signal is important, as building up a ] will tend to move communications circuits out of their operating range. In this case, special measures are taken to keep a count of the cumulative DC bias and to modify the codes if necessary to make the DC bias always tend back to zero. |
Latest revision as of 14:50, 19 November 2024
Digital representation of sampled analog signals "PCM" redirects here. For other uses, see PCM (disambiguation).
Filename extension | .L16, .WAV, .AIFF, .AU, .PCM |
---|---|
Internet media type | audio/L16, audio/L8, audio/L20, audio/L24 |
Type code | "AIFF" for L16, none |
Magic number | Varies |
Type of format | Uncompressed audio |
Contained by | Audio CD, AES3, WAV, AIFF, AU, M2TS, VOB, and many others |
Open format? | Yes |
Free format? | Yes |
Passband modulation |
---|
Analog modulation |
Digital modulation |
Hierarchical modulation |
Spread spectrum |
See also |
Pulse-code modulation (PCM) is a method used to digitally represent analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the amplitude of the analog signal is sampled at uniform intervals, and each sample is quantized to the nearest value within a range of digital steps. Alec Reeves, Claude Shannon, Barney Oliver and John R. Pierce are credited with its invention.
Linear pulse-code modulation (LPCM) is a specific type of PCM in which the quantization levels are linearly uniform. This is in contrast to PCM encodings in which quantization levels vary as a function of amplitude (as with the A-law algorithm or the μ-law algorithm). Though PCM is a more general term, it is often used to describe data encoded as LPCM.
A PCM stream has two basic properties that determine the stream's fidelity to the original analog signal: the sampling rate, which is the number of times per second that samples are taken; and the bit depth, which determines the number of possible digital values that can be used to represent each sample.
History
Early electrical communications started to sample signals in order to multiplex samples from multiple telegraphy sources and to convey them over a single telegraph cable. The American inventor Moses G. Farmer conceived telegraph time-division multiplexing (TDM) as early as 1853. Electrical engineer W. M. Miner, in 1903, used an electro-mechanical commutator for time-division multiplexing multiple telegraph signals; he also applied this technology to telephony. He obtained intelligible speech from channels sampled at a rate above 3500–4300 Hz; lower rates proved unsatisfactory.
In 1920, the Bartlane cable picture transmission system used telegraph signaling of characters punched in paper tape to send samples of images quantized to 5 levels. In 1926, Paul M. Rainey of Western Electric patented a facsimile machine that transmitted its signal using 5-bit PCM, encoded by an opto-mechanical analog-to-digital converter. The machine did not go into production.
British engineer Alec Reeves, unaware of previous work, conceived the use of PCM for voice communication in 1937 while working for International Telephone and Telegraph in France. He described the theory and its advantages, but no practical application resulted. Reeves filed for a French patent in 1938, and his US patent was granted in 1943. By this time Reeves had started working at the Telecommunications Research Establishment.
The first transmission of speech by digital techniques, the SIGSALY encryption equipment, conveyed high-level Allied communications during World War II. In 1943 the Bell Labs researchers who designed the SIGSALY system became aware of the use of PCM binary coding as already proposed by Reeves. In 1949, for the Canadian Navy's DATAR system, Ferranti Canada built a working PCM radio system that was able to transmit digitized radar data over long distances.
PCM in the late 1940s and early 1950s used a cathode-ray coding tube with a plate electrode having encoding perforations. As in an oscilloscope, the beam was swept horizontally at the sample rate while the vertical deflection was controlled by the input analog signal, causing the beam to pass through higher or lower portions of the perforated plate. The plate collected or passed the beam, producing current variations in binary code, one bit at a time. Rather than natural binary, the grid of Goodall's later tube was perforated to produce a glitch-free Gray code and produced all bits simultaneously by using a fan beam instead of a scanning beam.
In the United States, the National Inventors Hall of Fame has honored Bernard M. Oliver and Claude Shannon as the inventors of PCM, as described in "Communication System Employing Pulse Code Modulation", U.S. patent 2,801,281 filed in 1946 and 1952, granted in 1956. Another patent by the same title was filed by John R. Pierce in 1945, and issued in 1948: U.S. patent 2,437,707. The three of them published "The Philosophy of PCM" in 1948.
The T-carrier system, introduced in 1961, uses two twisted-pair transmission lines to carry 24 PCM telephone calls sampled at 8 kHz and 8-bit resolution. This development improved capacity and call quality compared to the previous frequency-division multiplexing schemes.
In 1973, adaptive differential pulse-code modulation (ADPCM) was developed, by P. Cummiskey, Nikil Jayant and James L. Flanagan.
Digital audio recordings
Main articles: Digital audio and Digital recordingIn 1967, the first PCM recorder was developed by NHK's research facilities in Japan. The 30 kHz 12-bit device used a compander (similar to DBX Noise Reduction) to extend the dynamic range, and stored the signals on a video tape recorder. In 1969, NHK expanded the system's capabilities to 2-channel stereo and 32 kHz 13-bit resolution. In January 1971, using NHK's PCM recording system, engineers at Denon recorded the first commercial digital recordings.
In 1972, Denon unveiled the first 8-channel digital recorder, the DN-023R, which used a 4-head open reel broadcast video tape recorder to record in 47.25 kHz, 13-bit PCM audio. In 1977, Denon developed the portable PCM recording system, the DN-034R. Like the DN-023R, it recorded 8 channels at 47.25 kHz, but it used 14-bits "with emphasis, making it equivalent to 15.5 bits."
In 1979, the first digital pop album, Bop till You Drop, was recorded. It was recorded in 50 kHz, 16-bit linear PCM using a 3M digital tape recorder.
The compact disc (CD) brought PCM to consumer audio applications with its introduction in 1982. The CD uses a 44,100 Hz sampling frequency and 16-bit resolution and stores up to 80 minutes of stereo audio per disc.
Digital telephony
Main article: Digital telephonyThe rapid development and wide adoption of PCM digital telephony was enabled by metal–oxide–semiconductor (MOS) switched capacitor (SC) circuit technology, developed in the early 1970s. This led to the development of PCM codec-filter chips in the late 1970s. The silicon-gate CMOS (complementary MOS) PCM codec-filter chip, developed by David A. Hodges and W.C. Black in 1980, has since been the industry standard for digital telephony. By the 1990s, telecommunication networks such as the public switched telephone network (PSTN) had been largely digitized with very-large-scale integration (VLSI) CMOS PCM codec-filters, widely used in electronic switching systems for telephone exchanges, user-end modems and a wide range of digital transmission applications such as the integrated services digital network (ISDN), cordless telephones and cell phones.
Implementations
PCM is the method of encoding typically used for uncompressed digital audio.
- The 4ESS switch introduced time-division switching into the US telephone system in 1976, based on medium scale integrated circuit technology.
- LPCM is used for the lossless encoding of audio data in the compact disc Red Book standard (informally also known as Audio CD), introduced in 1982.
- AES3 (specified in 1985, upon which S/PDIF is based) is a particular format using LPCM.
- LaserDiscs with digital sound have an LPCM track on the digital channel.
- On PCs, PCM and LPCM often refer to the format used in WAV (defined in 1991) and AIFF audio container formats (defined in 1988). LPCM data may also be stored in other formats such as AU, raw audio format (header-less file) and various multimedia container formats.
- LPCM has been defined as a part of the DVD (since 1995) and Blu-ray (since 2006) standards. It is also defined as a part of various digital video and audio storage formats (e.g. DV since 1995, AVCHD since 2006).
- LPCM is used by HDMI (defined in 2002), a single-cable digital audio/video connector interface for transmitting uncompressed digital data.
- RF64 container format (defined in 2007) uses LPCM and also allows non-PCM bitstream storage: various compression formats contained in the RF64 file as data bursts (Dolby E, Dolby AC3, DTS, MPEG-1/MPEG-2 Audio) can be "disguised" as PCM linear.
Modulation
In the diagram, a sine wave (red curve) is sampled and quantized for PCM. The sine wave is sampled at regular intervals, shown as vertical lines. For each sample, one of the available values (on the y-axis) is chosen. The PCM process is commonly implemented on a single integrated circuit called an analog-to-digital converter (ADC). This produces a fully discrete representation of the input signal (blue points) that can be easily encoded as digital data for storage or manipulation. Several PCM streams could also be multiplexed into a larger aggregate data stream, generally for transmission of multiple streams over a single physical link. One technique is called time-division multiplexing (TDM) and is widely used, notably in the modern public telephone system.
Demodulation
The electronics involved in producing an accurate analog signal from the discrete data are similar to those used for generating the digital signal. These devices are digital-to-analog converters (DACs). They produce a voltage or current (depending on type) that represents the value presented on their digital inputs. This output would then generally be filtered and amplified for use.
To recover the original signal from the sampled data, a demodulator can apply the procedure of modulation in reverse. After each sampling period, the demodulator reads the next value and transitions the output signal to the new value. As a result of these transitions, the signal retains a significant amount of high-frequency energy due to imaging effects. To remove these undesirable frequencies, the demodulator passes the signal through a reconstruction filter that suppresses energy outside the expected frequency range (greater than the Nyquist frequency ).
Standard sampling precision and rates
Common sample depths for LPCM are 8, 16, 20 or 24 bits per sample.
LPCM encodes a single sound channel. Support for multichannel audio depends on file format and relies on synchronization of multiple LPCM streams. While two channels (stereo) is the most common format, systems can support up to 8 audio channels (7.1 surround) or more.
Common sampling frequencies are 48 kHz as used with DVD format videos, or 44.1 kHz as used in CDs. Sampling frequencies of 96 kHz or 192 kHz can be used on some equipment, but the benefits have been debated.
Limitations
The Nyquist–Shannon sampling theorem shows PCM devices can operate without introducing distortions within their designed frequency bands if they provide a sampling frequency at least twice that of the highest frequency contained in the input signal. For example, in telephony, the usable voice frequency band ranges from approximately 300 Hz to 3400 Hz. For effective reconstruction of the voice signal, telephony applications therefore typically use an 8000 Hz sampling frequency which is more than twice the highest usable voice frequency.
Regardless, there are potential sources of impairment implicit in any PCM system:
- Choosing a discrete value that is near but not exactly at the analog signal level for each sample leads to quantization error.
- Between samples no measurement of the signal is made; the sampling theorem guarantees non-ambiguous representation and recovery of the signal only if it has no energy at frequency fs/2 or higher (one half the sampling frequency, known as the Nyquist frequency); higher frequencies will not be correctly represented or recovered and add aliasing distortion to the signal below the Nyquist frequency.
- As samples are dependent on time, an accurate clock is required for accurate reproduction. If either the encoding or decoding clock is not stable, these imperfections will directly affect the output quality of the device.
Processing and coding
Some forms of PCM combine signal processing with coding. Older versions of these systems applied the processing in the analog domain as part of the analog-to-digital process; newer implementations do so in the digital domain. These simple techniques have been largely rendered obsolete by modern transform-based audio compression techniques, such as modified discrete cosine transform (MDCT) coding.
- Linear PCM (LPCM) is PCM with linear quantization.
- Differential PCM (DPCM) encodes the PCM values as differences between the current and the predicted value. An algorithm predicts the next sample based on the previous samples, and the encoder stores only the difference between this prediction and the actual value. If the prediction is reasonable, fewer bits can be used to represent the same information. For audio, this type of encoding reduces the number of bits required per sample by about 25% compared to PCM.
- Adaptive differential pulse-code modulation (ADPCM) is a variant of DPCM that varies the size of the quantization step, to allow further reduction of the required bandwidth for a given signal-to-noise ratio.
- Delta modulation is a form of DPCM that uses one bit per sample to indicate whether the signal is increasing or decreasing compared to the previous sample.
In telephony, a standard audio signal for a single phone call is encoded as 8,000 samples per second, of 8 bits each, giving a 64 kbit/s digital signal known as DS0. The default signal compression encoding on a DS0 is either μ-law (mu-law) PCM (North America and Japan) or A-law PCM (Europe and most of the rest of the world). These are logarithmic compression systems where a 12- or 13-bit linear PCM sample number is mapped into an 8-bit value. This system is described by international standard G.711.
Where circuit costs are high and loss of voice quality is acceptable, it sometimes makes sense to compress the voice signal even further. An ADPCM algorithm is used to map a series of 8-bit μ-law or A-law PCM samples into a series of 4-bit ADPCM samples. In this way, the capacity of the line is doubled. The technique is detailed in the G.726 standard.
Audio coding formats and audio codecs have been developed to achieve further compression. Some of these techniques have been standardized and patented. Advanced compression techniques, such as modified discrete cosine transform (MDCT) and linear predictive coding (LPC), are now widely used in mobile phones, voice over IP (VoIP) and streaming media.
Encoding for serial transmission
Main article: Line code See also: T-carrier and E-carrierPCM can be either return-to-zero (RZ) or non-return-to-zero (NRZ). For a NRZ system to be synchronized using in-band information, there must not be long sequences of identical symbols, such as ones or zeroes. For binary PCM systems, the density of 1-symbols is called ones-density.
Ones-density is often controlled using precoding techniques such as run-length limited encoding, where the PCM code is expanded into a slightly longer code with a guaranteed bound on ones-density before modulation into the channel. In other cases, extra framing bits are added into the stream, which guarantees at least occasional symbol transitions.
Another technique used to control ones-density is the use of a scrambler on the data, which will tend to turn the data stream into a stream that looks pseudo-random, but where the data can be recovered exactly by a complementary descrambler. In this case, long runs of zeroes or ones are still possible on the output but are considered unlikely enough to allow reliable synchronization.
In other cases, the long term DC value of the modulated signal is important, as building up a DC bias will tend to move communications circuits out of their operating range. In this case, special measures are taken to keep a count of the cumulative DC bias and to modify the codes if necessary to make the DC bias always tend back to zero.
Many of these codes are bipolar codes, where the pulses can be positive, negative or absent. In the typical alternate mark inversion code, non-zero pulses alternate between being positive and negative. These rules may be violated to generate special symbols used for framing or other special purposes.
Nomenclature
The word pulse in the term pulse-code modulation refers to the pulses to be found in the transmission line. This perhaps is a natural consequence of this technique having evolved alongside two analog methods, pulse-width modulation and pulse-position modulation, in which the information to be encoded is represented by discrete signal pulses of varying width or position, respectively. In this respect, PCM bears little resemblance to these other forms of signal encoding, except that all can be used in time-division multiplexing, and the numbers of the PCM codes are represented as electrical pulses.
See also
- Beta encoder
- Equivalent pulse code modulation noise
- Signal-to-quantization-noise ratio (SQNR), one method of measuring quantization error
Explanatory notes
- Among the first recordings was Uzu: The World Of Stomu Yamash'ta 2 by Stomu Yamashta.
- The first recording with this new system was recorded in Tokyo during April 24–26, 1972.
- Other methods exist such as pulse-density modulation used also on Super Audio CD.
- Some systems use digital filtering to remove some of the aliasing, converting the signal from digital to analog at a higher sample rate such that the analog anti-aliasing filter is much simpler. In some systems, no explicit filtering is done at all; as it is impossible for any system to reproduce a signal with infinite bandwidth, inherent losses in the system compensate for the artifacts — or the system simply does not require much precision.
- Quantization error swings between -q/2 and q/2. In the ideal case (with a fully linear ADC and signal level >> q) it is uniformly distributed over this interval, with zero mean and variance of q/12.
- A slight difference between the encoding and decoding clock frequencies is not generally a major concern; a small constant error is not noticeable. Clock error does become a major issue if the clock contains significant jitter, however.
References
- ^ Alvestrand, Harald Tveit; Salsman, James (May 1999). "RFC 2586 – The Audio/L16 MIME content type". The Internet Society. doi:10.17487/RFC2586. Retrieved March 16, 2010.
{{cite journal}}
: Cite journal requires|journal=
(help) - ^ Casner, S. (March 2007). "RFC 4856 – Media Type Registration of Payload Formats in the RTP Profile for Audio and Video Conferences – Registration of Media Type audio/L8". The IETF Trust. doi:10.17487/RFC4856. Retrieved March 16, 2010.
{{cite journal}}
: Cite journal requires|journal=
(help) - ^ Bormann, C.; Casner, S.; Kobayashi, K.; Ogawa, A. (January 2002). "RFC 3190 – RTP Payload Format for 12-bit DAT Audio and 20- and 24-bit Linear Sampled Audio". The Internet Society. doi:10.17487/RFC3190. Retrieved March 16, 2010.
{{cite journal}}
: Cite journal requires|journal=
(help) - "Audio Media Types". Internet Assigned Numbers Authority. Retrieved March 16, 2010.
- ^ "Linear Pulse Code Modulated Audio (LPCM)". Library of Congress. April 19, 2022. Retrieved September 5, 2022.
- Noll, A. Michael (1997). Highway of Dreams: A Critical View Along the Information Superhighway. Telecommunications (Revised ed.). Mahwah, NJ: Erlbaum. p. 50. ISBN 978-0-8058-2557-2.
- Leibson, Steven (September 7, 2021). "A Brief History of the Single-Chip DSP, Part I". EEJournal. Retrieved September 19, 2024.
- Barrett, G. Douglas (2023). Experimenting the Human: Art, Music, and the Contemporary Posthuman. Chicago London: The University of Chicago Press. p. 102. ISBN 978-0-226-82340-9.
- "The Bartlane Transmission System". DigicamHistory.com. Archived from the original on February 10, 2010. Retrieved January 7, 2010.
- U.S. patent number 1,608,527; also see p. 8, Data conversion handbook, Walter Allan Kester, ed., Newnes, 2005, ISBN 0-7506-7841-0.
- ^ John Vardalas (June 2013), Pulse Code Modulation: It all Started 75 Years Ago with Alec Reeves, IEEE
- US 2272070
- Porter, Arthur (2004). So Many Hills to Climb. Beckham Publications Group. ISBN 9780931761188.
- Sears, R. W. (January 1948). Electron Beam Deflection Tube for Pulse Code Modulation. Vol. 27. Bell Labs. pp. 44–57. Retrieved May 14, 2017.
{{cite book}}
:|work=
ignored (help) - Goodall, W. M. (January 1951). Television by Pulse Code Modulation. Vol. 30. Bell Labs. pp. 33–49. Retrieved May 14, 2017.
{{cite book}}
:|work=
ignored (help) - "Bernard Oliver". National Inventor's Hall of Fame. Archived from the original on December 5, 2010. Retrieved February 6, 2011.
- "Claude Shannon". National Inventor's Hall of Fame. Archived from the original on December 6, 2010. Retrieved February 6, 2011.
- "National Inventors Hall of Fame announces 2004 class of inventors". Science Blog. February 11, 2004. Retrieved February 6, 2011.
- B. M. Oliver; J. R. Pierce & C. E. Shannon (November 1948). "The Philosophy of PCM". Proceedings of the IRE. 36 (11): 1324–1331. doi:10.1109/JRPROC.1948.231941. ISSN 0096-8390. S2CID 51663786.
- P. Cummiskey, N. S. Jayant, and J. L. Flanagan, "Adaptive quantization in differential PCM coding of speech," Bell Syst. Tech. J., vol. 52, pp. 1105–1118, Sept. 1973.
- ^ Thomas Fine (2008). "The dawn of commercial digital recording" (PDF). ARSC Journal. 39 (1): 1–17.
- Roger Nichols. "I Can't Keep Up With All The Formats II". Archived from the original on October 20, 2002.
The Ry Cooder Bop Till You Drop album was the first digitally recorded pop album
- ^ Allstot, David J. (2016). "Switched Capacitor Filters" (PDF). In Maloberti, Franco; Davies, Anthony C. (eds.). A Short History of Circuits and Systems: From Green, Mobile, Pervasive Networking to Big Data Computing. IEEE Circuits and Systems Society. pp. 105–110. ISBN 9788793609860. Archived from the original (PDF) on September 30, 2021. Retrieved November 29, 2019.
- ^ Floyd, Michael D.; Hillman, Garth D. (October 8, 2018) . "Pulse-Code Modulation Codec-Filters". The Communications Handbook (2nd ed.). CRC Press. pp. 26–1, 26–2, 26–3. ISBN 9781420041163.
- Cambron, G. Keith (October 17, 2012). Global Networks: Engineering, Operations and Design. John Wiley & Sons. p. 345.
- Blu-ray Disc Association (March 2005), White paper Blu-ray Disc Format – 2.B Audio Visual Application Format Specifications for BD-ROM (PDF), retrieved July 26, 2009
- "DVD Technical Notes (DVD Video – "Book B") – Audio data specifications". July 21, 1996. Retrieved March 16, 2010.
- Jim Taylor. "DVD Frequently Asked Questions (and Answers) – Audio details of DVD-Video". Retrieved March 20, 2010.
- "How DV works". Archived from the original on December 6, 2007. Retrieved March 21, 2010.
- "AVCHD Information Website – AVCHD format specification overview". Retrieved March 21, 2010.
- EBU (July 2009), EBU Tech 3306 – MBWF / RF64: An Extended File Format for Audio (PDF), archived from the original (PDF) on November 22, 2009, retrieved January 19, 2010
- Mostafa, Mohamed; Kumar, Rajesh (May 2001). "RFC 3108 – Conventions for the use of the Session Description Protocol (SDP) for ATM Bearer Connections". doi:10.17487/RFC3108. Retrieved March 16, 2010.
{{cite journal}}
: Cite journal requires|journal=
(help) - "PCM, Pulse Code Modulated Audio". Library of Congress. April 6, 2022. Retrieved September 5, 2022.
- Christopher, Montgometry. "24/192 Music Downloads, and why they do not make sense". Chris "Monty" Montgomery. Archived from the original on September 6, 2014. Retrieved March 16, 2013.
- https://www.its.bldrdoc.gov/fs-1037/dir-039/_5829.htm
- Stallings, William, Digital Signaling Techniques, December 1984, Vol. 22, No. 12, IEEE Communications Magazine
Further reading
- Franklin S. Cooper; Ignatius Mattingly (1969). "Computer-controlled PCM system for investigation of dichotic speech perception". Journal of the Acoustical Society of America. 46 (1A): 115. Bibcode:1969ASAJ...46..115C. doi:10.1121/1.1972688.
- Ken C. Pohlmann (1985). Principles of Digital Audio (2nd ed.). Carmel, Indiana: Sams/Prentice-Hall Computer Publishing. ISBN 978-0-672-22634-2.
- D. H. Whalen, E. R. Wiley, Philip E. Rubin, and Franklin S. Cooper (1990). "The Haskins Laboratories pulse code modulation (PCM) system". Behavior Research Methods, Instruments, and Computers. 22 (6): 550–559. doi:10.3758/BF03204440.
{{cite journal}}
: CS1 maint: multiple names: authors list (link) - Bill Waggener (1995). Pulse Code Modulation Techniques (1st ed.). New York, NY: Van Nostrand Reinhold. ISBN 978-0-442-01436-0.
- Bill Waggener (1999). Pulse Code Modulation Systems Design (1st ed.). Boston, MA: Artech House. ISBN 978-0-89006-776-5.
External links
- PCM description on MultimediaWiki
- Ralph Miller and Bob Badgley invented multi-level PCM independently in their work at Bell Labs on SIGSALY: U.S. patent 3,912,868 filed in 1943: N-ary Pulse Code Modulation.
- Information about PCM: A description of PCM with links to information about subtypes of this format (for example linear pulse-code modulation), and references to their specifications.
- Summary of LPCM – Contains links to information about implementations and their specifications.
- How to control internal/external hardware using Microsoft's Media Control Interface – Contains information about, and specifications for the implementation of LPCM used in WAV files.
- RFC 4856 – Media Type Registration of Payload Formats in the RTP Profile for Audio and Video Conferences – audio/L8 and audio/L16 (March 2007)
- RFC 3190 – RTP Payload Format for 12-bit DAT Audio and 20- and 24-bit Linear Sampled Audio (January 2002)
- RFC 3551 – RTP Profile for Audio and Video Conferences with Minimal Control – L8 and L16 (July 2003)
High-definition (HD) | |
---|---|
Concepts | |
Resolutions | |
Analog broadcast (All defunct) | |
Digital broadcast | |
Audio | |
Filming and storage | |
HD media and compression | |
Connectors | |
Deployments |
Line coding (digital baseband transmission) | ||
---|---|---|
Main articles | ||
Basic line codes | ||
Extended line codes | ||
Optical line codes | ||