Title: ITU-T coders for wideband, superwideband, and fullband speech communication [Series Editorial]
Authors: Richard V. Cox, Simao Ferraz De Campos Neto, Claude Lamblin, Mostafa Hashem Sherif
Published: 2009-10-02
Link: https://ieeexplore.ieee.org/document/5273816

Abstract

This section of the magazine presents recent algorithms developed by the ITU to provide high quality coding beyond traditional narrowband telephony. Speech coders can be characterized by their bit rate, quality, complexity, and delay. Typical applications fall into one of two categories, one-way and two-way. The first includes storage applications such as telephone answering systems, streaming, multimedia delivery, and push-to-talk calls. The second includes realtime communications such as two person phone calls and conference calls. In this latter category, if the delay is too large - exceeding 300 ms round-trip - humans have difficulty communicating, while for storage and playback operations delay is not a factor. The complexity of a speech coder is one of the main contributing factors to its cost and energy usage. Complexity is most often measured in terms of memory usage (both RAM and ROM) and the number of instructions executed per second. All applications are sensitive to cost, and many are sensitive to energy usage as well. The desired bit rate is determined by channel capacity or storage capacity, depending on the application.


ITU Article from 2009

  • two person phone calls and conference calls. In this latter category, if the delay is too large — exceeding 300 ms round-trip — humans have difficulty communicating
  • For decades the International Telecommunication Union (ITU) standardized speech coders having “telephone bandwidth,” meaning an audio bandwidth of 300–3400 Hz with a sampling rate of 8 kHz
  • first speech coding algorithm, G.711
  • If the fidelity of a new coder is judged comparable, it is called “toll quality.”
  • telephone bandwidth is referred to as “narrowband”
  • first extension of the audio bandwidth, standardized about two decades ago, was ITU-T Recommendation G.722 having 50–7000 Hz bandwidth and a sampling rate of 16 kHz
    • dubbed “wideband” and was intended for video teleconference applications
  • recent coding algorithms have been developed for so-called superwideband (50–14,000 Hz) and fullband (20–20,000 Hz) to take advantage of the growth in available digital telecommunications bandwidth

Past ITU-T Speech Coding Recommendations