Title: ITU-T coders for wideband, superwideband, and fullband speech communication [Series Editorial]
Authors: Richard V. Cox, Simao Ferraz De Campos Neto, Claude Lamblin, Mostafa Hashem Sherif
Published: 2009-10-02
Link: https://ieeexplore.ieee.org/document/5273816
Abstract
This section of the magazine presents recent algorithms developed by the ITU to provide high quality coding beyond traditional narrowband telephony. Speech coders can be characterized by their bit rate, quality, complexity, and delay. Typical applications fall into one of two categories, one-way and two-way. The first includes storage applications such as telephone answering systems, streaming, multimedia delivery, and push-to-talk calls. The second includes realtime communications such as two person phone calls and conference calls. In this latter category, if the delay is too large - exceeding 300 ms round-trip - humans have difficulty communicating, while for storage and playback operations delay is not a factor. The complexity of a speech coder is one of the main contributing factors to its cost and energy usage. Complexity is most often measured in terms of memory usage (both RAM and ROM) and the number of instructions executed per second. All applications are sensitive to cost, and many are sensitive to energy usage as well. The desired bit rate is determined by channel capacity or storage capacity, depending on the application.
ITU Article from 2009
- two person phone calls and conference calls. In this latter category, if the delay is too large â exceeding 300 ms round-trip â humans have difficulty communicating
- For decades the International Telecommunication Union (ITU) standardized speech coders having âtelephone bandwidth,â meaning an audio bandwidth of 300â3400 Hz with a sampling rate of 8 kHz
- first speech coding algorithm, G.711
- If the fidelity of a new coder is judged comparable, it is called âtoll quality.â
- telephone bandwidth is referred to as ânarrowbandâ
- first extension of the audio bandwidth, standardized about two decades ago, was ITU-T Recommendation G.722 having 50â7000 Hz bandwidth and a sampling rate of 16 kHz
- dubbed âwidebandâ and was intended for video teleconference applications
- recent coding algorithms have been developed for so-called superwideband (50â14,000 Hz) and fullband (20â20,000 Hz) to take advantage of the growth in available digital telecommunications bandwidth