Visual, audio and signal coding
(Continuation of Question 6/16 and Question 7/16)
MotivationThe goal of this Question is to produce Recommendations for visual, speech, audio and signal coding methods appropriate for conversational (e.g. videoconferencing and video telephony) and non-conversational (e.g. multimedia streaming, broadcast TV, IPTV, file download, media storage/playback, remote screen display, digital cinema, or virtual & augmented reality) audiovisual services and other services. The Question is to focus primarily on the coding of visual signals, including the compression of:
- video sequences;
- still images;
- graphics;
- stereoscopic, multi-view, depth maps, and free-viewpoint visual information;
- light fields, point clouds, and volumetric imagery;
- computer displays;
- medical imaging;
- 360 degree/panoramic/spherical-view video sequences;
- video and images for virtual and augmented reality.
This Question will primarily focus on the maintenance and extension of existing video and still-image coding Recommendations, and laying the ground for new Recommendations using advanced techniques to significantly improve the trade-offs between bit rate, quality, delay, and algorithm complexity. The Question will also be responsible for maintenance and further developments in speech, audio coding and other signal coding and network-based signal processing. Video, still-image, speech, audio and other signal coding standards will be developed with sufficient flexibility to accommodate a diverse number of transport types (Internet, LAN, 5G and other mobile networks, ITU-T H.222.0, etc.).
Study itemsStudy items to be considered include, but are not limited to:
- new coding methods in order to achieve the following objectives:
- improvements in compression efficiency;
- robust operation in error/loss-prone environments (e.g. non-guaranteed-bandwidth packet networks or mobile wireless communication);
- reduction of real-time delay, complexity, and of channel acquisition time and random access latency;
- organization of the compressed data format to support packetization and streaming;
- development of supplemental enhancement information to accompany source data for enabling enhanced functionality in application environments;
- study and specification of data for annotation, indexing, and searching;
- techniques to permit networks or terminals to adjust bit rates efficiently;
- techniques for object coding and multi-view operation;
- techniques to permit terminals to rapidly adjust the region-of-interest and/or field of view of video stream playback;
- techniques for efficient coding of 360 degree/panoramic/spherical-view video sequences, including those formed by stitching video sequences from multiple cameras with projection/rendering warping;
- techniques for efficient coding of video, images, audio, point clouds, and other signals for virtual and augmented reality, navigation, medical, and other applications;
- techniques for efficient compressed-digital to compressed-digital processing (including transcoding);
- the impact of colorimetry, video and image quality assessment, and quality control requirements on video and image codec development;
- computer graphics compression;
- security aspects that directly affect video, speech, audio and signal coding (including watermarking techniques);
- coordination of video, still-image, speech, audio and signal coding matters not addressed in other coding Questions with other ITU study groups and other bodies;
- harmonization of video, still-image, speech, audio and signal coding activities with other standard development organizations (SDOs);
- enhancements to existing multimedia systems Recommendations including the addition of advanced audio and visual coding (e.g. ITU-T H.26x and G.72x extensions and beyond).
TasksTasks include, but are not limited to:
An up-to-date status of work under this Question is found in the SG16 work programme
(
https://www.itu.int/ITU-T/workprog/wp_search.aspx?sp=16&q=6/16).
Relationships
Recommendations
- ITU‑T H.300 sub-series systems Recommendations
- ITU‑T H.222.0, H.241, H.245 and H.248-series
Questions
- Questions 1/16, 6/16, 8/16, 11/16, 13/16, 28/16
Study groups
- ITU‑T SGs 9, 11, 12, 13
- ITU‑R SG6
Other bodies
- ISO/IEC JTC 1/SC 29 WG 1 (JPEG, JBIG) and WG 11 (MPEG) on video, image, speech, and audio coding
- IETF, DVB, ATSC, ARIB, 3GPP, EBU, SCTE, SMPTE, MC-IF, MEF, VESA, W3C, CTA, IEC TC 100