Committed to connecting the world

Girls in ICT

Question 5

​​Artificial intelligence-enabled multimedia applications

(Continuation of Question 5/16)

Motivation

The recent success of Artificial Intelligence (AI) in various applications has raised study and utilization of AI technology to a new height. AI has been the apex technology of the information age. One of the most exciting aspects of the AI inflection is that “real-world” use cases abound. At the same time, deep-learning enabled advances in computer vision and technologies such as natural language processing are dramatically improving the quality of people’s work and life.

At present, the ecological pattern of AI has been established gradually. In future years, specialized intelligent applications will be the main potential area for the future development of AI. No matter whether it is a specialized or generalized application, the AI studies will focus on analys​ing data at three basic levels: computing layer (base), algorithm layer (technology) and application layer. AI is not just “tech for tech”. Where large data sets are combined with powerful enough technology, value is being created and competitive advantage is being gained.

Multimedia has become the pioneer, and the concept of “AI-enabled Multimedia” as well as “Intelligent Multimedia” has already come up. Scientists, engineers all over the world are delving into some of the most exciting areas such as computer vision and speech technologies. Computers are being taught to understand video, augmenting reality to guide field technicians when operations get complex, helping computers recognize people, detect sentiment and speak with emotion, and enrich video with metadata extracted from it.

AI-enabled multimedia applications are booming, but focused studies are far behind. Emerging technologies brings not only new opportunities, but also new challenges as well as new demands. Taking multimedia data as an example, image, video and sound data are the fuel of AI applications such as recognition, sentiment classification, etc. However, huge volume multimedia data does not indicate high quality labelling data that AI applications could benefit. If no guidelines or standards of multimedia format, labelling are developed, multimedia data collected and labelled by company A could not be used in company B. This results in huge resource waste and prevents the data flow, which can severely hinder the development of the AI industry​.

This Question focusses on artificial intelligence-enabled multimedia applications, 1) to identify challenges facing the deployment of AI-enabled multimedia applications, 2) to analys​​e the impact of AI technologies in standards for multimedia applications, and 3) to identify evaluation and assessment specifications of applications, algorithms and data structures for standards in AI-enabled multimedia applications, in order to boost and innovate the development of multimedia as well as AI industry.

Study items
Study items to be considered include, but are not limited to:
Tasks
​Tasks include, but are not limited to:
An up-to-date status of work under this Question is contained in the SG16 work programme
(https://www.itu.int/ITU-T/workprog/wp_search.aspx?sp=16&q=5/16).​

Relationships

Recommendations:
Questions:
Study groups
Other bodies