Recommendation ITU-T H.862.5 (06/2021) Emotion enabled multimodal user interface based on artificial neural networks
Summary
History
FOREWORD
Table of Contents
1 Scope
2 References
3 Definitions
     3.1 Terms defined elsewhere
     3.2 Terms defined in this Recommendation
4 Abbreviations and acronyms
5 Conventions
6 Functional architecture
     6.1 Architectural framework
7 Functional entities
     7.1 Multimedia processing – pre-processing module
          7.1.1 Text processing
               7.1.1.1 Text data augmentation
               7.1.1.2 Person attribute recognition for text
               7.1.1.3 Topic cluster recognition
               7.1.1.4 Document summarization
               7.1.1.5 Named entity recognition
               7.1.1.6 Sentence splitter
               7.1.1.7 Keyword cluster
          7.1.2 Speech processing
               7.1.2.1 Speech data augmentation
               7.1.2.2 Person attributes recognition for speech
               7.1.2.3 Noise reduction
               7.1.2.4 Voice separation
               7.1.2.5 Voice activity detection
          7.1.3 Image processing
               7.1.3.1 Image data augmentation
               7.1.3.2 Person attributes recognition for image
               7.1.3.3 Noise reduction
               7.1.3.4 Object detection
               7.1.3.5 Face detection
               7.1.3.6 Gesture recognition
     7.2 Emotion analysis neural network – unimodal network
          7.2.1 Text analysis
               7.2.1.1 Trainer for the text analysis
               7.2.1.2 Text emotion inference engine
          7.2.2 Speech analysis
               7.2.2.1 Trainer for speech analysis
               7.2.2.2 Speech emotion inference engine
          7.2.3 Image analysis
               7.2.3.1 Trainer for image analysis
               7.2.3.2 Image emotion inference engine
     7.3 Emotion analysis neural network – multimodal network
          7.3.1 State vector integration
          7.3.2 Trainer for multimodal network
          7.3.3 Multimodal emotion inference engine
     7.4 Emotion expansion module – complex emotion generation
          7.4.1 Artificial emotion model
          7.4.2 Trainer for complex emotion generation
          7.4.3 Complex emotion inference engine
     7.5 Multimedia knowledge database
8 Application to multimodal emotion analysis
Appendix I  Survey on SDO's activity regarding typical emotional service
     I.1 SDO activities on emotion
          I.1.1 W3C
          I.1.2 ISO/IEC JTC 1/SC 35
          I.1.3 OMA
Bibliography
<\pre>