Recommendation ITU-T P.863.2 (05/2024) Extension of ITU-T P.863 for multidimensional assessment of degradations in telephony speech signals up to fullband
Summary
History
FOREWORD
Table of Contents
Introduction
1 Scope
2 References
3 Definitions
     3.1 Terms defined elsewhere
4 Abbreviations and acronyms
5 Conventions
6 Overview of the models
     6.1 Model characteristics
          6.1.1 Input signal characteristics
          6.1.2 Model output
          6.1.3 Scoring of background noise
          6.1.4 Scoring of temporal clipping
7 Comparison between objective and subjective scores
8 Speech material
     8.1 Input or reference speech material
     8.2 Degraded speech material
     8.3 Special requirements for acoustically captured speech material
     8.4 Acoustical insertion or capture for loudspeaker phones
9 Description of the model algorithms
     9.1 Colouration model
     9.2 Discontinuity model
     9.3 Noisiness model
          9.3.1 Active or inactive decision
          9.3.2 Background noise
          9.3.3 Noise on speech
          9.3.4 Active speech level factor
     9.4 Sub-optimum loudness model
          9.4.1 Gain variation indicator
Annex A  Subjective test method for obtaining perceptual dimension scores
Annex B  Conformity data and tests
     B.1 List of files provided for conformity validation
     B.2 Conformity tests
          B.2.1 Conformity data sets
          B.2.2 Conformity requirements
     B.3 Digital attachments
Appendix I  Reporting of the performance results for the model algorithms based on the correlation, Root mean square error (RMSE) and RMSE* metrics
Appendix II  Test instructions
Bibliography