SUPR
DNN based Speech Quality Assessment
Dnr:

NAISS 2024/22-1529

Type:

NAISS Small Compute

Principal Investigator:

Saikat Chatterjee

Affiliation:

Kungliga Tekniska högskolan

Start Date:

2024-11-20

End Date:

2025-12-01

Primary Classification:

20205: Signal Processing

Webpage:

Allocation

Abstract

Our proposal is to develop a deep neural network (DNN) based method that provides a posterior distribution of mean-opinion-score (MOS) for an input speech signal. The DNN outputs statistical parameters of the posterior distribution. The proposed method will be referred to as deep posterior MOS (DeePMOS). For robust training of DeePMOS, we will use a combination of maximum-likelihood learning, stochastic gradient noise, and a student-teacher learning setup. Using the mean of the posterior as a point estimate, we will finally evaluate standard performance measures of the proposed DeePMOS. The results will be published in standard venues like Interspeech, ICASSP, and IEEE TASLP journals. For the proposed project, we require computational resource, and hence applying here for the resource.