We will address the complex problem of quality assessment of online speech for online video meeting systems. This will improve network resource allocation and saves critical time and resources including energy consumption. The quality assessment will be performed by the state-of-the-art large language models that are multi-modal, and hence we require computational resource. That is why we are applying for this project.