Abstract: This study proposes an innovative speech translation method based on Pix2PixGAN, which maps the Mel spectrograms of speech produced by deaf individuals to those of normal-hearing individuals ...
This repository contains the implementation of MQGAN for audio synthesis. The project is structured to support the entire workflow, from data preparation to model deployment.
Abstract: In this study, we explore the potential of utilizing transformers in a limited data setting. We propose a teacher-student framework to train transformers for classifying lung diseases using ...