Back to Projects
09

AI Real-Time Audio Translation

2024
React
Tailwind CSS
Flask
PyTorch
AI Real-Time Audio Translation

About This Project

Developing a generative AI project using AMD PC AI technologies for the international AMD Pervasive AI Developer Contest with Hackster, in collaboration with a team of IBM Academy: Advance AI mentors from Infinite Learning. Open-Source pre-trained models used in this projects are : OpenAI Whisper as Automatic Speech Recognition (ASR) Pre-trained Model, MarianMT as Machine Translation (MT) Pre-trained Model, tacotron2-DDC as Text to Speech (TTS) Pre-trained Model, and HiFi-GAN as vocoder Model. Device used in this development is: Minisforum Venus UM790 Pro with AMD Ryzen™ 9 with Ryzen 9 7940HS as a hardware sponsorship.