NUST Institutional Repository

Speech Recognition System Using Wav2vec Model (Punjabi Language)

dc.contributor.author Yaseen, Kashif
dc.contributor.author Zafar, Adeel
dc.contributor.author Ali, Awais
dc.contributor.author Supervised by Dr. Shibli Nisar
dc.date.accessioned 2025-02-11T14:09:27Z
dc.date.available 2025-02-11T14:09:27Z
dc.date.issued 2023-06
dc.identifier.other PTE-339
dc.identifier.uri http://10.250.8.41:8080/xmlui/handle/123456789/49718
dc.description.abstract Speech recognition provides a natural means of communication between humans and machines. The purpose of a speech recognition system is to convert a sequence of sound units into text. Technology for recognizing spoken words has improved considerably in recent years, but for languages such as Punjabi it remains difficult for computers to recognize speech accurately. The complexity of Punjabi phonology, compounded by variations in accent and pronunciation, poses substantial challenges for automatic speech recognition systems. As a result, the need for a robust Punjabi speech recognition system has become increasingly evident. Our project addresses this problem using the Wav2Vec model, which we train on Punjabi speech so that it transcribes speech more accurately. So far, no work has been done in the field of Punjabi speech recognition systems. Our approach involves pre-processing Punjabi audio data, training the Wav2Vec model, and fine-tuning it using transfer learning techniques. The final output is presented through a user-friendly Graphical User Interface (GUI), which displays the transcriptions clearly and makes them easy to interact with for users of varying technical abilities. In this paper, the focus is on developing a spontaneous speech model for recognition of the Punjabi language. The GUI for the Punjabi speech model has also been created and tested. The recognition accuracy is good for Punjabi sentences and much higher for Punjabi words. Python is used to build the speech model for live Punjabi speech. en_US
dc.language.iso en en_US
dc.publisher MCS en_US
dc.title Speech Recognition System Using Wav2vec Model (Punjabi Language) en_US
dc.type Project Report en_US

