Speech Recognition System Using Wav2vec Model (Punjabi Language)

Yaseen, Kashif; Zafar, Adeel; Ali, Awais; Supervised by Dr. Shibli Nisar

DSpace Home
→
E-Theses
→
MCS
→
Electrical Engineering
→
BETE
→
View Item

dc.contributor.author	Yaseen, Kashif
dc.contributor.author	Zafar, Adeel
dc.contributor.author	Ali, Awais
dc.contributor.author	Supervised by Dr. Shibli Nisar
dc.date.accessioned	2025-02-11T14:09:27Z
dc.date.available	2025-02-11T14:09:27Z
dc.date.issued	2023-06
dc.identifier.other	PTE-339
dc.identifier.uri	http://10.250.8.41:8080/xmlui/handle/123456789/49718
dc.description.abstract	Speech Recognition presents natural phenomena for the communication among man and machine. The purpose of Speech Recognition speech system is to convert the sequence of sound units in the form of text description. Technology for understanding spoken words by computers has improved a lot recently. But for languages like Punjabi, it's still hard for computers to understand speech well. The complexity of Punjabi phonology, compounded by variations in accent and pronunciation, poses substantial challenges for automatic speech recognition systems. As a result, the need for a robust Punjabi sound recognition system has become increasingly evident. Our project aims to solve this problem by using a special computer model called Wav2Vec. We train this model to understand Punjabi sounds better, so it can transcribe speech more accurately. So far, no work has been done in the field of Punjabi speech recognition system. Our approach involves pre-processing Punjabi audio data, training the Wav2Vec model, and fine-tuning it using transfer learning techniques. The final output is presented through a user-friendly Graphical User Interface (GUI), illustrating the outcomes of our Punjabi sound recognition system in a clear and accessible manner, facilitating easy interaction with transcribed speech for users of varying technical abilities. In this paper, the focus is on the development of the spontaneous speech model for the recognition of the Punjabi language. The GUI for Punjabi speech model also has been created and tested. The recognition accuracy is good for Punjabi sentences and much higher for Punjabi words. The python programming are used to build a speech model for Punjabi live speech.	en_US
dc.language.iso	en	en_US
dc.publisher	MCS	en_US
dc.title	Speech Recognition System Using Wav2vec Model (Punjabi Language)	en_US
dc.type	Project Report	en_US