NUST Institutional Repository

An Augmentative Phonation and Articulation System using Advanced Signal Processing Techniques as an Alternative Communication Device

Show simple item record

dc.contributor.author Shafiq, Uzma
dc.date.accessioned 2023-07-27T04:37:02Z
dc.date.available 2023-07-27T04:37:02Z
dc.date.issued 2023
dc.identifier.other 330042
dc.identifier.uri http://10.250.8.41:8080/xmlui/handle/123456789/35214
dc.description Supervised by: Prof. Dr. Javaid Iqbal en_US
dc.description.abstract Speech recognition systems utilize acoustic signals collected using a microphone. But in individuals with speech disorders such as who have undergone laryngectomy or have vocal cord paralysis, it is not possible to collect the acoustic signals. For rehabilitation of such individuals an alternate method of communication has to be devised which is independent of acoustic signals. Facial muscles of such individuals remain intact and can be thus used for speech recognition purposes. The limited research on subvocal voice recognition demands the need to develop robust methods which can aid in rehabilitation of the effected individuals. This study aims at filling the gaps left by the previous literature. The EMG signal filtration is carried out to denoise the signals using ana advanced signals processing technique called Variational mode decomposition (VMD). VMD decomposes the input signals into its sub signals in different frequency spectra. Each frequency spectrum undergoes Iterative interval thresholding (IIT) using SIFT operator. These filtered signals are then used to extract crucial information from the signals using a novel feature extraction technique that utilizes the VMD method along with singular vector decomposition. These extracted features are utilized to classify the isolated words using Random Forest classifier. The results demonstrate the superiority of this technique, achieving an accuracy of 98.6% and 92% for a vocabulary set of 70 words and 96 words respectively. The final objective of this study was to develop a novel speech activity detection method using IMU signals as an alternate to EMG used previously in the literature. The proposed activity detection algorithm is carried out in four stages. The algorithm provides significantly better results as compared to EMG based activity detection method. The mean activation error rate (AER) for IMU based algorithm was 0.138 and for EMG based activity detection method was 0.275. These findings demonstrate the superiority of the proposed feature extraction and speech activity detection methods over alternative techniques. In conclusion, subvocal voice recognition is vital for rehabilitation, enabling silent communication and enhancing independence. Limited research and lack of IMU utilization pose challenges, but the thesis proposes a novel approach integrating spatiotemporal feature extraction and IMU data for improved accuracy and robustness. Results demonstrate the superiority of the proposed methods. en_US
dc.language.iso en en_US
dc.publisher School of Mechanical & Manufacturing Engineering (SMME), NUST en_US
dc.relation.ispartofseries SMME-TH-887;
dc.subject . Robust Classification, EMG Signals, Data Filtration, Augmentative Phonation, en_US
dc.title An Augmentative Phonation and Articulation System using Advanced Signal Processing Techniques as an Alternative Communication Device en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

  • MS [367]

Show simple item record

Search DSpace


Advanced Search

Browse

My Account