dc.contributor.author |
Asadullah |
|
dc.date.accessioned |
2020-12-31T06:34:50Z |
|
dc.date.available |
2020-12-31T06:34:50Z |
|
dc.date.issued |
2015 |
|
dc.identifier.uri |
http://10.250.8.41:8080/xmlui/handle/123456789/20133 |
|
dc.description |
Supervisor
Dr. Arslan Shaukat |
en_US |
dc.description.abstract |
Pattern Recognition and Machine Learning has played a key role in the development of
Automatic Speech Recognition (ASR). Most of the ASR research has been done using
English Language as English is most widely spoken language in the World. However, there
has been done very less research work generally in Non-English languages and particularly
in Urdu language due to the interest of researchers and non-availability of resources for this
language. There is no standard dataset available for Urdu language due to the lack of
awareness and research and interest of the people, although Urdu and Hindi is the national
language of Pakistan and India.
In this research work an Urdu ASR has been developed using CMU Sphinx Toolkit. We
have used a medium scale vocabulary (250) of Urdu with 250 isolated words uttered by 10
different speakers. The isolated words in the dataset are the most spoken words of Urdu
language. Mel Frequency Cepstral Coefficients (MFCC) is used for feature extraction.
Carnegie Mellon University (CMU) Sphinx Toolkit is used for Classification, which is
based upon state of the art statistical technique Hidden Markov Model (HMM). Sphinxtrain
is used for Training of the Acoustic Model. CMUCLTK is used for the development of
Language Model and Pronunciation Dictionary. This work created Acoustic Model,
Language Model and Dictionary for Urdu Words, which can be used in other Urdu Isolated
Words Applications. This research work has achieved some promising results as compared
to the previous results available in the literature on this dataset. The experiment is
performed on the 100-word to compare the results with the previous work and then all the
250-words are used in another experiment. |
en_US |
dc.publisher |
CEME, National University of Sciences and Technology, Islamabad |
en_US |
dc.subject |
Hidden Markov Model, Automatic Speech Recognition, Urdu Isolated Words, CMU Sphinx, Pattern Recognition, Natural Language Processing (NLP), Computational Linguistics, Mel Frequency Cepstral Coefficients (MFCC) |
en_US |
dc.title |
Automatic Urdu Speech Recognition using Hidden Markov Model |
en_US |
dc.type |
Thesis |
en_US |