NUST Institutional Repository

A Model Driven Framework for Ambiguity Detection in SRS

Show simple item record

dc.contributor.author Behzad, Momina
dc.date.accessioned 2024-11-01T05:49:04Z
dc.date.available 2024-11-01T05:49:04Z
dc.date.issued 2024-10-29
dc.identifier.other 362795
dc.identifier.uri http://10.250.8.41:8080/xmlui/handle/123456789/47493
dc.description Supervisor: Dr. Usman Qamar en_US
dc.description.abstract The ambiguity detection in natural language processing (NLP) is critical to enhance the performance of the requirement based applications. There is lexical ambiguity, which is the complexity arising from confusing part words, syntactic, which is confusion about the structure of a sentence, and referential that happens when one has a problem identifying to which term a given term of reference is referring. This paper proposes a sound approach towards identifying these three categories of ambiguities using basic text processing and machine learning algorithmic analysis. The preparation phase itself entails significant pre-processing of the text in which we convert all text to lowercase, tokenize the text, eliminate the stop words and lemmatize the text. Such steps help to pre-process the text data to increase the chances of feature extraction as can be seen in this paper. In feature extraction the relevance of words is checked using Term Frequency-Inverse Document Frequency (TF-IDF) and syntactic feature extraction is done by Part-of-Speech (POS) tagging. Further, the next step involves the feature classification into ambiguous and non-ambiguous using various machine learning algorithms such as Logistic Regression, Random Forest and a Support Vector Machine (SVM). Indicator of a model’s accuracy include accuracy, precision, recall, F1 score and receiver operating characteristic-area under the curve (ROC-AUC). The findings are indicative of the value of combining more conventional methods of NLP with machine learning for improving the process of identifying levels of ambiguity. It is not only helpful in enhancing the explainability of automated systems but also, to a great extent, contributes to make NL Processing rather reliable and accurate in terms of the areas including but not limited to sentiment analysis, neural machine translation, and information retrieval. en_US
dc.language.iso en en_US
dc.publisher College of Electrical & Mechanical Engineering (CEME), NUST en_US
dc.subject Software Requirement Ambiguities, NLP, Machine Learning, Feature Extraction, SVM, RF, LR, Lexical Ambiguity, Syntactical Ambiguity, Referential Ambiguity en_US
dc.title A Model Driven Framework for Ambiguity Detection in SRS en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

  • MS [441]

Show simple item record

Search DSpace


Advanced Search

Browse

My Account