A Model Driven Framework for Ambiguity Detection in SRS

Behzad, Momina

DSpace Home
→
E-Theses
→
CEME
→
Computer Software Engineering
→
MS
→
View Item

dc.contributor.author	Behzad, Momina
dc.date.accessioned	2024-11-01T05:49:04Z
dc.date.available	2024-11-01T05:49:04Z
dc.date.issued	2024-10-29
dc.identifier.other	362795
dc.identifier.uri	http://10.250.8.41:8080/xmlui/handle/123456789/47493
dc.description	Supervisor: Dr. Usman Qamar	en_US
dc.description.abstract	The ambiguity detection in natural language processing (NLP) is critical to enhance the performance of the requirement based applications. There is lexical ambiguity, which is the complexity arising from confusing part words, syntactic, which is confusion about the structure of a sentence, and referential that happens when one has a problem identifying to which term a given term of reference is referring. This paper proposes a sound approach towards identifying these three categories of ambiguities using basic text processing and machine learning algorithmic analysis. The preparation phase itself entails significant pre-processing of the text in which we convert all text to lowercase, tokenize the text, eliminate the stop words and lemmatize the text. Such steps help to pre-process the text data to increase the chances of feature extraction as can be seen in this paper. In feature extraction the relevance of words is checked using Term Frequency-Inverse Document Frequency (TF-IDF) and syntactic feature extraction is done by Part-of-Speech (POS) tagging. Further, the next step involves the feature classification into ambiguous and non-ambiguous using various machine learning algorithms such as Logistic Regression, Random Forest and a Support Vector Machine (SVM). Indicator of a model’s accuracy include accuracy, precision, recall, F1 score and receiver operating characteristic-area under the curve (ROC-AUC). The findings are indicative of the value of combining more conventional methods of NLP with machine learning for improving the process of identifying levels of ambiguity. It is not only helpful in enhancing the explainability of automated systems but also, to a great extent, contributes to make NL Processing rather reliable and accurate in terms of the areas including but not limited to sentiment analysis, neural machine translation, and information retrieval.	en_US
dc.language.iso	en	en_US
dc.publisher	College of Electrical & Mechanical Engineering (CEME), NUST	en_US
dc.subject	Software Requirement Ambiguities, NLP, Machine Learning, Feature Extraction, SVM, RF, LR, Lexical Ambiguity, Syntactical Ambiguity, Referential Ambiguity	en_US
dc.title	A Model Driven Framework for Ambiguity Detection in SRS	en_US
dc.type	Thesis	en_US