NUST Institutional Repository

Efficient Feature Selection for Machine Learning Using Biologically Inspired Algorithms

Show simple item record

dc.contributor.advisor
dc.contributor.author Moosavi, Syed Kumayl Raza
dc.date.accessioned 2023-05-26T11:01:56Z
dc.date.available 2023-05-26T11:01:56Z
dc.date.issued 2023
dc.identifier.uri http://10.250.8.41:8080/xmlui/handle/123456789/33606
dc.description.abstract Big data has become a ubiquitous feature of modern society, providing opportunities for extracting valuable insights from large datasets. However, the sheer volume, variety, and complexity of big data make it difficult to process and analyze, particularly in the field of Machine Learning (ML). Therefore the technique of dimensionality reduction is of paramount importance in ML, data mining and information retrieval domains. Fea- ture selection is a crucial step in the preprocessing of data for machine learning modèle for dimensionality reduction, as it involves selecfing a subset of relevant features to improve the model’s accuracy and reduce the risk of overfitting. Metaheuristic algo- rithm based wrapper methods are a type of feature selection technique that involves using the predictive performance of a learning algorithm to create a reduced feature set. The traditional methodology involves the use of K-Nearest Neighbour based accuracy maximization as the colt function, which is seen to produce sub-optimal solutions in large sample spaces and is relatively resource intensive. In this work, a novel meta- heuristic algorithm, namely the Hybrid Sine Cosine Firehawk Algorithm (HSCFHA) is proposed. The proposed feature selection technique uses this hybrid algorithm and aime to eliminate insignificant and redundant features by including the minimisation of dataset variance in the cost function. Furthermore, the hybrid algorithm uses the best features of the combined algorithm to improve the exploration ability. The proposed technique is tested on 22 various University of California Irvine datasets containing low, medium and high dimensional datasets and compared to the fraditional KNN based approach. Moreover, the technique is also compared with other state-of-the-art meta heuristic techniques, namely Parficle Swarm Optimizer, Grey Wolf Optimizer, Whale Optimization Algorithm, Hybrid Ant Colony Optimizer and Improved Binary Bat Al- gorithm. The results show significant improvements over previous techniques in terms of minimal loss in essential data while reducing the size of the raw data in considerably less time as well as a well-balanced confusion matrix.
dc.description.abstract Supervisor: Dr. Ahsan Saadat
dc.description.sponsorship Dr. Ahsan Saadat en_US
dc.description.uri
dc.language.iso en en_US
dc.publisher School of Electrical Engineering and Computer Sciences (SEECS) NUST en_US
dc.title Efficient Feature Selection for Machine Learning Using Biologically Inspired Algorithms en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account