dc.contributor.author |
Batool, Fatima |
|
dc.date.accessioned |
2024-08-15T11:41:10Z |
|
dc.date.available |
2024-08-15T11:41:10Z |
|
dc.date.issued |
2024-08-02 |
|
dc.identifier.other |
402874 |
|
dc.identifier.uri |
http://10.250.8.41:8080/xmlui/handle/123456789/45451 |
|
dc.description |
MS Statistics Thesis |
en_US |
dc.description.abstract |
This research tackles the issue of selecting variables and making predictions in high-dimensional
datasets by employing a range of regression techniques, such as Bayesian regression, Lasso, Elastic
Net, Orthogonal Matching Pursuit, and RANSAC Regression. The main aim is to determine the most
efficient method for forecasting a dependent variable using a large array of independent variables and
to identify the key predictors. To assess these techniques, we use synthetic data with one dependent
variable and 1627 independent variables. Each model undergoes testing and training 50 times, with
performance measured by the average Mean Squared Error (MSE) across various data splits and crossvalidation.
The results have crucial implications for domains that require reliable methods for variable
selection and prediction. Future research will aim to apply these methods to real-world datasets and
further refine them to boost their predictive accuracy. |
en_US |
dc.description.sponsorship |
Dr. Tahir Mahmood |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
National University of Science and Technology NUST H-12 Islamabad |
en_US |
dc.subject |
High-dimensional data, Variable selection, Bayesian regression, Lasso, Elastic Net, Orthogonal Matching Pursuit, RANSAC Regression, Mean Squared Error (MSE), Model evaluation, Synthetic data. |
en_US |
dc.title |
Machine Learning for Predicting the Antidiabetic Properties of Novel Schiff Bases through FTIR |
en_US |
dc.type |
Thesis |
en_US |