NUST Institutional Repository

Efficient Discrete Cosine Transform (DCT) coefficients for Images Using Deep Learning

Show simple item record

dc.contributor.author Khan, Zirak
dc.date.accessioned 2023-08-10T06:08:23Z
dc.date.available 2023-08-10T06:08:23Z
dc.date.issued 2019
dc.identifier.other 00000172451
dc.identifier.uri http://10.250.8.41:8080/xmlui/handle/123456789/36185
dc.description Supervisor: Dr. Farhan Hussain en_US
dc.description.abstract Digital audio, videos and images corresponds to huge amount of data. The storage and transmission of this data requires significant amount of bandwidth and memory respectively. Compressing this digital data is a field which has been researched upon for decades. Many state of the art compression algorithms have been proposed to cater the storage and transmission requirements of digital data. Aiming at the digital image compression, Discrete Cosine Transform is a widely used transform to explore the frequencies present in a digital image. During the quantization step the less significant frequencies are discarded and only the more important frequencies are retained. This quantization results in the reduced representation of the image hence compression is achieved. The image reconstructed from this reduced frequency set is an approximation of the original image and hence it results in lossy image compression. Lately, a significant amount of research work has been conducted based on the use of neural networks for image compression. This thesis presents a detailed literature review to thoroughly analyze the existing literature and methods. In this thesis we target a deep neural network that can estimate the most important DCT coefficients for an image and then we utilize these most significant DCT Coefficients for the classification task. The estimation of DCT coefficients is targeted by a Multi-Layered Perceptron (MLP) model and a Deep Convolutional Neural Network (DCNN) model. The experimentation showed promising results and revealed that MLP models have relatively lower error rate between actual and predicted results, as compared to DCNN models. Later on, MNIST image dataset is applied to the proposed deep learning models for the prediction of its most significant DCT coefficients and the predicted results are then used for digits classification. The experimental results support the DCT based digits classification with an accuracy of 95%, which is quiet promising. In future, the proposed technique also leverages the use of compressed images for tackling different image classification and regression problems. Moreover, the proposed deep neural networks can be further generalized to support videos and color representations. en_US
dc.language.iso en en_US
dc.publisher College of Electrical & Mechanical Engineering (CEME), NUST en_US
dc.subject Keywords: Image Compression, lossy compression, DCT, Deep Learning, CNN, MLP en_US
dc.title Efficient Discrete Cosine Transform (DCT) coefficients for Images Using Deep Learning en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

  • MS [443]

Show simple item record

Search DSpace


Advanced Search

Browse

My Account