dc.description.abstract |
With an ever-increasing demand for faster and more efficient computation in the machine
learning domain, FPGAs emerge as a powerful platform for accelerating a wide range of
applications. This study examines the interplay between FPGA technology and machine
learning algorithms, showing how FPGA-based acceleration can enable real-time inference.
This project presents an implementation of a super-resolution (SR) convolutional neural
network (CNN) accelerator on a low-cost Zynq-7000 series SoC-based FPGA. The trained SR-CNN
model is deployed on a low-cost Xilinx FPGA board (Cora Z7) by describing it in C and
synthesizing it with Vivado high-level synthesis (HLS). Key optimization techniques such as
loop unrolling, direct memory access (DMA), look-up-table (LUT) partitioning, multiplexer
(MUX) incorporation and 8-bit integer (int8) quantization are strategically applied to make
efficient use of the limited on-board resources, yielding an optimized IP core. The system
exhibits deterministic latency, consistently delivering high-resolution images within a fixed
processing time. The inherent parallelism of the FPGA and the application-specific
architecture enable remarkably low processing times for executing our deep neural network,
reconstructing high-quality images from low-resolution inputs. The outcomes demonstrate both
the viability and the effectiveness of implementing complex machine learning models on
cost-effective FPGA platforms.
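To make these optimization techniques concrete, the following is a minimal, illustrative
HLS-style C sketch of an int8 convolution inner loop combining loop unrolling, pipelining
and array partitioning; it is not the project's actual IP core, and the kernel size, tile
size, function name and re-quantization shift are assumed placeholders.

/* Illustrative sketch only: a hypothetical 3x3 int8 convolution tile in
   Vivado HLS C. All names and sizes are placeholders, not the thesis design. */
#include <stdint.h>

#define K 3       /* kernel size (assumed) */
#define TILE 32   /* on-chip tile width (assumed) */

void conv3x3(const int8_t in[TILE + K - 1][TILE + K - 1],
             const int8_t weight[K][K],
             int8_t out[TILE][TILE])
{
/* keep all kernel weights in registers so the unrolled loops can read them in parallel */
#pragma HLS ARRAY_PARTITION variable=weight complete dim=0
    for (int r = 0; r < TILE; r++) {
        for (int c = 0; c < TILE; c++) {
/* pipeline the per-pixel loop to start one output computation every cycle */
#pragma HLS PIPELINE II=1
            int32_t acc = 0;
            for (int i = 0; i < K; i++) {
#pragma HLS UNROLL
                for (int j = 0; j < K; j++) {
#pragma HLS UNROLL
                    /* int8 multiply-accumulate into a wider accumulator */
                    acc += (int32_t)in[r + i][c + j] * weight[i][j];
                }
            }
            /* re-quantize the 32-bit accumulator back to int8; the shift stands in
               for the model's actual scaling factor */
            int32_t q = acc >> 7;
            if (q > 127)  q = 127;
            if (q < -128) q = -128;
            out[r][c] = (int8_t)q;
        }
    }
}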
By establishing a solid platform, this project encourages the broader adoption and creative
implementation of FPGA-accelerated solutions, ultimately advancing the computational
capability available to the field of machine learning. |
en_US |