NUST Institutional Repository

Clarity Companion - A Cloud-Based Mobile Application For Real-Time Scene Understanding & Narration

Show simple item record

dc.contributor.author SUPERVISOR DR. ARSLAN SHAUKAT DR. WASI HAIDER BUTT, NS MAAZIN ZAIDI NS MASAB BIN NASIM PC SANA FATIMAH NS USAMA SOHAIL
dc.date.accessioned 2024-07-04T05:01:33Z
dc.date.available 2024-07-04T05:01:33Z
dc.date.issued 2024
dc.identifier.other DE-COMP-42
dc.identifier.uri http://10.250.8.41:8080/xmlui/handle/123456789/44498
dc.description Supervisor DR. ARSLAN SHAUKAT DR. WASI HAIDER BUTT en_US
dc.description.abstract Visual impairments affect approximately 2.2 billion individuals worldwide, impacting people of all ages and genders. These disabilities significantly affect personal lives and contribute to a substantial global financial burden. According to theWorld Health Organization (WHO), adults with visual impairments experience higher rates of unemployment and are more prone to depression and anxiety. Additionally, the estimated annual loss in productivity due to visual impairments is approximately US$400 billion in purchasing power parity. Despite numerous ongoing efforts to prevent blindness, this study focuses on assisting visually impaired individuals in comprehending and visualizing their surroundings using mobile phone cameras. By leveraging several pre-trained deep learning models, including YOLO, ByteTrack, MiDaS, BLIP, and GPT-3.5, this research converts video input from mobile phone cameras into textual descriptions that convey the surrounding scene. The proposed system aims to enhance the independence and quality of life for visually impaired individuals by providing real-time, accessible information about their environment. It is noteworthy that several recent attempts have been made to address this issue; however, most of these solutions fall short in fully understanding a scene and delivering voice output in real-time. This study seeks to overcome these limitations by employing advanced deep learning techniques to provide a more accurate and timely interpretation of visual information, thereby offering a more effective aid for the visually impaired en_US
dc.language.iso en en_US
dc.publisher College of Electrical & Mechanical Engineering (CEME), NUST en_US
dc.subject Scene Understanding, YOLOv8, MiDaS, BLIP, GPT, ByteTrack, Real-Time, Flutter, Flask, Cloud en_US
dc.title Clarity Companion - A Cloud-Based Mobile Application For Real-Time Scene Understanding & Narration en_US
dc.type Project Report en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account