NUST Institutional Repository

Natural Language Descriptions of Images

dc.contributor.author Raja Zain Abbas, Syed Wahab Aftab Chishty, Hamza Javed
dc.date.accessioned 2021-08-02T11:01:28Z
dc.date.available 2021-08-02T11:01:28Z
dc.date.issued 2018
dc.identifier.uri http://10.250.8.41:8080/xmlui/handle/123456789/25172
dc.description Supervisor: Dr Muhammad Shahzad en_US
dc.description.abstract Automatic image captioning has recently gained a lot of attention, especially because of the widespread use of social media. We are in an era where images have become increasingly important. A few years ago, the idea that a computer could understand a scene and describe it without any human assistance seemed far-fetched, but advances in machine learning, computer vision, and natural language processing have paved the way for researchers. As a result, researchers are putting considerable effort into this domain, developing tools that describe images for users. A lot of work has already been done in this domain, and more is under way. Google and Microsoft are investing heavily in the field. Many models have been developed that can caption an image with great accuracy: some describe the different objects in an image, some describe different regions, and some describe a whole image in a single sentence. We have worked with a combination of all of the above. Our model describes every region in the image and then creates a meaningful sentence out of those region descriptions. This has helped us create dense captions for images, thus providing more information. Our project, Natural Language Descriptions of Images - SceneCap, integrates this model into an app that can take real-time pictures of scenes and describe them in great detail. Delivering the model as an application makes it portable and accessible from any smartphone, so it is readily available in the palm of anyone’s hand. en_US
dc.publisher SEECS, National University of Sciences and Technology, Islamabad en_US
dc.subject Software Engineering en_US
dc.title Natural Language Descriptions of Images en_US
dc.type Thesis en_US


This item appears in the following Collection(s)

  • BS [191]
