Ubiquitous Document Capturing with Deep-Learning

Naz, Tayyaba

DSpace Home
→
E-Theses
→
SEECS
→
Information Technology
→
MS
→
View Item

Ubiquitous Document Capturing with Deep-Learning

Naz, Tayyaba

URI: http://10.250.8.41:8080/xmlui/handle/123456789/10155

Date: 2016

Abstract:

Digital and paper documents co-exist in our daily life. Seamless integration of information from both sources is crucial for e cient knowledge management. The project aims at developing an algorithm which can handle the detection of document so that it can be captured easily to convert it into a digital form for automatic integration of relevant information in electronic work-flows. An approach based on machine learning is proposed unlike the most commonly geometric based methods. Convolutional neural networks (CNN) are used to train a binary classification model and the main algorithm is working by combining line segment detection with CNN model to extract the boundary of the document. CNN model detects the absence and presence of document in the given input and thus that model is then being applied onto the parts of the image being divided along the detected lines. The usage of deep-learning technique provides a solution which is more generalized and flexible than other available solutions.