NUST Institutional Repository

Enhanced Spatial Stream of Two Stream Network for Human Action Recognition

Show simple item record

dc.contributor.author Khan, Shahbaz
dc.date.accessioned 2023-08-04T07:37:26Z
dc.date.available 2023-08-04T07:37:26Z
dc.date.issued 2021
dc.identifier.other 274099
dc.identifier.uri http://10.250.8.41:8080/xmlui/handle/123456789/35633
dc.description Supervisor: Dr. Ali Hassan en_US
dc.description.abstract CNN have been proven effective in deep learning methods for Huaman Action Recognition (HAR) along with other computer vision tasks but the problem of overfitting in this domain remains till date, as deep learning models need large amount of data for training. This thesis is inspired by the two-stream network for HAR where CNN has been deployed as a base model to show that both, the spatial and the temporal aspects of an action are important for its recognition. To deal with the mentioned issue we have proposed enhancement of the spatial stream, which consists of two parts. Primarily, we adopted transfer learning in the spatial stream, where we demonstrated that by using models which are pre-trained on larger datasets like ImageNet yields good performance instead of training the original model from scratch. Secondly, we offer dataset augmentation technique, where we increased the dataset size by performing various random transformations like rotations, cropping and flipping on the image. Further, fine-tuning the network of the enhanced spatial stream on the augmented dataset increases the accuracy. Our architecture is trained and tested on UCF-101 dataset, which is the latest and standard benchmark for action videos. Our results are competent and are comparable with the state of the art two-strean network’s results. Also, our network performed well in the spatial stream as compared to other models. en_US
dc.language.iso en en_US
dc.publisher College of Electrical & Mechanical Engineering (CEME), NUST en_US
dc.subject Key Words: Human Action Recognition, Overfitting, Transfer Learning, Two Stream Network en_US
dc.title Enhanced Spatial Stream of Two Stream Network for Human Action Recognition en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

  • MS [329]

Show simple item record

Search DSpace


Advanced Search

Browse

My Account