NUST Institutional Repository

Attention based bidirectional GRU hybrid model for inappropriate content detection in Urdu Language

Show simple item record

dc.contributor.author Shoukat, Ezzah
dc.date.accessioned 2022-08-15T07:52:29Z
dc.date.available 2022-08-15T07:52:29Z
dc.date.issued 2022
dc.identifier.uri http://10.250.8.41:8080/xmlui/handle/123456789/30077
dc.description CL-T-6644 en_US
dc.description.abstract With the advancement in the scope of online discussion, the spread of toxic and inappropriate content on social networking sites has also increased. Several stud ies have been conducted in different languages. However, existing literature on inappropriate content detection lacks research in Urdu Unicode text language us ing deep learning techniques. Use of attention layer with deep learning model can help in handling the long-term dependencies and increase its efficiency. To explore the effect of attention layer, this study proposes an attention based Bidi rectional GRU hybrid model for identification of Inappropriate content in Urdu Unicode text language. Four different baseline deep learning models LSTM, Bi LSTM, GRU, and TCN are used to evaluate the performance of proposed model. The results of models are compared based on evaluation metrics, dataset size and impact of word embedding layer. The pre-trained Urdu word2vec embeddings are utilized for our case. Our proposed model BiGRU-A outperformed all other base line models by yielding 84% accuracy without using pre-trained word2vec layer. From our experiments we have established that attention layer improves the ef ficiency of model and pre-trained word2vec embedding does not work well with inappropriate content dataset. en_US
dc.description.sponsorship Dr Rabia Irfan en_US
dc.language.iso en en_US
dc.publisher SEECS-School of Electrical Engineering and Computer Science NUST Islamabad en_US
dc.subject Deep Learning, Natural Language Processing, Text classification, Attention. en_US
dc.title Attention based bidirectional GRU hybrid model for inappropriate content detection in Urdu Language en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

  • MS [432]

Show simple item record

Search DSpace


Advanced Search

Browse

My Account