Attention based bidirectional GRU hybrid model for inappropriate content detection in Urdu Language

Shoukat, Ezzah

DSpace Home
→
E-Theses
→
SEECS
→
Information Technology
→
MS
→
View Item

dc.contributor.author	Shoukat, Ezzah
dc.date.accessioned	2022-08-15T07:52:29Z
dc.date.available	2022-08-15T07:52:29Z
dc.date.issued	2022
dc.identifier.uri	http://10.250.8.41:8080/xmlui/handle/123456789/30077
dc.description	CL-T-6644	en_US
dc.description.abstract	With the advancement in the scope of online discussion, the spread of toxic and inappropriate content on social networking sites has also increased. Several stud ies have been conducted in different languages. However, existing literature on inappropriate content detection lacks research in Urdu Unicode text language us ing deep learning techniques. Use of attention layer with deep learning model can help in handling the long-term dependencies and increase its efficiency. To explore the effect of attention layer, this study proposes an attention based Bidi rectional GRU hybrid model for identification of Inappropriate content in Urdu Unicode text language. Four different baseline deep learning models LSTM, Bi LSTM, GRU, and TCN are used to evaluate the performance of proposed model. The results of models are compared based on evaluation metrics, dataset size and impact of word embedding layer. The pre-trained Urdu word2vec embeddings are utilized for our case. Our proposed model BiGRU-A outperformed all other base line models by yielding 84% accuracy without using pre-trained word2vec layer. From our experiments we have established that attention layer improves the ef ficiency of model and pre-trained word2vec embedding does not work well with inappropriate content dataset.	en_US
dc.description.sponsorship	Dr Rabia Irfan	en_US
dc.language.iso	en	en_US
dc.publisher	SEECS-School of Electrical Engineering and Computer Science NUST Islamabad	en_US
dc.subject	Deep Learning, Natural Language Processing, Text classification, Attention.	en_US
dc.title	Attention based bidirectional GRU hybrid model for inappropriate content detection in Urdu Language	en_US
dc.type	Thesis	en_US