NUST Institutional Repository

Offensive Language Detection Using Machine Learning (OLDUM)

Show simple item record

dc.contributor.author Shah, Ijaz
dc.contributor.author Khalid, Hamza
dc.contributor.author Nasir, Muhammad Hasnat
dc.contributor.author Karimi, Zeeshan
dc.contributor.author Arman, Mubashir
dc.contributor.author Supervised by Prof Dr. Shibli Nisar
dc.contributor.author Co Supervised by Prof Dr. Alina Mirza
dc.date.accessioned 2025-02-12T08:09:48Z
dc.date.available 2025-02-12T08:09:48Z
dc.date.issued 2022-05
dc.identifier.other PTC-419
dc.identifier.uri http://10.250.8.41:8080/xmlui/handle/123456789/49759
dc.description.abstract Cyberbullying using offensive language on the Internet has become a major problem among all age groups. Automatic detection of offensive language from social media applications, websites, and blogs is a difficult but important task. In recent years, the presence of offensive language on social media platforms and automatic detection of such language is becoming a major challenge in modern society. The complexity of natural language constructs makes this task even more challenging. Until now, most of the research has focused on resource-rich languages like English. This study is about the detection of offensive language from the user's audio presented in a resource-poor language i.e., Pushto. We propose the first offensive dataset of Pushto containing user-generated Audio from social media. We use individual and combined n-grams techniques to extract features at word level and gender basis. We will apply classifiers from different machine learning techniques to detect offensive language from Pushto Audio. Offensive Language detection Using Machine Learning (OLDUM) aims at developing a prototype of a system that, using machine learning, will be capable of detecting offensive words in Pashto language, helping in automating the process of AUDIO/VOICE note by the social media Applications/Website and therefore stopping any unethical activity. en_US
dc.language.iso en en_US
dc.publisher MCS en_US
dc.title Offensive Language Detection Using Machine Learning (OLDUM) en_US
dc.type Project Report en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account