USAD: An Intelligent System for Slang and Abusive Text Detection in PERSO-Arabic-Scripted Urdu

Complexity 2020:1-7 (2020)

Abstract

The use of slang, abusive, and offensive language has become common practice on social media. Although social media companies have censorship policies for slang, abusive, vulgar, and offensive language, the practice persists because resources and research on automatic abusive-language detection for languages other than English remain limited. This study proposes USAD, a lexicon-based intelligent framework to detect abusive and slang words in Perso-Arabic-scripted Urdu Tweets. Furthermore, because no standard dataset is available, we also design and annotate a dataset of abusive, offensive, and slang words in Perso-Arabic-scripted Urdu as our second significant contribution, intended for future research. The results show that the proposed USAD model correctly classifies 72.6% of Tweets as abusive or nonabusive. Additionally, we identify some key factors that can help researchers improve their abusive language detection models.
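
The abstract describes USAD only as "lexicon-based" and gives no implementation details, so the following is a minimal sketch of what lexicon matching on Perso-Arabic-scripted text could look like, not the authors' method. The lexicon entries, function names, and the whitespace/punctuation tokenizer are all assumptions made for illustration; the two sample entries are mild placeholder words, not items from the paper's dataset.

```python
import re

# Hypothetical placeholder lexicon (assumption): the paper's actual
# lexicon is not reproduced here.
ABUSIVE_LEXICON = {"بیوقوف", "نالائق"}

# \w is Unicode-aware in Python 3, so runs of Perso-Arabic letters are
# kept together. Simplification: diacritics and zero-width joiners are
# not handled and would split a token.
TOKEN_RE = re.compile(r"\w+")


def tokenize(tweet):
    """Split a tweet into word tokens, dropping punctuation."""
    return TOKEN_RE.findall(tweet)


def is_abusive(tweet, lexicon=ABUSIVE_LEXICON):
    """Flag a tweet if any of its tokens appears in the lexicon."""
    return any(token in lexicon for token in tokenize(tweet))


def accuracy(tweets, labels, lexicon=ABUSIVE_LEXICON):
    """Fraction of tweets whose predicted flag matches the gold label."""
    correct = sum(is_abusive(t, lexicon) == y for t, y in zip(tweets, labels))
    return correct / len(labels)
```

Under these assumptions, a figure such as the 72.6% reported above would correspond to `accuracy(...)` computed over an annotated set of Tweets. Exact-token matching of this kind misses spelling variants and context-dependent usage.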

Similar books and articles

Three Words of Abusive Slang in Aeschines. P. G. Maxwell-Stuart - 1975 - American Journal of Philology 96 (1):7.
Use of slang among different age groups in Karachi. Areeba Mazhar - 2015 - Journal of Social Sciences and Humanities 54 (1):65-88.

Analytics

Added to PP
2020-12-22

Author's Profile

Ahmad Arshad
York University
