Abstract
Excessive content of profanity in audio and video files has proven to shape one's character and behavior. Currently, conventional methods of manual detection and censorship are being used. Manual censorship method is time consuming and prone to misdetection of foul language. This paper proposed an intelligent model for foul language censorship through automated and robust detection by deep convolutional neural networks (CNNs). A dataset of foul language was collected and processed for the computation of audio spectrogram images that serve as an input to evaluate the classification of foul language. The proposed model was first tested for 2-class (Foul vs Normal) classification problem, the foul class is then further decomposed into a 10-class classification problem for exact detection of profanity. Experimental results show the viability of proposed system by demonstrating high performance of curse words classification with 1.24-2.71 Error Rate (ER) for 2-class and 5.49-8.30 F1- score. Proposed Resnet50 architecture outperforms other models in terms of accuracy, sensitivity, specificity, F1-score.
Original language | English |
---|---|
Title of host publication | 2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP) |
Publisher | IEEE |
ISBN (Electronic) | 9781728193205 |
DOIs | |
Publication status | Published - 16 Dec 2020 |
Event | 22nd IEEE International Workshop on Multimedia Signal Processing 2020 - Virtual, Tampere, Finland Duration: 21 Sept 2020 → 24 Sept 2020 |
Conference
Conference | 22nd IEEE International Workshop on Multimedia Signal Processing 2020 |
---|---|
Abbreviated title | MMSP 2020 |
Country/Territory | Finland |
City | Virtual, Tampere |
Period | 21/09/20 → 24/09/20 |
Keywords
- Censorship
- CNN
- Foul language
- Spectrogram
- Speech detection
ASJC Scopus subject areas
- Signal Processing
- Media Technology