Skip to main navigation Skip to search Skip to main content

Verifying Urdu news authenticity using deep learning with concatenated BERT and GloVe embedding

  • Asif Feroz
  • , Waseem Abbasi
  • , Muhammad Zeeshan Babar
  • , Abeer Aljohani

Research output: Contribution to journalArticlepeer-review

1 Downloads (Pure)

Abstract

Fake news has a significant effect on reader perceptions and is therefore a serious problem. It is difficult to differentiate between fake and real news as the number of News platforms on social media are growing daily. This work aims to create a complete fake news detection mechanism for Pakistani news by using many fact-checked APIs. A total of 14,178 real and fake news items from fifteen different regions and sections of reputable and authoritative Urdu newspapers and news broadcasting websites in Pakistan are included in the dataset. We evaluate the dataset via three deep learning models RoBERTa, XLM-RoBERTa and mBERT are embedded with concatenated BERT and GloVe. For the training of the selected models, we first use an extensive multilingual dataset. The results of the proposed models are as follows: evaluated via performance metrics such as the F1 score, accuracy, precision and recall. XLM-RoBERTa with concatenated GloVe embedding outperforms with an F1 score of 0.956, accuracy of 0.962, precision of 0.932 and recall of 0.940. A comparative analysis of the latest machine learning and deep learning models is also done. The latest Urdu benchmark dataset shows that the XLM-RoBERTa model with Concatenated BERT and GloVe embedding outperforms these models for Urdu fake news detection.
Original languageEnglish
Article number7352
JournalScientific Reports
Volume16
Early online date5 Feb 2026
DOIs
Publication statusPublished - 20 Feb 2026

Keywords

  • True news
  • UFND
  • Urdu news
  • GloVe
  • Fake news
  • Deep learning

Fingerprint

Dive into the research topics of 'Verifying Urdu news authenticity using deep learning with concatenated BERT and GloVe embedding'. Together they form a unique fingerprint.

Cite this