Description
This dataset comprises curated text corpora from three major sources: The Guardian news articles, Reddit posts, and Twitter tweets. It is designed to support research in topic modelling, text mining, and natural language processing. The data has been preprocessed and organised to facilitate comparative studies across different domains and platforms. Potential applications include topic discovery, trend analysis, and cross-domain text analytics.
This research is funded by the Digital Circular Electrochemical Economy (DCEE) [EP/V042432/1] and the UKRI Interdisciplinary Centre for Circular Chemical Economy [EP/V011863/1], [EP/V011863/2].
This research is funded by the Digital Circular Electrochemical Economy (DCEE) [EP/V042432/1] and the UKRI Interdisciplinary Centre for Circular Chemical Economy [EP/V011863/1], [EP/V011863/2].
| Date made available | 25 Jun 2025 |
|---|---|
| Publisher | Heriot-Watt University |
| Date of data production | 2025 |
Research output
- 1 Article
-
Exploring public attention in the circular economy through topic modelling with twin hyperparameter optimisation
Song, J., Yuan, Y., Chang, K., Xu, B., Xuan, J. & Pang, W., Dec 2024, In: Energy and AI. 18, 100433.Research output: Contribution to journal › Article › peer-review
Open AccessFile5 Link opens in a new tab Citations (Scopus)42 Downloads (Pure)
Cite this
- DataSetCite