On-chain analytics for sentiment-driven statistical causality in cryptocurrencies

Ioannis Chalkiadakis, Anna Zaremba, Gareth W. Peters, Michael J. Chantler

Research output: Contribution to journal > Article > peer-review

3 Citations (Scopus)
35 Downloads (Pure)


Abstract

This paper establishes a new framework for assessing multimodal statistical causality between cryptocurrency market (cryptomarket) sentiment and cryptocurrency price processes. To achieve this, we present an efficient algorithm for multimodal statistical causality analysis based on Multiple-Output Gaussian Processes. Signals from different information sources (modalities) are jointly modelled as a Multiple-Output Gaussian Process, and then, using a novel approach to statistical causality based on Gaussian Processes (GPs), we study linear and non-linear causal effects between the different modalities. We demonstrate the effectiveness of our approach in a machine learning application by studying the relationship between cryptocurrency spot price dynamics and sentiment time-series data specific to the crypto sector, which we conjecture influences retail investor behaviour. The investor sentiment is extracted from cryptomarket news data via methods developed in Natural Language Processing (NLP), an area of statistical machine learning. To capture sentiment, we present a novel framework for text-to-time-series embedding, which we then use to construct a sentiment index from publicly available news articles. We conduct a statistical analysis of our sentiment index model and compare it to alternative state-of-the-art sentiment models popular in the NLP literature. For the multimodal causality analysis, investor sentiment is our primary modality of exploration, alongside price and a blockchain technology-related indicator (hash rate). The analysis shows that our approach is effective in modelling causal structures of varying degrees of complexity between heterogeneous data sources, and it illustrates the impact that certain modelling choices for the different modalities can have on detecting causality. A solid understanding of these factors is necessary to gauge cryptocurrency adoption by retail investors and to provide sentiment- and technology-based insights into cryptocurrency market dynamics.
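The idea of GP-based statistical causality described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm. It assumes single-output GPs with a squared-exponential kernel, a coarse hyperparameter grid, one lag, and synthetic data in which a hypothetical `sentiment` series drives a hypothetical `price` series. The score is a raw log marginal likelihood ratio between a model that sees the lagged candidate cause and one that does not; it is not a calibrated hypothesis test.

```python
import numpy as np

def rbf_kernel(A, B, lengthscale):
    # Squared-exponential kernel with unit signal variance (an assumption).
    sq = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-0.5 * sq / lengthscale**2)

def gp_log_marginal(X, y, lengthscale, noise_var):
    # Log marginal likelihood of a zero-mean GP regression model.
    n = len(y)
    K = rbf_kernel(X, X, lengthscale) + noise_var * np.eye(n)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    return -0.5 * y @ alpha - np.log(np.diag(L)).sum() - 0.5 * n * np.log(2 * np.pi)

def best_log_marginal(X, y):
    # Crude hyperparameter fit: maximise over a small grid (chosen for brevity).
    return max(gp_log_marginal(X, y, ls, nv)
               for ls in (0.5, 1.0, 2.0, 4.0)
               for nv in (0.01, 0.1, 0.5, 1.0))

def causality_score(cause, effect):
    # Log-likelihood ratio: effect_t modelled from (effect_{t-1}, cause_{t-1})
    # versus effect_{t-1} alone. Larger values favour "cause helps predict effect".
    target = effect[1:]
    own_lag = effect[:-1, None]
    both_lags = np.column_stack([effect[:-1], cause[:-1]])
    return best_log_marginal(both_lags, target) - best_log_marginal(own_lag, target)

# Synthetic data (hypothetical): sentiment drives price non-linearly with one lag.
rng = np.random.default_rng(0)
n = 150
sentiment = rng.standard_normal(n)
price = np.zeros(n)
for t in range(1, n):
    price[t] = 0.3 * price[t - 1] + np.tanh(2.0 * sentiment[t - 1]) + 0.1 * rng.standard_normal()

forward = causality_score(sentiment, price)   # sentiment -> price
reverse = causality_score(price, sentiment)   # price -> sentiment
print(f"sentiment -> price: {forward:.1f}")
print(f"price -> sentiment: {reverse:.1f}")
```

On this synthetic example the forward score should come out far larger than the reverse score, since only the forward direction carries predictive information. A multimodal analysis such as the one described above would instead model all series jointly and assess the evidence with a formal test.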

Original language: English
Article number: 100063
Journal: Blockchain: Research and Applications
Issue number: 2
Early online date: 28 Jan 2022
Publication status: Published - Jun 2022


Keywords

  • Blockchain regression
  • Cryptocurrencies
  • Cryptonews sentiment
  • Multiple-output Gaussian process
  • Natural language processing
  • Statistical causality

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems

