Kernel generalized least squares regression for network-structured data

Edward Antonian, Gareth W. Peters*, Michael Chantler

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

14 Downloads (Pure)

Abstract

In this paper, we study a class of non-parametric regression models for predicting graph signals {yt} as a function of explanatory variables {xt}. Recently, Kernel Graph Regression (KGR) and Gaussian Processes over Graph (GPoG) have emerged as promising techniques for this task. The goal of this paper is to examine several extensions to KGR/GPoG, with the aim of generalising them a wider variety of data scenarios. The first extension we consider is the case of graph signals that have only been partially recorded, meaning a subset of their elements is missing at observation time. Next, we examine the statistical effect of correlated prediction error and propose a method for Generalized Least Squares (GLS) on graphs. In particular, we examine Autoregressive AR(1) vector autoregressive processes, which are commonly found in time-series applications. Finally, we use the Laplace approximation to determine a lower bound for the out-of-sample prediction error and derive a scalable expression for the marginal variance of each prediction. These methods are tested on both real and synthetic data, with the former taken from a network of air quality monitoring stations across California. We find evidence that the generalised GLS-KGR algorithm is well-suited to such time-series applications, outperforming several standard techniques on this dataset.

Original languageEnglish
Article numbere0324087
JournalPLoS ONE
Volume20
Issue number5
DOIs
Publication statusPublished - 30 May 2025

Fingerprint

Dive into the research topics of 'Kernel generalized least squares regression for network-structured data'. Together they form a unique fingerprint.

Cite this