TY - JOUR
T1 - GAN River-I: A process-based low NTG meandering reservoir model dataset for machine learning studies
AU - Sun, Chao
AU - Demyanov, Vasily
AU - Arnold, Daniel
N1 - Funding Information:
The work contained in this paper contains work conducted during a PhD study undertaken as part of the Natural Environment Research Council (NERC) Centre for Doctoral Training (CDT) in Oil & Gas [grant number NEM00578X/1 ]. This project is sponsored by Heriot-Watt University via their James Watt Scholarship Scheme whose support is gratefully acknowledged.
Publisher Copyright:
© 2022 The Authors
PY - 2023/2
Y1 - 2023/2
N2 - The GAN River-I data set is designed to provide a stern test for machine learning and geostatistical tools that wish to recreate the complex geometries of realistic facies distributions in subsurface reservoirs. It provides more complex, non-stationary facies distributions than previous open data sets, some of which have modelled channels but do not include the number and complex association of facies types of this data set. GAN River-I is a dataset of 2D layers of 3D facies models produced from a process-based simulator of a meandering fluvial system. The data set contains 25 simulated 3D cubes, converted into three datasets consisting of 16,000 2D models/images, each representing the increasing complexity of the modelled facies. The number of facies decreases between the three data sets, with nine facies, seven facies and three facies, respectively. The facies reduction is carried out by amalgamating similar facies in terms of their likely permeability and sedimentary relationships to represent flowing units in a subsurface reservoir. The data is therefore provided to allow users to increase the model complexity in a manageable and comparable way between groups using the data. GAN River-I covers a range of low NTG meandering patterns with varied avulsion rates. Each dataset comprises an ensemble of meandering models representing various plausible patterns and, therefore, can be used as a geologically plausible benchmark for testing generative models’ performance. We provide three data file formats, including image, Ndarray and GSLIB, to adapt to different researchers’ preferences.
AB - The GAN River-I data set is designed to provide a stern test for machine learning and geostatistical tools that wish to recreate the complex geometries of realistic facies distributions in subsurface reservoirs. It provides more complex, non-stationary facies distributions than previous open data sets, some of which have modelled channels but do not include the number and complex association of facies types of this data set. GAN River-I is a dataset of 2D layers of 3D facies models produced from a process-based simulator of a meandering fluvial system. The data set contains 25 simulated 3D cubes, converted into three datasets consisting of 16,000 2D models/images, each representing the increasing complexity of the modelled facies. The number of facies decreases between the three data sets, with nine facies, seven facies and three facies, respectively. The facies reduction is carried out by amalgamating similar facies in terms of their likely permeability and sedimentary relationships to represent flowing units in a subsurface reservoir. The data is therefore provided to allow users to increase the model complexity in a manageable and comparable way between groups using the data. GAN River-I covers a range of low NTG meandering patterns with varied avulsion rates. Each dataset comprises an ensemble of meandering models representing various plausible patterns and, therefore, can be used as a geologically plausible benchmark for testing generative models’ performance. We provide three data file formats, including image, Ndarray and GSLIB, to adapt to different researchers’ preferences.
KW - Conceptual geological model
KW - Facies modelling
KW - Generative adversarial networks
KW - Meandering river
UR - http://www.scopus.com/inward/record.url?scp=85145647524&partnerID=8YFLogxK
U2 - 10.1016/j.dib.2022.108785
DO - 10.1016/j.dib.2022.108785
M3 - Article
C2 - 36506803
SN - 2352-3409
VL - 46
JO - Data in Brief
JF - Data in Brief
M1 - 108785
ER -