The Importance of Blending Different Data Types to Train Machine Learning Classifiers for Sedimentary Structure Detection

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution


This study demonstrates that adding sketched interpretation data to photographic datasets of geological outcrops improves the quality of sedimentary structure classification, even for smaller datasets. We blended raw outcrop photos with sketches of sedimentary structures and used them as input to a Convolutional Neural Network (CNN) model that predicts and classifies certain geological structures. A CNN can make geological classification easier by assisting in the collection of geological observations in seconds. Our work shows that the CNN model misclassified various geological features when trained on only one type of data (outcrop photos or geological sketches). The efficacy and novelty of the system described in this paper lie in blending two different data types (outcrop photographs and geological sketches) when training our CNN model for geological feature detection. Training on the blended dataset at an optimal balance between sketches and outcrop photos (a sketch proportion of 40% to 67% in the training dataset) results in fewer misclassifications and higher test accuracy in the model's predictions of sedimentary structures.
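To make the idea of a "sketch proportion" concrete, the following is a minimal sketch of how a blended training set might be assembled at a chosen ratio. It is not the authors' pipeline: the function name `blend_training_set`, the file names, and the sampling strategy (random sampling without replacement, fixed seed) are all assumptions made for illustration, and the abstract does not say how the blending was actually implemented.

```python
import random
from typing import List, Sequence, Tuple


def blend_training_set(
    photos: Sequence[str],
    sketches: Sequence[str],
    sketch_proportion: float,
    total: int,
    seed: int = 0,
) -> List[Tuple[str, str]]:
    """Return a shuffled list of (path, source) pairs.

    Hypothetical helper: `sketch_proportion` is the share of sketches in the
    blended set (the abstract reports 0.40 to 0.67 as the optimal range).
    """
    if not 0.0 <= sketch_proportion <= 1.0:
        raise ValueError("sketch_proportion must be between 0 and 1")

    # Number of items drawn from each source.
    n_sketch = round(total * sketch_proportion)
    n_photo = total - n_sketch
    if n_sketch > len(sketches) or n_photo > len(photos):
        raise ValueError("not enough photos or sketches for the requested blend")

    # Sample without replacement from each source, tag each item, then shuffle.
    rng = random.Random(seed)
    blended = [(path, "photo") for path in rng.sample(list(photos), n_photo)]
    blended += [(path, "sketch") for path in rng.sample(list(sketches), n_sketch)]
    rng.shuffle(blended)
    return blended


# Illustrative file names only; no real dataset is implied.
photos = [f"photo_{i}.jpg" for i in range(100)]
sketches = [f"sketch_{i}.png" for i in range(100)]
blended = blend_training_set(photos, sketches, sketch_proportion=0.4, total=50)
```

The tagged pairs could then be passed to whatever image-loading step feeds the CNN; sweeping `sketch_proportion` across several values is one way to look for the kind of optimal range the abstract describes.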

Original language: English
Title of host publication: 82nd EAGE Conference and Exhibition 2021
Publisher: EAGE Publishing BV
Number of pages: 5
ISBN (Electronic): 9781713841449
Publication status: Published - 2021
Event: 82nd EAGE Conference and Exhibition 2021 - Amsterdam, Virtual, Netherlands
Duration: 18 Oct 2021 - 21 Oct 2021


Conference: 82nd EAGE Conference and Exhibition 2021
Abbreviated title: EAGE 2021
City: Amsterdam, Virtual

ASJC Scopus subject areas

  • Geochemistry and Petrology
  • Geology
  • Geophysics
  • Geotechnical Engineering and Engineering Geology

