AI-Powered Biodiversity Assessment: Species Classification via DNA Barcoding and Deep Learning

Loris Nanni*, Daniela Cuza, Sheryl Brahnam

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)
184 Downloads (Pure)

Abstract

Only 1.2 million out of an estimated 8.7 million species on Earth have been fully classified through taxonomy. As biodiversity loss accelerates, ecologists are urgently revising conservation strategies, but the “taxonomic impediment” remains a significant barrier, limiting effective access to and understanding of taxonomic data for many researchers. As sequencing technologies advance, short DNA sequence fragments increasingly serve as DNA barcodes for species identification. Rapid acquisition of DNA sequences from diverse organisms is now possible, highlighting the increasing significance of DNA sequence analysis tools in species identification. This study introduces a new approach for species classification with DNA barcodes based on an ensemble of deep neural networks (DNNs). Several techniques are proposed and empirically evaluated for converting raw DNA sequence data into images fed into the DNNs. The best-performing approach is obtained by representing each pair of DNA bases with the value of a related physicochemical property. By utilizing different physicochemical properties, we can create an ensemble of networks. Our proposed ensemble obtains state-of-the-art performance on both simulated and real datasets.
Original languageEnglish
Article number240
JournalTechnologies
Volume12
Issue number12
Early online date22 Nov 2024
DOIs
Publication statusPublished - Dec 2024

Keywords

  • DNA barcoding
  • ensemble
  • convolutional neural networks

Fingerprint

Dive into the research topics of 'AI-Powered Biodiversity Assessment: Species Classification via DNA Barcoding and Deep Learning'. Together they form a unique fingerprint.

Cite this