Accelerated design of Escherichia coli reduced genomes using a whole-cell model and machine learning

Ioana M. Gherman, Kieren Sharma, Joshua Rees-Garbutt, Wei Pang, Zahraa S. Abdallah, Thomas E. Gorochowski, Claire S. Grierson, Lucia Marucci

Research output: Contribution to journalArticlepeer-review

Abstract

Whole-cell models (WCMs) are multi-scale computational models that aim to simulate the function of all genes and processes within a cell. This approach is promising for designing genomes tailored for specific tasks. However, a limitation of WCMs is their long runtime. Here, we show how machine learning (ML) surrogates can be used to address this limitation by training them on WCM data to accurately predict cell division. Our ML surrogate achieves a 95% reduction in computational time compared with the original WCM. We then show that the surrogate and a genome-design algorithm can generate an in silico-reduced E. coli cell, where 40% of the genes included in the WCM were removed. The reduced genome is validated using the WCM and interpreted biologically using Gene Ontology analysis. This approach illustrates how the holistic understanding gained from a WCM can be leveraged for synthetic biology tasks while reducing runtime. A record of this paper’s transparent peer review process is included in the supplemental information.
Original languageEnglish
Article number101392
JournalCell Systems
Volume16
Issue number10
Early online date24 Sept 2025
DOIs
Publication statusPublished - 15 Oct 2025

Keywords

  • whole-cell modeling
  • genome design
  • machine learning surrogate
  • genome reduction
  • gene essentiality
  • synthetic biology

Fingerprint

Dive into the research topics of 'Accelerated design of Escherichia coli reduced genomes using a whole-cell model and machine learning'. Together they form a unique fingerprint.

Cite this