Abstract
Whole-cell models (WCMs) are multi-scale computational models that aim to simulate the function of all genes and processes within a cell. This approach is promising for designing genomes tailored for specific tasks. However, a limitation of WCMs is their long runtime. Here, we show how machine learning (ML) surrogates can be used to address this limitation by training them on WCM data to accurately predict cell division. Our ML surrogate achieves a 95% reduction in computational time compared with the original WCM. We then show that the surrogate and a genome-design algorithm can generate an in silico-reduced E. coli cell, where 40% of the genes included in the WCM were removed. The reduced genome is validated using the WCM and interpreted biologically using Gene Ontology analysis. This approach illustrates how the holistic understanding gained from a WCM can be leveraged for synthetic biology tasks while reducing runtime. A record of this paper’s transparent peer review process is included in the supplemental information.
| Original language | English |
|---|---|
| Article number | 101392 |
| Journal | Cell Systems |
| Volume | 16 |
| Issue number | 10 |
| Early online date | 24 Sept 2025 |
| DOIs | |
| Publication status | Published - 15 Oct 2025 |
Keywords
- whole-cell modeling
- genome design
- machine learning surrogate
- genome reduction
- gene essentiality
- synthetic biology