TY - JOUR
T1 - SuperMetal: a generative AI framework for rapid and precise metal ion location prediction in proteins
AU - Lin, Xiaobo
AU - Su, Zhaoqian
AU - Liu, Yunchao
AU - Liu, Jingxian
AU - Kuang, Xiaohan
AU - Cummings, Peter T.
AU - Spencer-Smith, Jesse
AU - Meiler, Jens
PY - 2025/7/15
Y1 - 2025/7/15
N2 - Metal ions, as abundant and vital cofactors in numerous proteins, are crucial for enzymatic activities and protein interactions. Given their pivotal role and catalytic efficiency, accurately and efficiently identifying metal-binding sites is fundamental to elucidating their biological functions and has significant implications for protein engineering and drug discovery. To address this challenge, we present SuperMetal, a generative AI framework that leverages a score-based diffusion model coupled with a confidence model to predict metal-binding sites in proteins with high precision and efficiency. Using zinc ions as an example, SuperMetal outperforms existing state-of-the-art models, achieving a precision of 94 % and coverage of 90 %, with zinc ions localization within 0.52 ± 0.55 Å of experimentally determined positions, thus marking a substantial advance in metal-binding site prediction. Furthermore, SuperMetal demonstrates rapid prediction capabilities (under 10 s for proteins with ∼ 2000 residues) and remains minimally affected by increases in protein size. Notably, SuperMetal does not require prior knowledge of the number of metal ions—unlike AlphaFold 3, which depends on this information. Additionally, SuperMetal can be readily adapted to other metal ions or repurposed as a probe framework to identify other types of binding sites, such as protein-binding pockets. Scientific contribution SuperMetal introduces a diffusion-based, SE(3)-equivariant generative model that places metal ions in proteins with 94 % precision, 90 % coverage, and sub-ångström (0.52 Å) accuracy in under 10 s, surpassing current methods and accelerating metal-aware protein engineering and drug discovery.
AB - Metal ions, as abundant and vital cofactors in numerous proteins, are crucial for enzymatic activities and protein interactions. Given their pivotal role and catalytic efficiency, accurately and efficiently identifying metal-binding sites is fundamental to elucidating their biological functions and has significant implications for protein engineering and drug discovery. To address this challenge, we present SuperMetal, a generative AI framework that leverages a score-based diffusion model coupled with a confidence model to predict metal-binding sites in proteins with high precision and efficiency. Using zinc ions as an example, SuperMetal outperforms existing state-of-the-art models, achieving a precision of 94 % and coverage of 90 %, with zinc ions localization within 0.52 ± 0.55 Å of experimentally determined positions, thus marking a substantial advance in metal-binding site prediction. Furthermore, SuperMetal demonstrates rapid prediction capabilities (under 10 s for proteins with ∼ 2000 residues) and remains minimally affected by increases in protein size. Notably, SuperMetal does not require prior knowledge of the number of metal ions—unlike AlphaFold 3, which depends on this information. Additionally, SuperMetal can be readily adapted to other metal ions or repurposed as a probe framework to identify other types of binding sites, such as protein-binding pockets. Scientific contribution SuperMetal introduces a diffusion-based, SE(3)-equivariant generative model that places metal ions in proteins with 94 % precision, 90 % coverage, and sub-ångström (0.52 Å) accuracy in under 10 s, surpassing current methods and accelerating metal-aware protein engineering and drug discovery.
KW - Generative AI
KW - Metal-binding sites
KW - Diffusion model
KW - Metalloprotein
UR - https://www.scopus.com/pages/publications/105010698478
U2 - 10.1186/s13321-025-01038-9
DO - 10.1186/s13321-025-01038-9
M3 - Article
C2 - 40665445
SN - 1758-2946
VL - 17
JO - Journal of Cheminformatics
JF - Journal of Cheminformatics
M1 - 107
ER -