Authors
Buelbuel, E. F., Bang, S., Geroge, K., Bianchi, G., Raj, P., Chung, S., Pauline, V., Hochstrasser, R., Minas, H. A., Elgaher, W. A. M., Kany, A. M., Hirsch, A., Schmitt, S., Heinz, D. W., Kalinina, O. V., Klakow, D., Bozhueyuek, K. A. J.
Abstract
Large language models and generative protein design promise to accelerate biotechnology, but it remains unclear whether they can engineer dynamic megasynth(et)ases whose activity depends on transient, context-specific domain interfaces. Nonribosomal peptide synthetases (NRPSs) are an especially demanding target, yet a high-value one because they produce many clinically important natural products and offer a route to analogs that are often difficult or impractical to access by chemical synthesis. Here we integrate pretrained generative models (ESM3, ProteinMPNN and EvoDiff) with design-build-test-learn cycles and data-guided prioritization to generate 76 de novo thiolation (T) domains. We built and tested 578 recombinant NRPS variants in vivo spanning minimal, full-length and hybrid assembly lines. AI-designed T-domains supported product formation across architectures, enabled catalytically active hybrids at recombined junctions and increased yields by up to ~3-fold relative to NRPSs carrying the native T-domain. A representative design showed improved soluble expression, refolding, and a 12 C higher melting temperature, while molecular dynamics simulations indicated preserved global stability but reshaped, state-dependent interdomain contact networks. Together, these results establish generative design as an effective route to context-conditioned optimization and reprogramming of biosynthetic assembly lines.
Preprint server:
bioRxiv
The authors list and abstract were imported from bioRxiv on 05 Mar 2026.
Advertisement
Stats
- Recommendations n/a n/a positive of 0 vote(s)
- Views 34
- Comments 0