Diff-Shape keeps 3D shape for KRAS/EGFR; BRADSHAW yields ERAP1 tools in 4 AI rounds

Today's Overview

  • Diff-Shape diffusion model keeps 3D shape while generating novel KRAS and EGFR inhibitors Achieves substantially higher 3D shape fidelity than state-of-the-art while maintaining low 2D graph similarity across benchmarks.
  • BRADSHAW Platform Delivers ERAP1 Tool Compounds via Four-Round AI-Guided Optimization Four iterative rounds of in-silico generation and ML filtering produced ERAP1 inhibitor series with progressively optimized potency and drug-like properties.

Also Worth Noting

03
MURNet PFAS binding predictorDocking & Binding

MURNet fuses chemical descriptors, 2D graphs and fingerprints to predict PFAS-plasma protein binding, outperforming baselines and reliably extrapolating to PFAS homologues in HSA case studies. link (Chem)

04
ToxCML hybrid QSAR/read-across platformADMET & Properties

ToxCML integrates consensus QSAR and k-NN read-across with five molecular representations to predict 18 toxicity endpoints for 54,601 chemicals, yielding AUC 0.86–0.99 and BACC 0.73–0.98 on external sets while maintaining >95% in-domain coverage. link (Chem)

05

A condition-aware discrete diffusion language model that couples non-autoregressive denoising with reinforcement learning to generate molecules meeting heterogeneous structure and property constraints, outperforming prior methods in binding affinity and drug-likeness benchmarks. link

06
Multimodal SMILES+descriptor seq2seq model outperforms baselines on hierarchical ATC code assignment for drug discovery and repurposingGeneral AIDD

A sequence-to-sequence architecture that jointly encodes SMILES strings and molecular descriptors predicts multi-level ATC codes more accurately than baseline methods and includes a stopping rule to handle polypharmacological labels. link (Chem)

07
scProTrans: sequence-guided single-cell proteome translationOmics & Biomarkers

scProTrans, a sequence-aware deep architecture with hierarchical attention and bidirectional encoders, outperforms state-of-the-art methods on 17 multi-omics datasets in predicting single-cell protein abundance from RNA while preserving low-abundance signals and extending to tri-omics ATAC-RNA-protein translation. link

Today's Observation

Two complementary studies show how structure-aware generative models can accelerate hit-to-lead cycles when tightly coupled to assay feedback. The Diff-Shape diffusion framework keeps 3D shape fidelity while delivering low 2D similarity, enabling scaffold hops around a single crystal structure. On benchmark sets this translates into controllable decoration, linker replacement and core hops; on KRAS G12D and EGFR mutants the model produced synthesizable ligands with nanomolar biochemical IC50, although only in-vitro data are reported so far. Shape constraint therefore appears useful for cryptic or flat pockets where graph-only models often fail.

The BRADSHAW platform addresses the downstream optimisation gap. Four iterative rounds of ML filtering and chemical synthesis, each retraining on fresh ERAP1 assay data, moved an initial μM fragment to a 200 nM cellular tool compound with rat oral bioavailability. Success required deliberate human-in-the-loop filtering; teams initially struggled when the algorithm supplied >90 % of proposals. Together the papers underline that 3D-aware generation can supply novel, potent starting points, but sustained improvements—and avoidance of optimisation cul-de-sacs—still demand rapid assay turnaround and experienced chemist curation at every cycle.

The above is personal commentary for reference only. Refer to the original papers for authoritative content.