SAE-Steered Protein Binder Design with Topological Pre-Filtering
We present an interpretability-grounded approach to de novo binder design using sparse autoencoder (SAE) feature decomposition of ESM-2 combined with topological data analysis (TDA) for pre-filtering.
Pipeline: (1) Constrained sequence generation targeting RBX-1 RING domain (res 40-108, E2-binding face), (2) SAE-steered refinement using InterPLM SAE on ESM2-650M layer 18 with helix feature f5922 clamped at 10.0, (3) persistent homology entropy pre-filter (threshold 1.1841, empirically calibrated), (4) Boltz-2 structure prediction with iPTM >= 0.70 cutoff, (5) expression filtering (net charge, hydrophobic stretches, unpaired Cys).
RBX-1 results: 200 parents -> 1000 steered variants -> 359 passed topo filter (35.9%) -> 57 Boltz predictions -> 54 hits (94.7% hit rate) -> 35 passed expression filter. Best iPTM: 0.949. Mean iPTM (final 35): 0.834.
Prior validation on KRAS G12D: 84.3% hit rate (43/51), best consensus iPTM 0.960 +/- 0.012 vs 0.838 +/- 0.016 unsteered baseline. SAE steering produced higher-quality, more reproducible candidates.
Novel contributions: (1) First use of SAE feature clamping as a protein generation steering mechanism, (2) persistent homology entropy as topological pre-filter validated against 4 alternative TDA methods (Cohen's d=0.487), (3) multi-seed consensus ranking. To our knowledge, this is the first competition entry using mechanistic interpretability of a protein language model as the primary design mechanism.
Method PDF: https://drive.google.com/file/d/1RxmzSYQ4GhEuQQVV5pg--INPEgS1ysAR/view?usp=sharing
id: wild-kiwi-opal
No preview available
--
--
--
--
--
21
id: wild-zebra-clay
No preview available
--
--
--
--
--
14
id: gentle-ox-stone
No preview available
--
--
--
--
--
14
id: azure-heron-moss
No preview available
--
--
--
--
--
13
id: vast-toad-vine
No preview available
--
--
--
--
--
12
id: pale-kiwi-granite
No preview available
--
--
--
--
--
16
id: strong-kiwi-bronze
No preview available
--
--
--
--
--
18
id: radiant-raven-topaz
No preview available
--
--
--
--
--
14
id: soft-wolf-plume
No preview available
--
--
--
--
--
16
id: steady-ant-marble
No preview available
--
--
--
--
--
14
id: strong-panther-plume
No preview available
RBX1
0.42
--
--
--
18
id: solid-orca-cloud
No preview available
--
--
--
--
--
13
id: bright-mole-orchid
No preview available
--
--
--
--
--
12
id: vast-shark-onyx
No preview available
--
--
--
--
--
14
id: hollow-bat-fern
No preview available
--
--
--
--
--
17
id: brisk-zebra-jade
No preview available
--
--
--
--
--
13
id: quiet-tiger-frost
No preview available
--
--
--
--
--
16
id: quick-ant-moss
No preview available
--
--
--
--
--
18
id: quiet-swan-ice
No preview available
--
--
--
--
--
15
id: violet-raven-crystal
No preview available
--
--
--
--
--
18
id: misty-cat-cedar
No preview available
--
--
--
--
--
18