To propose new binders, I used the newly released BoltzGen as a generative prior to sample de-novo short protein sequences (target length: 40–60 AA). The point was not sequence editing of known binders, but generative sampling: BoltzGen was used to draw candidate sequences that the model judges to lie in foldable / protein-like manifold regions of sequence space.
For the top scoring sequences, they were then passed through AlphaFold structure prediction to quickly sanity-check whether the candidate sequences appear to form a compact, stable single-chain topology. Predicted models that were fully or partly disordered were filtered out. Also a BSA analysis in pymol was done.
For the subset of sequences with coherent predicted folds, I then computationally estimated interface scores against the G protein surface using PRODIGY. These interface scores were only used comparatively (to triage candidates), not as absolute physical numbers.
Two of the top-scoring candidates (sequence 9 and 10 in our set) were intentionally generated as cyclic sequences.
id: rough-otter-jade

Nipah Virus Glycoprotein G
0.75
71.37
--
4.1 kDa
42
id: wild-shark-leaf

Nipah Virus Glycoprotein G
0.74
86.60
--
5.2 kDa
48
id: scarlet-toad-ivy

Nipah Virus Glycoprotein G
0.50
48.51
--
4.1 kDa
42
id: green-dove-snow

Nipah Virus Glycoprotein G
0.49
40.80
--
4.3 kDa
44
id: solid-gecko-quartz

Nipah Virus Glycoprotein G
0.48
88.29
--
3.9 kDa
42
id: young-owl-oak

Nipah Virus Glycoprotein G
0.45
65.28
--
5.3 kDa
52
id: frozen-falcon-pine

Nipah Virus Glycoprotein G
0.15
62.31
--
5.8 kDa
58
id: ivory-zebra-stone

Nipah Virus Glycoprotein G
0.00
75.30
--
4.8 kDa
48
id: crimson-wolf-quartz

Nipah Virus Glycoprotein G
0.00
69.27
--
5.6 kDa
54
id: wild-boar-onyx

Nipah Virus Glycoprotein G
0.00
58.60
--
5.0 kDa
50