Westlake Representation Learning Lab (WRLL)

ProTrek

ProTrek is a tri-modal protein language model that jointly models protein sequence, structure, and function (SSF).

Pipeline

Input: Prompt + Database

Model: ProTrek

Model: Structure Validation

Select

Output: Sequence

Description

ProTrek is trained with contrastive learning using three core alignment strategies: (1) mutual supervision between amino acid (AA) sequences and structures, where each serves as the supervision signal for the other, (2) mutual supervision between sequences and functions, and (3) mutual supervision between structures and functions. This tri-modal alignment training enables ProTrek to tightly associate SSF by pulling genuine sample pairs (sequence-structure, sequence-function, and structure-function) closer together in the latent space while pushing negative samples farther apart.
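The three alignment objectives described above can be illustrated with a small sketch. It assumes a symmetric InfoNCE-style loss applied to each modality pair, which is a common way to implement contrastive alignment; the description here does not state ProTrek's exact loss, so the function names (`info_nce`, `bidirectional_loss`, `tri_modal_loss`), the temperature value, and the toy embeddings are all hypothetical and chosen only for illustration.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit length so dot products are cosine similarities."""
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]


def info_nce(anchors, targets, temperature=0.07):
    """Mean cross-entropy of matching anchors[i] to targets[i].

    targets[i] is the genuine (positive) partner of anchors[i]; every other
    target in the batch acts as a negative sample.
    The temperature is an assumed value, not one taken from ProTrek.
    """
    anchors = [normalize(a) for a in anchors]
    targets = [normalize(t) for t in targets]
    total = 0.0
    for i, a in enumerate(anchors):
        logits = [dot(a, t) / temperature for t in targets]
        m = max(logits)  # subtract the max for a numerically stable log-sum-exp
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - logits[i]
    return total / len(anchors)


def bidirectional_loss(x, y, temperature=0.07):
    """Mutual supervision: x supervises y and y supervises x."""
    return info_nce(x, y, temperature) + info_nce(y, x, temperature)


def tri_modal_loss(seq, struct, func, temperature=0.07):
    """Sum of the three pairwise alignment terms (assumed equal weighting)."""
    return (
        bidirectional_loss(seq, struct, temperature)   # (1) sequence <-> structure
        + bidirectional_loss(seq, func, temperature)   # (2) sequence <-> function
        + bidirectional_loss(struct, func, temperature)  # (3) structure <-> function
    )


# Toy batch of two proteins with 2-D embeddings per modality (made-up values).
seq_emb = [[1.0, 0.0], [0.0, 1.0]]
struct_emb = [[1.0, 0.0], [0.0, 1.0]]
func_aligned = [[1.0, 0.0], [0.0, 1.0]]
func_shuffled = [[0.0, 1.0], [1.0, 0.0]]  # function embeddings paired with the wrong protein

loss_aligned = tri_modal_loss(seq_emb, struct_emb, func_aligned)
loss_shuffled = tri_modal_loss(seq_emb, struct_emb, func_shuffled)
print(loss_aligned < loss_shuffled)
```

In this toy batch the loss is near zero when every modality embeds the same protein in the same direction, and it grows when the function embeddings are paired with the wrong protein, which is the behavior the alignment training relies on.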

ProTrek achieves over 30x and 60x improvements in sequence-function and function-sequence retrieval, respectively, is 100x faster than Foldseek and MMseqs2 in protein-protein search, and outperforms ESM-2 in 9 of 11 downstream prediction tasks.