Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[New Model] ProREM #55

Open
wants to merge 8 commits into
base: main
Choose a base branch
from
Open

[New Model] ProREM #55

wants to merge 8 commits into from

Conversation

tyang816
Copy link
Contributor

@tyang816 tyang816 commented Oct 31, 2024

Hi, Pascal

We recently fused MSA into the ProSST model and achieved better performance, achieving a score of 0.518 on substitutions named ProREM. (https://arxiv.org/abs/2410.21127)

  1. Added the scoring code of prorem in proteingym/baselines/prorem.
  2. Added prorem information to config.json and constants.json.
  3. Added scripts/scoring_DMS_zero_shot/scoring_ProREM_substitutions.sh.
  4. Update the structure tokenizer of ProSST.

It is worth noting:

  1. A new conda environment is required, the configuration file is in proteingym/baselines/prorem/environment.yaml (same as ProtSSN and ProSST), or you can use the environment of ProSST.
  2. The original PDB files are same with ProtSSN and ProSST, you can download them here. The .a2m files can be downloaded here.
  3. More model information can be seen in ProREM repository.

Thanks again for your community efforts.

best wishes
Yang Tan

@tyang816 tyang816 changed the title [New Model] ProtREM [New Model] ProREM Nov 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant