Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
PN269 authored Jun 6, 2024
1 parent 07c74b4 commit f55b102
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
# Leveraging a large language model to predict protein phase transition: a physical, multiscale and interpretable approach
We apply a unified modeling framework to predict protein phase transition (PPT). In classification task A proteins exhibiting experimental evidence of undergoing a phase transition, forming either droplets or amyloids, are consolidated into a single dataset (+Droplet drivers and +Amyloids). Phase transition propensity is predicted versus the preference to maintain the native soluble state (-PT). In classification task B the unified dataset is utilized to predict the propensity to form droplets versus amyloid aggregates. To accomplish this, we fine-tune the ESM-2 model to predict PPT and compare its performance to biophysical knowledge-based models (e.g., random forest).

![Alt text](./Files/Schematic.png?raw=true "Title")
![Alt text](./Files/schematic.png?raw=true "Title")
This repository provides code for predicting protein phase transition (PPT) propensity, including two examples of AD-related proteins, their associated genes, and transcription factors.

## Folder and related notebooks:
Expand Down

0 comments on commit f55b102

Please sign in to comment.