Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[feature] Lvl-specific dropout correction #1

Open
testaibot opened this issue Feb 24, 2018 · 2 comments
Open

[feature] Lvl-specific dropout correction #1

testaibot opened this issue Feb 24, 2018 · 2 comments

Comments

@testaibot
Copy link
Owner

Problem: DE genes search on initial step of SAVER removes rear landmark genes, making impute step useless.

Idea to consider: seurat's DE genes binned search

Possible solution: we have no prior information about real landmark genes and full dropout correction is time-consuming, possible solution is to group genes by nZero percentage, making assumption, that zeroes are mainly real zeroes, rather dropout events

@testaibot
Copy link
Owner Author

This isn't optimal, but as we know, "real" dropouts are around 10-25% for droplet-based methods.
Limiting Saver's lasso with genes in these boundaries we can dramatically improve performance on "10k+ genes" datasets

@testaibot
Copy link
Owner Author

We assume, that correction may perform better in case of several expression levels of the same gene.
E.g. Nanog fold-change is 4-8 orders for 4cell->blastocyst and 1.5-3 for subpopulations.
If there is a gene that co-expressed with Nanog in the first subpopulation and not expressed in the second it can be imputed, indusing false-positives.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant