Contributing to tool recommender system in Galaxy using deep learning (Gated recurrent units neural network)
Following steps can be followed to start contributing to this project:
- Fork this repository (https://github.com/anuprulez/galaxy_tool_recommendation).
- Create a new branch.
- Install the dependencies by executing the following lines:
conda env create -f environment.yml
conda activate tool_prediction_gru_wc
- The scripts are located at
scripts/
. - Data files is located at
data/
. - Add new features/techniques.
- Run the project using
sh train.sh
.- To run this project on complete set of workflow, large compute resource is needed (with at least 20-30 GB RAM) and running time is > 24 hours.
- Details of the parameters in the training script are given in
README.md
.
- Get a recommended model at
data/<<file name>>.hdf5
. - See recommended tools using
ipython_script/tool_recommendation_gru_wc.ipynb
or place the newly created recommendation model (from step 7) atipython_script/data/<<file name>>.hdf5
. - Open a pull request against the main repository (https://github.com/anuprulez/galaxy_tool_recommendation).
- Anup Kumar (https://github.com/anuprulez) (Main contributor).
- Helena Rasche (https://github.com/hexylena) (Contributed to the scripts for data collection from Galaxy EU server and to the Galaxy tool prediction API).
- ...