Skip to content
Snippets Groups Projects
Commit b1611d99 authored by Nikos Pappas's avatar Nikos Pappas
Browse files

include full link to git clone, edit some lists

parent d49c1f2d
No related branches found
No related tags found
No related merge requests found
...@@ -6,7 +6,7 @@ ...@@ -6,7 +6,7 @@
``` ```
# Clone this repo # Clone this repo
$ git clone this_repo pvogs_function $ git clone https://git.science.uu.nl/n.pappas/pvogs_function.git
# Get in there # Get in there
$ cd pvogs_function $ cd pvogs_function
...@@ -74,7 +74,8 @@ Currently, this workflow was built and tested on a local machine with an X serve ...@@ -74,7 +74,8 @@ Currently, this workflow was built and tested on a local machine with an X serve
`cd` into the root directory of this repo. `cd` into the root directory of this repo.
- Dry run: - Dry run
Always a good idea before launching the whole worfklow Always a good idea before launching the whole worfklow
``` ```
$ snakemake --use-conda -j16 -np $ snakemake --use-conda -j16 -np
...@@ -87,6 +88,7 @@ If the dry run completed with no errors you can execute the worfklow by removing ...@@ -87,6 +88,7 @@ If the dry run completed with no errors you can execute the worfklow by removing
$ snakemake --use-conda -j16 -p $ snakemake --use-conda -j16 -p
``` ```
- Speed up environment creation with mamba - Speed up environment creation with mamba
If `mamba` is available in your snakemake environment, or if you created a new environment with the `environment.yml` If `mamba` is available in your snakemake environment, or if you created a new environment with the `environment.yml`
provided here: provided here:
``` ```
...@@ -94,6 +96,7 @@ $ snakemake --use-conda -j16 --conda-frontend mamba ...@@ -94,6 +96,7 @@ $ snakemake --use-conda -j16 --conda-frontend mamba
``` ```
- Jupyter integration - Jupyter integration
A central notebook is used for all visualization and machine learning (model search) purposes. A central notebook is used for all visualization and machine learning (model search) purposes.
Its main output is the `results/RF/best_model.pkl` file. Its main output is the `results/RF/best_model.pkl` file.
...@@ -118,9 +121,10 @@ Something along the [guidelines from snakemake](https://snakemake.readthedocs.io ...@@ -118,9 +121,10 @@ Something along the [guidelines from snakemake](https://snakemake.readthedocs.io
## Output ## Output
The output of the whole workflow is produced and stored within a `results` directory. This looks like below. The output of the whole workflow is produced and stored within a `results` directory.
This has the structure shown below.
(several directories and files have been omitted) (several directories and files have been omitted)
Th most prominent ones are marked with a short description: The most prominent ones are marked with a short description:
``` ```
# Skipping several thousands of intermediate files with the -I option # Skipping several thousands of intermediate files with the -I option
$ tree -n -I '*NC*.fasta|*_genes.*|*.gff|*.log' results $ tree -n -I '*NC*.fasta|*_genes.*|*.gff|*.log' results
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment