Skip to content
GitLab
Explore
Sign in
Register
Primary navigation
Search or go to…
Project
P
phap
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
Nikos Pappas
phap
Commits
7e735c8e
Commit
7e735c8e
authored
4 years ago
by
Nikos Pappas
Browse files
Options
Downloads
Patches
Plain Diff
update README
parent
b9be4c08
No related branches found
Branches containing commit
No related tags found
Tags containing commit
No related merge requests found
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
README.md
+49
-26
49 additions, 26 deletions
README.md
with
49 additions
and
26 deletions
README.md
+
49
−
26
View file @
7e735c8e
...
...
@@ -143,45 +143,58 @@ Results are stored per sample according to the sample ids you provided in the
sample sheet.
For each sample, results for each tool are stored in directories named after
the tool. An example looks like this:
```
results/
├── A
│ ├── all_predictions.tsv
│ ├── rafah
│ │ ├── A_CDS.faa
│ │ ├── A_CDS.fna
│ │ ├── A_CDS.gff
│ │ ├── A_CDSxMMSeqs_Clusters
│ │ ├── A_Genomes.fasta
│ │ ├── A_Genome_to_Domain_Score_Min_Score_50-Max_evalue_1e-05.tsv
│ │ ├── A_Ranger_Model_3_Predictions.tsv
│ │ ├── A_Seq_Info.tsv
│ │ └── predictions.tsv
│ ├── tmp
│ │ ├── genomes
│ │ └── reflist.txt
│ └── vhulk
│ ├── predictions.tsv
│ └── results.csv
```
results/A/
├── all_predictions.tsv
├── rafah
│ ├── A_CDS.faa
│ ├── A_CDS.fna
│ ├── A_CDS.gff
│ ├── A_CDSxMMSeqs_Clusters
│ ├── A_Genomes.fasta
│ ├── A_Genome_to_Domain_Score_Min_Score_50-Max_evalue_1e-05.tsv
│ ├── A_Ranger_Model_3_Predictions.tsv
│ ├── A_Seq_Info.tsv
│ └── predictions.tsv
├── tmp
│ ├── genomes
│ └── reflist.txt
├── vhmnet
│ ├── feature_values
│ ├── predictions
│ ├── predictions.tsv
│ └── tmp
├── vhulk
│ ├── predictions.tsv
│ └── results
└── wish
├── llikelihood.matrix
├── prediction.list
└── predictions.tsv
```
### Per sample
*
`all_predictions.tsv`
: Contains the best prediction per contig (rows) for
each tool along with its confidence/p-value/whatever single value each tool
uses to evaluate its confidence in the prediction.
An example for three genomes:
```
contig
vhulk_pred
vhulk_score
rafah_pred
rafah_score
NC_005964.2
None 4.068828 Mycoplasma 0.46
1
NC_015271.1
Escherichia_coli
1.0301523
Salmonella
0.495
NC_023719.1
Bacillus
0.0012575098
Bacillus
0.55
contig
vhulk_pred
vhulk_score
rafah_pred
rafah_score
vhmnet_pred vhmnet_score wish_pred wish_score
NC_005964.2
None 4.068828 Mycoplasma 0.461 Mycoplasma fermentans 0.9953 Bacteria;Tenericutes;Mollicutes;Mycoplasmatales;Mycoplasmataceae;Mycoplasma;Mycoplasma fermentans;Mycoplasma fermentans MF-I2 -1.208570000000000
1
NC_015271.1
Escherichia_coli
1.0301523
Salmonella
0.495 Muricauda pacifica 0.9968 Bacteria;Proteobacteria;Gammaproteobacteria;Enterobacterales;Enterobacteriaceae;Raoultella;Raoultella sp. NCTC 9187;Raoultella sp. NCTC 9187 -1.3869200000000002
NC_023719.1
Bacillus
0.0012575098
Bacillus
0.55 Clostridium sp. LS 1.0000 Bacteria;Firmicutes;Clostridia;Clostridiales;Clostridiaceae;Clostridium;Clostridium beijerinckii;Clostridium beijerinckii -1.29454
```
### Per tool
*
`tmp`
directory
*
Contains one fasta file per input genome, along with other intermediate
files necessary for a smooth execution of the workflow.
### Per tool
*
`rafah`
*
All files prefixed with
`<sample_id>_`
are the rafah's raw output
*
`predictions.tsv`
: A selection of the 1st (
`Contig`
) , 6th
...
...
@@ -193,8 +206,18 @@ files necessary for a smooth execution of the workflow.
*
`predictions.tsv`
: A selection of the 1st (
`BIN/genome`
), 10th (
`final_prediction`
)
11th (
`entropy`
) columns from file
`results.csv`
.
*
`vhmnet`
*
Directories
`feature_values`
and
`predictions`
are the raw output
*
Directory
`tmp`
is a temporary dir written by
`VirHostMatcher-Net`
for
doing its magic.
*
`predictions.tsv`
contain contig, host taxonomy and scores.
*
`wish`
*
Files
`llikelihood.matrix`
and
`prediction.list`
are the raw output
*
File
`predictions.tsv`
has contig, host taxonomy and
**llikelihood**
scores.
### Logs
Logs capturing stdout and stderr during execution of each rule
are locate
d in
Logs capturing stdout and stderr during execution of each rule
can be foun
d in
`workdir/logs/<sample_id>/*.log`
files.
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment