This repository git@forgemia.inra.fr:migale/dada2_frogs.git contains the Rmd file used to perform a combination of dada2[1] and FROGS[2][3] and generate a report. It is able to deal with 16S and ITS data.
Some adjustments are to be made to use it on another infrastructure than the migale platform.
Use cases
16S Workflow
flowchart LR
style FROGS stroke:#5f999d
style dada2 stroke:#5f999d
database[(Databank)]-->affiliation
fastq[[Fastq]]
biom[[BIOM]]
tsv[[TSV]]
fastq-->seqkit
seqkit---plotQualityProfiles
subgraph dada2
direction TB
plotQualityProfiles-->filterAndTrim1[filterAndTrim]-->filterAndTrim2[filterAndTrim]-->learnErrors-->dada-->mergePairs-->makeSequenceTable
end
filterAndTrim1-->cutadapt-->filterAndTrim2
makeSequenceTable-->rc
subgraph FROGS
direction TB
rc[remove chimera]-->filters-->affiliation
end
affiliation-->biom
affiliation--> tsv
ITS worflow
flowchart LR
style FROGS stroke:#5f999d
style dada2 stroke:#5f999d
style mergePairs stroke:#5F999d,stroke-width:4px
style ITSx stroke:#5F999d,stroke-width:4px
database[(Databank)]-->affiliation
fastq[[Fastq]]
biom[[BIOM]]
tsv[[TSV]]
fastq-->seqkit
seqkit---plotQualityProfiles
subgraph dada2
direction TB
plotQualityProfiles-->filterAndTrim1[filterAndTrim]-->filterAndTrim2[filterAndTrim]-->learnErrors-->dada-->mergePairs-->makeSequenceTable
end
filterAndTrim1-->cutadapt-->filterAndTrim2
makeSequenceTable-->rc
subgraph FROGS
direction TB
rc[remove chimera]-->filters-->ITSx-->affiliation
end
affiliation-->biom
affiliation--> tsv
If you are not using the migale infrastructure: make sure to add FROGS (v4.0.1) and seqkit installed on your system and update lines in dada2_FROGS.Rmd (line 100):
1. Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. DADA2: High-resolution sample inference from illumina amplicon data. Nature methods. 2016;13:581.
2. Bernard M, Rué O, Mariadassou M, Pascal G. FROGS: a powerful tool to analyse the diversity of fungi with special management of internal transcribed spacers. Briefings in Bioinformatics. 2021;22. doi:10.1093/bib/bbab318.
3. Escudié F, Auer L, Bernard M, Mariadassou M, Cauquil L, Vidal K, et al. FROGS: Find, Rapidly, OTUs with Galaxy Solution. Bioinformatics. 2018;34:1287–94. doi:10.1093/bioinformatics/btx791.
A work by Migale Bioinformatics Facility
Université Paris-Saclay, INRAE, MaIAGE, 78350, Jouy-en-Josas, France
Université Paris-Saclay, INRAE, BioinfOmics, MIGALE bioinformatics facility, 78350, Jouy-en-Josas, France