genomics-pipeline

Genomics Pipeline Documentation

Analyze your own whole genome sequencing (WGS) data on consumer hardware. 27 analysis steps, all running locally in Docker.

Back to GitHub repository


Getting Started

Pipeline Steps

Core Analysis (Steps 1-20)

# Step Doc
1 ORA to FASTQ (Illumina decompression) 01-ora-to-fastq.md
2 Alignment (minimap2 + samtools) 02-alignment.md
3 Variant Calling (DeepVariant) 03-variant-calling.md
4 Structural Variants (Manta) 04-structural-variants.md
5 SV Annotation (AnnotSV) 05-annotsv.md
6 ClinVar Screen 06-clinvar-screen.md
7 Pharmacogenomics (PharmCAT) 07-pharmacogenomics.md
8 HLA Typing (T1K) 08-hla-typing.md
9 STR Expansions (ExpansionHunter) 09-str-expansions.md
10 Telomere Length (TelomereHunter) 10-telomere-analysis.md
11 ROH Analysis (bcftools roh) 11-roh-analysis.md
12 Mito Haplogroup (haplogrep3) 12-mito-haplogroup.md
13 VEP Annotation 13-vep-annotation.md
14 Imputation Prep 14-imputation-prep.md
15 SV Quality (duphold) 15-duphold.md
16 Coverage QC (indexcov) 16-indexcov.md
17 Cancer Predisposition (CPSR) 17-cpsr.md
18 CNV Calling (CNVnator) 18-cnvnator.md
19 SV Calling (Delly) 19-delly.md
20 Mitochondrial (GATK Mutect2) 20-mtoolbox.md

Post-Processing (Steps 21-27)

# Step Doc
21 CYP2D6 Star Alleles (Cyrius) 21-cyrius.md
22 SV Consensus Merge 22-survivor-merge.md
23 Clinical Variant Filter 23-clinical-filter.md
24 HTML Summary Report 24-html-report.md
25 Polygenic Risk Scores 25-prs.md
26 Ancestry PCA 26-ancestry.md
27 CPIC Drug Recommendations 27-cpic-lookup.md

Guides