纠正错误 / Fix 添加实例 / Add example
文件格式 / Formats

GATK

Broad Institute 的变异分析工具箱,常用于 germline calling、GVCF 联合分型和变异过滤。Genome Analysis Toolkit for variant discovery workflows.

速览 | Quick Look

安装 | Install

mamba install -c bioconda gatk4

常用命令 | Common Commands

单样本 GVCF calling:

gatk HaplotypeCaller \
  -R reference.fa \
  -I sample.sorted.bam \
  -O sample.g.vcf.gz \
  -ERC GVCF

导入多个 GVCF:

gatk GenomicsDBImport \
  --sample-name-map samples.map \
  --genomicsdb-workspace-path cohort_db \
  -L targets.interval_list

联合分型:

gatk GenotypeGVCFs \
  -R reference.fa \
  -V gendb://cohort_db \
  -O cohort.vcf.gz

过滤 SNP:

gatk VariantFiltration \
  -R reference.fa -V cohort.vcf.gz -O cohort.filtered.vcf.gz \
  --filter-expression "QD < 2.0 || FS > 60.0 || MQ < 40.0" \
  --filter-name "basic_snp_filter"

关键参数 | Key Options

常见坑 | Pitfalls

参考 | References