WORKFLOW
1. Quality control: FastQC, MultiQC
2. Read mapping: BWA
(3. Mapped reads processing): RmDup
4. Variant calling: FreeBayes
5. Variant annotation and reporting: SnpEff, GEMINI load
QUALITY CONTROL
1. Inspect a raw sequence file
Six files in total (a family trio, paired-end sequencing, so two files per sample)
wget -b https://zenodo.org/record/3243160/files/father_R1.fq.gz
wget -b https://zenodo.org/record/3243160/files/father_R2.fq.gz
wget -b https://zenodo.org/record/3243160/files/mother_R1.fq.gz
wget -b https://zenodo.org/record/3243160/files/mother_R2.fq.gz
wget -b https://zenodo.org/record/3243160/files/proband_R1.fq.gz
wget -b https://zenodo.org/record/3243160/files/proband_R2.fq.gz
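The six downloads above follow one naming pattern, so they could also be generated in a loop. This is only a minimal sketch, assuming the same Zenodo record URL as above; the `list_urls` function and the `DRY_RUN` variable are hypothetical names introduced here for illustration and are not part of the original notes.

```shell
# Assumption: same Zenodo record as the wget commands above.
base_url="https://zenodo.org/record/3243160/files"

# Hypothetical helper: print the six URLs (3 samples x 2 read files).
list_urls() {
    for sample in father mother proband; do
        for read in R1 R2; do
            echo "${base_url}/${sample}_${read}.fq.gz"
        done
    done
}

# DRY_RUN=1 (the default in this sketch) only prints the URLs.
# Set DRY_RUN=0 to download; -c resumes partial files, -i - reads URLs from stdin.
if [ "${DRY_RUN:-1}" = "0" ]; then
    list_urls | wget -c -i -
else
    list_urls
fi
```

Running it first with the default `DRY_RUN=1` shows exactly which files would be fetched before any download starts.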
2. Assess the read quality
# Run FastQC on a single FASTQ file; results are written to $qcdir
qcdir=~/project/boy/afterqc
fqdir=~/project/boy/qc
mkdir -p $qcdir   # FastQC does not create the output directory itself
fastqc -t 3 -o $qcdir $fqdir/father_R1.fq.gz
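# Quality control for several files at once (files named *.fastq.gz)
fastqc -t 2 -o $qcdir $fqdir/*.fastq.gz
## For the exome data here the files end in .fq.gz, so the command is
fastqc -t 10 -o $qcdir $fqdir/*.fq.gz
# Combine the FastQC results with MultiQC (run from inside $qcdir)
multiqc *.zip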
- Per base sequence quality
- Per sequence quality scores
- Per base sequence content
- Per sequence GC content
- Adapter
- N Content
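Each FastQC zip archive contains a `summary.txt` with one tab-separated line per module (status, module name, file name), so the modules listed above can be scanned without opening every HTML report. This is a minimal sketch, assuming the reports sit in `$qcdir`, are named `*_fastqc.zip`, and that `unzip` and `awk` are installed; `flag_modules` is a hypothetical helper name.

```shell
# Hypothetical helper: read FastQC summary lines on stdin and print
# only the modules whose status is WARN or FAIL.
flag_modules() {
    awk -F '\t' '$1 == "WARN" || $1 == "FAIL" { print $3 ": " $1 " (" $2 ")" }'
}

# Assumption: reports live in $qcdir and are named <sample>_fastqc.zip.
qcdir="${qcdir:-$HOME/project/boy/afterqc}"
for zip in "$qcdir"/*_fastqc.zip; do
    [ -e "$zip" ] || continue    # skip if the glob matched nothing
    # -p writes the archive member to stdout instead of extracting it
    unzip -p "$zip" '*/summary.txt' | flag_modules
done
```

A WARN or FAIL is a prompt to look at the corresponding plot, not a verdict on its own; for example, exome capture data may trigger warnings that are expected for that library type.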