Basic CPU-native NGS data processing workflow

Simplest NGS data processing workflows have two main stages
- Read mapping and alignment refinement
- Variant calling
Test data: https://s3.amazonaws.com/parabricks.sample/parabricks_sample.tar.gz
Source of the test data - NVIDIA tutorials <https://docs.nvidia.com/clara/parabricks/4.3.0/tutorials/gettingthesampledata.html>_

Stage 2: Variant calling via HaplotypeCaller

#!/bin/bash

set -euo pipefail

SAMPLE_NAME="pb_sample"
FASTA="Ref/Homo_sapiens_assembly38.fasta"
BAM_IN="${SAMPLE_NAME}_CPU_markdup_BQSR.bam"

## Variant calling with HaplotypeCaller
gatk --java-options  "-XX:-UsePerfData" HaplotypeCaller \
--input ${BAM_IN} \
--output ${BAM_IN}.vcf \
--reference ${FASTA} \
--native-pair-hmm-threads 64