## sscocaller: Calling crossovers from single-sperm DNA sequencing reads
`sscocaller` processes DNA reads from each single sperm in the aligned and sorted BAM file for detecting crossovers by identifying haplotype shifts. It takes the large bam file which contains aligned DNA reads from a list of single sperm cells and summarizes allele counts for the provided informative SNP markers. A HMM model is applied for haplotyping each sperm and viterbi algorithm is run for deriving the inferred haplotype sequence against the list of SNP markers.
`sscocaller` processes DNA reads from each single sperm in the aligned and
sorted BAM file for inferring the haplotypes of single sperm genomes that can
later be used for calling crossovers by identifying haplotype shifts (see [comapr](https://github.com/ruqianl/comapr)).
It takes the large bam file which contains all aligned DNA reads from sperm cells and
summarizes allele counts for the provided informative SNP markers. While counting
the alleles, the Viterbi algorithm is implemented for finding the haplotype
sequence for the list of SNP markers.

## Inputs
- Bam, sorted and index bam file which contains DNA reads of single sperm cells with CB tag, eg. from single-cell alignment pipeline (cellranger)
- VCF, variant calling file that contains the list of informative SNPs provided
- VCF, variant call file that contains the list of informative SNPs
- barcodeFile, the list of cell barcodes of the sperm cells
...
...
@@ -19,31 +25,35 @@
* {sample}_chr1_totalCount.mtx, a sparse mtx with entries representing total allele counts
* {sample}_chr1_vi.mtx, a sparse mtx with entries representing inferred viterbi state (haplotype state)
* {sample}_chr1_snpAnnot.txt, the SNP positions and allele
* {sample}_chr1_SegInfo.txt, statistics of viterbi state segments in text file format. It contains consecutive viterbi states for each chromosome with statistics including, starting SNP position, ending SNP position, the number of SNPs supporting the segment, the log likelihood ratio of the viterbi segment and the inferred hidden state.
* {sample}_chr1_viSegInfo.txt, statistics of viterbi state segments in text file format. It contains consecutive viterbi states for each chromosome with statistics including, starting SNP position, ending SNP position, the number of SNPs supporting the segment, the log likelihood ratio of the viterbi segment and the inferred hidden state.