Benchmarking¶
This section contains quality control plots from the unit testing.
Position sampling¶
The following section looks at the position sampling algorithms.
Segment sampling algorithms¶
The following plots benchmark the segment sampling behaviour of the various segment sampling algorithms implemented in GAT.
Statistics¶
For 1-sized fragments (i.e. SNPs), the statistics can be checked against a hypergeometric distribution (sampling without replacement). All the tests below use a single continuous workspace of 1000 nucleotides seeded with a varying number of SNPs.

Test with a single SNP. Here, there are no issues with multiple hits. The workspace contains a single annotation of increasing size (1,3,5,...,99)
Statistics¶
Gat¶
SNPs¶
For 1-sized fragments (i.e. SNPs), the statistics can be checked against a hypergeometric distribution (sampling without replacement). All the tests below use a single continuous workspace of 1000 nucleotides seeded with a varying number of SNPs.

Test with a single SNP. Here, there are no issues with multiple hits. The workspace contains a single annotation of increasing size (1,3,5,...,99)

In this test, 10 SNPs are in the segment list. The workspace contains a single annotation of size (10, 15, ..., 105). All SNPs overlap the annotated part of the workspace and hence all results are highly signficant.
Intervals¶
Annotator¶
SNPs¶
For 1-sized fragments (i.e. SNPs), the statistics can be checked against a hypergeometric distribution (sampling without replacement). All the tests below use a single continuous workspace of 1000 nucleotides seeded with a varying number of SNPs.

Test with a single SNP. Here, there are no issues with multiple hits. The workspace contains a single annotation of increasing size (1,3,5,...,99)

In this test, 10 SNPs are in the segment list. The workspace contains a single annotation of size (10, 15, ..., 105). All SNPs overlap the annotated part of the workspace and hence all results are highly signficant.