Human vs Zebra mbuna LastZ results

Human (Homo sapiens, GRCh38) and Zebra mbuna (Maylandia zebra, M_zebra_UMD2a) were aligned using the LastZ alignment algorithm (LastZ) in Ensembl release 94. Human was used as the reference species. After running LastZ, the raw LastZ alignment blocks were chained according to their location in both genomes. During the final netting process, the best sub-chain was chosen in each region on the reference species.

Configuration parameters

ParameterValue
Gap open penalty (O)400
Gap extend penalty (E)30
HSP threshold (K)3000
Threshold for gapped extension (L)3000
Threshold for alignments between gapped alignment blocks (H)2200
Seed and Transition value (T)1
Scoring matrix (Q)
Default:
     A    C    G    T
    91 -114  -31 -123
  -114  100 -125  -31
   -31 -125  100 -114
  -123  -31 -114   91

Chunking parameters

ParameterHumanZebra mbuna
Chunk size30,000,00010,100,000
Overlap0100,000
Group set size10,100,000
Masking options

Statistics over 264,580 alignment blocks

Genome coverage (bp) Coding exon coverage (bp)
Human

Uncovered: 3,066,216,775 out of 3,099,750,718
Covered: 33,533,943 out of 3,099,750,718

Uncovered: 17,679,883 out of 38,571,754
Matches: 14,329,618 out of 38,571,754
Mismatches: 6,200,120 out of 38,571,754
Insertions: 362,133 out of 38,571,754
Identity over aligned base-pairs: 68.6%

Zebra mbuna

Uncovered: 923,132,095 out of 957,485,262
Covered: 34,353,167 out of 957,485,262

Uncovered: 17,637,643 out of 42,396,503
Matches: 17,012,824 out of 42,396,503
Mismatches: 7,292,879 out of 42,396,503
Insertions: 453,157 out of 42,396,503
Identity over aligned base-pairs: 68.7%

Block size distribution

Size range All 264,580 alignment blocks Blocks grouped in nets
# blocks Total size (incl. gaps) # nets Total size (incl. gaps)
1 bp - 10 bp
1,719
7.4 kb
10 bp - 100 bp
54,103
4.1 Mb
16,356
1.2 Mb
100 bp - 1 kb
207,088
41.1 Mb
84,932
25.5 Mb
1 kb - 10 kb
1,666
2.5 Mb
10,048
19.6 Mb
10 kb - 100 kb
4
59.6 kb
79
1.1 Mb
100 kb - 1 Mb
2
265.0 kb
2
218.2 kb