Human vs C.intestinalis LastZ results

Human (Homo sapiens, GRCh38) and C.intestinalis (Ciona intestinalis, KH) were aligned using the LastZ alignment algorithm (LastZ) in Ensembl release 76. Human was used as the reference species. After running LastZ, the raw LastZ alignment blocks were chained according to their location in both genomes. During the final netting process, the best sub-chain was chosen in each region on the reference species.

Configuration parameters

ParameterValue
Gap open penalty (O)400
Gap extend penalty (E)30
HSP threshold (K)3000
Threshold for gapped extension (L)3000
Threshold for alignments between gapped alignment blocks (H)2200
Seed and Transition value (T)1
Scoring matrix (Q)
Default:
     A    C    G    T
    91 -114  -31 -123
  -114  100 -125  -31
   -31 -125  100 -114
  -123  -31 -114   91

Chunking parameters

ParameterHumanC.intestinalis
Chunk size30,000,00010,100,000
Overlap0100,000
Group set size10,100,000
Masking options{default_soft_masking => 1}

Statistics over 62,205 alignment blocks

Genome coverage (bp) Coding exon coverage (bp)
Human

Uncovered: 3,091,009,164 out of 3,099,750,718
Covered: 8,741,554 out of 3,099,750,718

Uncovered: 33,303,089 out of 38,571,754
Matches: 3,481,527 out of 38,571,754
Mismatches: 1,741,125 out of 38,571,754
Insertions: 46,013 out of 38,571,754
Identity over aligned base-pairs: 66.1%

C.intestinalis

Uncovered: 110,963,978 out of 115,227,500
Covered: 4,263,522 out of 115,227,500

Uncovered: 12,960,220 out of 16,116,563
Matches: 2,059,100 out of 16,116,563
Mismatches: 1,057,997 out of 16,116,563
Insertions: 39,246 out of 16,116,563
Identity over aligned base-pairs: 65.2%

Block size distribution

Size range All 62,205 alignment blocks Blocks grouped in nets
# blocks Total size (incl. gaps) # nets Total size (incl. gaps)
1 bp - 10 bp
237
1.1 kb
10 bp - 100 bp
17,678
1.3 Mb
2,267
171.8 kb
100 bp - 1 kb
44,108
7.7 Mb
15,327
5.6 Mb
1 kb - 10 kb
180
227.7 kb
2,090
3.4 Mb
10 kb - 100 kb
2
31.6 kb
6
111.0 kb