Gene Hoch_1748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1748 
Symbol 
ID8544130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2380653 
End bp2403791 
Gene Length23139 bp 
Protein Length7712 aa 
Translation table11 
GC content71% 
IMG OID646386455 
Productamino acid adenylation domain protein 
Protein accessionYP_003266190 
Protein GI262194981 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01720] non-ribosomal peptide synthase domain TIGR01720
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.970061 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCGA TGTCCACTGA CACCTTCGTC TTTCCCTCGT CCTTCGCGCA GCAGCGGCTC 
TGGTTTCTCG AGCAGATGAA CCCGGGCACC GGCGCCTACC ACATCGCCGG TGCTGTGCGC
ATCGCCGGGC CGCTCGAGCG CGACACCCTA CAGGCCGCGC TCGACGACCT GGTCGCTCGC
CACGAGTCGC TGCGCACCAC CTTCGCGCTC GAGCAAGGCG AGCCCGTGCA GGTCATCGCC
GAAGACGGCC GCGTGTCGCT CACGCGCACG GATCTGCGCG CCGACGGCGT CGCCGTCACC
CCCGAGCTCG TGAACCAGCA GGCGCTCGAC GAAGTGCGCA AGCCCTTCGA TCTCGCGCGC
GGCCCGCTGT TGCGCGTACA CCTGCTGCAG AGCCGCGATA GCGAGGTTCT GCTGCTCCTG
ACCATCCACC ACATCGTCGC CGACGGCTGG TCCATGGATC TGCTGATCCG CGAGCTGTCG
GTGCTGTACA ACGCGCGCCT GAGCGGCGAG CCCGCGGCCC TGGCACCGCT CGACATTCAG
TACGCCGACT ACACCGAGTG GCAACGCGAA TGGCTGGCCA CCCCGGGCGT CCTCGACGAA
CAGATCGACT ACTGGCGCCG CCAGCTCGCG CACGCGCCGG TGCTCCAGCT CCCCGCGGAT
CACGCCCGCC CGCCGGTCCC CACGCACCGC GGCGCGACCC TCCCCGTCGC CGTGTCGCCC
GCGCTCGCAG CCTCCCTGCG TTCGCTCGCC GGTGACGAGG GCGCCACCAT GTTCATGGTC
CTGCTGTCCG GTTTCCAGGC GCTACTCGCG CGCTACACCG GCCAGCGCGA CATTCTGGTC
GGCGCCCCCA TGCACGAGCG CAGCCGCGTC GAGCTCGAGA ACATGATCGG CTGCTGCCTC
AACACCCTGG TGCTGCGCAC CGAGGTCGAC GCCGCCGAGA GCTTCCGCGC GCTGCTCTCG
CGGGTACGCG CGATCACGCT CGACGCCTAC GCCCACCGCG ATCTGCCCTT CGAGCGCCTG
GTCGACGAGC TGCGACCCGA GCGCAATCCC GCCTATACGC CCTATTTTCA GGCCGTATTC
AACTTCCGCC CGGCCGCGCG CACCGAAGCG CATCTCGCCG GCCTGGCCAT CACCCCCGTC
GAGGTGCACA CCGCCTCGGC CAAATTCGAT ATCACCCTCG ATCTTCAGGA CACCGGCGCC
GAGCTCGTCG GCGTCATCGA GTACCGCACC GATCTGTTCG CGACCGCGAC CATCGAAGGA
CTGCGCAACT GCTGGCTCAC GCTGCTCGAA GCCGTGGTCG CGACCCCCGA TTGTCCGGTC
GGCGCCCTGC CGCTGCTGAC CGAACCCCAG CGCGCCGAGC TGCTCGCCCG CGGCTGCGCC
CAAGAGCGCT TCCCCGCCGA GGGCACCCTC CATCAGCGCG TCCTCGCCGC CGCCGCCGCC
GCCCCCGACG CCGTCGCCCT CGTCTGCGGC GACGACTCCC TCACCTACGC CCAGCTCCTC
CGCCGCTCCG CCCAGCTCGC CCATCGCCTC CAGGCCCTCG GCGTCGGCCC CGAGAGCCGC
GTCGGCCTCT GCCTCCAGCG CTCCATCGAC ATGGTCGTCG CCATCCTCGC CACCCTCCAG
GCCGGCGGCG CCTACGTCCC CCTCGACCCC GCCTATCCCC CCGAGCGCAT CGCCCTGCTC
ATCGACGACA GCGGCATGAG CGCCCTCGTC ACCCGTCACC CCGACAGCGA CGCCCTCCCC
GACCACCTCG CCACCGGCGA CCTCCCCTGC GTCCTCCTCG ACCAACACGC CGACCAGCTC
GCCGCCCTCC CCGACGCGCC CCCGCCCTGC GCCGCCGACG CAGACTCCCT CGCCTACATC
ATCTACACCT CGGGCTCCAC CGGCCGGCCC AAGGGCGTCC TCGTCACCCA CCGCAACGTG
TTGCGTTTGT TCGACAGCAC GTCCGAGGAC TTCGCGTTCT CCGCCGACGA CGTCTGGACC
CTCTTCCACT CCTTCTCCTT CGACTTCTCC GTCTGGGAGC TCTGGGGCGC CCTCACCTTC
GGCGCCCGCC TCGTCATCGT CCCCTGGCTC GTCTCCCGCT CCCCCGACGC CTTCGCCCAG
CTCCTCGCGC GCGAGCGCGT CACCGTCCTC AACCAGACAC CGTCCGCTTT CCGCCAGCTC
ATCCACACCG ACTCCCTCAC CCCGCTCCCC GCCCTCCGCT ACGTCATCTT CGGCGGCGAG
GCCCTCGATC CCACCGCTCT CCTCCCCTGG GTCGAGCGCT TTGGCTTCGA CCAGACCACC
TTCGTCAACA TGTACGGAAT CACCGAGACC ACCGTCCATG TTACCTACCG CCCCCTCTCG
CGCGCCGATC TCGATCACCC CTCCAGCCGC ATCGGCCGCC CCTTGCGCGA CCTCGACATC
TACCTCCTCG ACGAGCGCAT GCAGCCCGTC CCCCTCGGCG TCCCCGGCGA GATCTACGTC
GGCGGCCCCG GCCTCGCCCG CGGGTATCTC GGACGCCCCG AGCTCACCGC CCTGCGCTTC
CCCGACCATC CCTTCCGCGA CGGCGAGCGC GTCTACCGCT CCGGCGACCT CGGCCGCTGG
ACCCACGACG GCGACCTCGA ATACCTCGGC CGCAACGACG CGCAGGTCAA GATCCGCGGC
TTCCGCATCG AACTCGGCGA GATCCAGGCC GCCATCGAGG CACACCCCGC CATCCGCAGC
GCCGCCGTCA TCGCCCGCTC CATCGGCTCC GACCAGCGCC GCGCCCTCCT CGCCTATCTC
GTCCCGGCGT CCGACGACAT CCCCTCGGTC GACGCGCTCC GCGCCTTCCT CGCCCAGCGC
CTCCCCGACT ACATGCTCCC GGCCTCCTTT CACTTCCTCG ACGCCTTGCC CCTCACCGTT
CACGGCAAGC TCGACACCCG CGCCCTCCCC GAGGTCGAGT TCGCGGCCGC GGCTACCGAC
GACAGCTACG AGGCCCCGCG CAACATGGTC GAGGAAGTCC TGTGCGCGGT GTGGGCGCAC
GCCCTGAGCG TGCCTCGCGT CGGCATCCGC GACAACTTCT TTGCCCTGGG CGGCGACTCG
ATCCTCAGCA TCCGCGTGGT CACGATGTCC GAGAAGCGCG GGATCCGCTA CACGGCCGCG
CAACTGTTCA GCAATCAGAC CGTAGCCGAG ATCGCCGCGG TGGCCCAGGA CGCACTCGAC
GACGACGATG ACGCGCGCGA AGAGCTGTAC ACCGAGCCCT TTTCACAGGT GTCCGAAGAT
GATCTGGCCC GCCTGCGCGC CGCCCATCCC GACGTCGTCG ACGCCTACCC GCTCGCGCGC
GTGCAGGCCG GCATGCTCTA TCACATGGAG CTGGCGCCAA ACTCGAACAT CTACCACAAC
ACCGATTCGT TCCATCTGCG CAGCCGCATC CCGTTCGACG CGGACTGCTT CCGCGCAGCC
GTACAGGTCG TGGTCGGCCG TCACGCGGTG CTGCGCACCT CGTTCGACAT GGCCTCATAC
AGCGAGCCGC TGCAGCTCGT GCATGAGGAT GTCACCCTCC CCTTCGAGCT GAGCGACCTG
CGCCACCTCG ACGAGGCCGC TCAGGAAGCC GTCATCGAGG AGCTCATCCG CAGCGAGATG
GCCGCGCACT TCGACCTGCG GACACCGCCG CTGCTGCGCT TTTTCGTCCA CCTGCGCAGC
GACGACACCT TCCAGTTCAC GCTCACCGAA TGCCACGCGA TCATCGACGG CTGGAGCCTG
CACAGCGTCC TCGTCGAGAT CTTCAACCAC TACTTCGCGC TCATGCACGG CGGCGAGCTG
CCCGCCTACG AGGCGCCGCG CCTGAGTTAC CGCGATTTCG TGGCGCTCGA GCGCCGCGTG
CTGGCGGCGC CCGCGCATCG TGCCTTCTGG AGCGAACAGC TCGACGGCGC CGCGGCCGTG
CGCCTGCCGC GGCACATGTC GCCGCTGCCC GGACGCGAGC CGGTCATCGC CATGATCTCG
CGGCATATCC CCGAGCAGCT CGAGGCGGGC CTGCGGGCCC GCGCTCAGGC GCTGGCGGTA
CCGCTCAAGA GCGTCCTTTT GGCCGCCCAT CTCAAGCTGC TCAGCAGCGT GAGCGGACGC
GACGACATCA TCACCGGTGT GGCTCTGCAC GGCCGCCCCG AAGTCGAGGA CGGCGCCCGC
CTGCGCGGCC TGTTCCTCAA CACCTTGCCG TTCCGCTTGC GCCTGCGCCC GGGCTCGTGG
GCATCTTTGA TTCAGCAGAC CTTTGCAGCC GAGCGCGCGC TGCATCCTTT TCGCCAGTAT
CCCATGGCCG AGATCCAGCG TGCGCACGGG CGCGAGACGC TGTACGAGGT CATGTTCAAC
TACCTGCACT TCCACGTGCT GCGCGACCTC GCCGGCTCGG TCGCCGACCT CGAGCCGCTC
AGCACGCGCC GCTCCGAAGG CACCAACGTC GCGCTCGCGG TCGACTTCCA GACCGACCCC
TACGATTACA GCCTGGCCCT CGACCTCGAC TACGACAGCA ACCTGTTCCA ACACGAGCAG
ATCGCGGCCA TAGCCGACGC GTACCTGCAC ATCCTGCACT GCCTAGCGTT CGAGGCCGAG
GCCCCGCACG ACGCCTTCTC GGCGCTTCCG GGCAGCGAGC GCGCGCTGCT CCTGCATGAG
TTCAACGAGA CCGCGCGACC GCTGCCGCAG GGCGCTTCGC TGGCGGCCCT GGTCGCCGAA
CACGCCGCGC GTACGCCCGA CGCCATCGCC GCCCGCGATC GCGCCCACAG CCTCAGCTAC
CGCGCCTTGT GCGACCGCGC CATGCGCCTC GGCCACCTGC TGCGCGCGAC CGGCGTCGGC
CCCGGGGCCA CGGTCGGCGT GCTTCTCGAC CGATCGGTCG ATATCCTGGT CGCCATCCTC
GGCGCGCACG CGGCCGGCGC CGCGTACCTG CCGCTCGACC CCGTGTACCC GAGCGAGCGG
CTGCGCTACA TGCTCGACGA CGCGCGCGTG GCCGCGGTGC TCACCGAGGA GGCGCTGCGC
GCCAGCCTGC CGCCGCTGAC CGCGCCGGTG CTGTGTATCG ACGCGCCCGC GCACCACGCC
GCGCTCGAGC AGCAACCCGC GACGCCGCTG CCGATGCCAA GCGGCGACGC GCTGGCCTAC
GTGCTGTACA CCTCGGGCTC GACCGGACAC CCCAAAGGCG CCATGGTCAC CCAGCGCAGC
CTGGCCAACT ACCTGCGCTG GGCCGCCGGG TTCTATCCGC TGAGCGAAGG CTCGGGCGCT
CCCGTGAGCA CCTCGATCGC GTTCGACGGC ACCGTGCCGA GTCTGCTCGG ACCGCTCGCC
GTCGGCAAAT GCGTGCACTT TTTGCCCGAA GAGCAGGACG TGGTGCACCT CGCCGAGGCC
CTGAGCACCG CGGACGAACC CTATAGCCTG ATCAAGCTCA CGCCCGCACA CCTCGAAGCG
CTGCGCCACC GCCTGCCCGA TGACGCACCA GTGCGTGCGC ACGCCTTCGT GCTGGGTGGC
GAAGTTCTCC CGCCCAGCCT GGCCGCGTGG TGGCAACAGC GGGCGCCGCA GGTGCGCATC
TACAACCAGT ACGGCCCCAC CGAGACCACG GTGGCCTGTA CCGCGCACCG CATCGCCGAC
TCGGTGTCCG AAACATTGCC CGTACCCATC GGACGCCCGA TCGCCAACGT CCGCGTGTAC
GTGCTCGACG CCTTTCTCCA GCCGCTGCCC CAGGGCGCCG TCGGCGAGCT GTACGTGGCC
GGCGCGGGCG TGTCCCAGGG TTACTGGGCG CGGCCCAAAC TCACGGCCGA GCGCTTCCTG
CCCGATCCCT TCGCGTCCGA GCACGGCGCC CGCATGTACC GCACCGGCGA TCTGGTGCGC
GTGGGCTGCG ACGGCACGCT CGACTACGTG GGCCGCAGCG ACGCTCAGAT CAAGCTGCGC
GGCTACCGCA TCGAGCTGGG CGAGATCGAG GCCGTGCTGG CCCAGCAACC GGGCGTGCGC
GAAGCTGCCG CGACCGTGCA CCAGAGCGCG CAAGGCCACG GCCAGCTCGT CGGCTATCTG
GTCTTCGACA GCGACAGCGC CGACGACGAC GACAGCGCCG ACGGCGACGC CCGCGCGCAG
CGCCTCGACG AGATGCGCCG AGCGCTGCGC ACGCGGCTGC CTGCGTACAT GGTGCCCGCG
AGCCTGGTCG TGCTCGACGC CCTGCCGCGC ACGCCGAACC GCAAACTCGA GCGCCGCGCG
CTGCCCGCGC CCACCGCCGA GAGCGCGCCC AGCGCCGCGG TGCACGAGGC CCCACGCACC
GCCGTCGAGA AGCTGCTGGC GTCGATCTGG ACCCAGACCC TGGGCGTCGA GGAGCCCGGT
ATCCACGACG ATTTCTTTGC TCTCGGCGGC GACTCCATCC TCAGCATCCA GGTCACCGCG
CGCGCGGGCC GCGCCGGCGT CCGCATCACA CCGCGCCAGG TCTTCGAGCA CCCGACCATC
CACGAGCTGG CCGCGGTTGC CGACCAGTCG TCCGATCTCC TCGGCGACGT CGCTGCCGAG
CAGGGTCCGG TCACCGGCGC GGCGCCGCTC ACGCCGGTGC AAGCGCGCTT CTTCGCTCTC
GACCTGGCGC GTCCACAGCA CTGGAATCAA TCGATGCTGC TGCGCGCGCG CCAGCCGGTG
GACGCCGACG CCCTGGCCGC GGCCGTGCGC GCGGTGCTCG GCCACCACGA CGCCTTGCGC
CTGCGCTTCG AGCGCGGGGA CGACAGCAAC GCGGCCTGGC GCCAGGAGCA CGCGGCGCCC
GTGACCACGG CGCCGCTGTC GCGTTGGGAT CTCGGCGACC TGCCCGCCGA GCAGCGCGAT
GAGGCCTTGC GTACGCGCGC CAGCGAGCTC CAGGCGAGCC TCGATCTCGC CACCGGCCCG
CTGGTGCACG TGGCCCTATT CGACCTTGGC CCCGACGCGG AGCAGCGCAT TCTCATCGCC
GTCCATCACC TGGTGGTCGA CGGCGTGTCC TGGCGCATCC TGCTCGAGGA CCTGCAGAGC
GCCTACGAGC ACGCGGCCGC CGGACGAGCG CCCCAGCTCC CGGCCAAGAC CTCGTCATTC
CAGCAGTGGG CGCGCGCGCT GCGCGACCAC GCCCACAGCC TGGCCGTGCG CCAACAGCTC
GACTACTGGA GCGATCCGAC GCGCGCGCAG CTCGCCGCCC TGCCCGTGGA CCATGGCCAC
GGCAGCAACC GCGAGGCCGA CCACGCCGCG GTCACGATAG AGCTCGACGC CGACACCACC
AGCGCCCTGC TGCAGCAGGC CACCAGCAGC TATCGCGCCC GCGTGGACGA AATCCTGTTG
GCGCCCTTGG TCGCGACTCT GTCCGCCTGG ACGAAATCGC CCACGATCCA GCTCGACCTC
GAGGGCCACG GTCGCGAGGA GGTCGGCACG CCGCTCGATC TCAGCCGCAC GGTGGGCTGG
TTCACGGCCG TCTACCCGGT GCGCGTGGAC CTGGGCGCGG TCGCCGCCCC CGGCTCCGGC
GCCCTCGCTG CCTTGCGCGC GGTCAAAGAA CAGCGCCGCG CCGTGCCCGC CCACGGCCTC
GGCTACGGCC TGCTGCGCTA CCTGGGTGAC GACCAGACGC GCGCCACCCT GGCCGCGCTG
CCGGACTCGC AGGTCGCCTT CAACTACCTC GGTCAGCTCG ACGCCAGCTT CGCCCAGGAC
GCCATGTTCG TACCTGCGGA CGAGTCCGCC GGGCCGGATC GCGATCCCGC AGCGCCGCGC
CGCTACCTGC TCGAGATCAG CGGCTACGTG CTGCAAGGCC GACTGCAGCT CACCTGGAGC
TACAGCGCCG CCGCGCACCA GCGCGCGACC ATCGAACGTC TGGCAGCCGA CTTCACGGCC
GCCCTGCAAA CGCTGATCGC GGAGCGCACG AGCGCGCGTG CGTACGTGCC CTCGGACTTT
CCTCTGGCCG CGCTGTCGCA GCCCGCGCTC GACCAGATCA CCGAGCGCTG GCCGGCACGC
GCAGACGGCA TCGACACCGC CGAGAACCCC ACCATTGAGG ACATCTACCC GCTCTCCGCG
CTGCAGGAGG GGATGCTGTT CCACAGCCTG CAGGCGCCCG GGCGCGGTGA GTATTTCGAA
CAGCTCGCCT GGGAGCTGGG CGCCGAGATC GACGTCGACG CCTTCGAGCG CGCGTGGCAG
AGCACCGTGC AGTGCCATCC CATCCTGCGC ACCGCCTTCG TGTGGCGCGA CGTGCCGCAG
CCCATGCAGG TGGTGCTGTC CGACGCGCCC GTGCACATCA CCCGGCTGGA CTGGAGCGAT
ATCGACCCAT CGTCCCAAGA ACAGCGCCTC CGCGAGCTGC TCGCGGCCGA GCGCGCCCGC
CCCTTCGCGC TCGACGCGGC CCCGCTCATG CGCCTCAGCT ACATCCGCCT GGACGCGGCG
CGCGGCTACT TCGTGTGGAG TCACCACCAT CTGCTGCTCG ACGGCTGGTC GCTGCCGCTG
GTCATCACCC AGACCATGAG CGCCTACCAG AGCATCGCCC AGGGCCGTAC GCCGCAGATG
CCCAGGCCGC GCCCCTACCG CGACTATATC CGCTTTTTGC AGGAGCGCGA CGATGCCGGC
GCCGAGGCGT TCTGGCGCGA ATCGCTGGCC GGGCTGAGCG AACCCGCGCG CCTCGGCCTG
GGCGGCAGCG AGGGCGAAAG CGCCGCTGCG AGCGACGGCG CAGCGCCTCA CCGCGAGCGC
GAACACCACC TCTCGGCCGC GCTCAGCGAG CGTCTGCAAG ACCTCGCGCG CCGCCATCAG
CTCACGCTCA ACACCCTGGC CCAAGGCGCC TGGGCGCTGC TGCTCGGCCG CTACAGCAAC
AGCGATGACG TGGTCTTTGG CGCCACGGTG TCGGGGCGCT CGCCCACGCT GCCCGGCGTC
GAGGACATGG TCGGCCTGTT CATCAACACC GTGCCGGTGC GCGTGCGCTT GCCCGACGAT
GCGCTGCTGC TCGACTGGCT GCATCAGCTC CAGCGCCAGC AGGTGGCCCT GCGCGAATAC
GAACACAGCT CGCTGGTGCG CGTGCAGGGT TGGAGCCAGG TGCCGCGTGA CACCCAGCTC
TTCGACACCC TGCTGGTGTT CGAGAACTAC GCGGTCGCCG GCGGCTTCGA CGAGGTTGGC
GACCAGCTCC AGGCGCGCCT GGTGCACGCG GTCGAGCGCA CAAACTACCC GATGACGCTG
GTGATCGGCA TGGGCGAGCG GCTGACCCTG CGCCTGGCCT ACGACGCACA GCGCTTCAGC
GAACGCGACG CCGAGCAAGT TCTCGGACAC CTGGCCTATA TCCTGAGCGG CATGACCGAG
CACATCGGCC ATACCTTGCA CGAGCTGCCG CTGCTGTCCG ATAACGAGCG CGCCGAGCTG
CTCGCCCGCG GCTGCGCCCA AGAGCGCTTC CCCGCCGAGG GCACCCTCCA TCAGCGCGTC
TTCGACCGCG CCGCCGCCGC CCCCGACGCC GTCGCCCTCG TCTGCGGCGA CGACTCCCTC
ACCTACGCCC AGCTCGTCCG CCGCTCCGCC CAGCTCGCCC ATCGCCTCCA GGCCCTCGGC
GTCGGCCCCG AGAGCCGCGT CGGCCTCTGC CTCCAGCGCT CCATCGACAT GGTCGTCGCC
ATCCTCGCCA CCCTCCAGGC CGGCGGCGCC TACGTCCCCC TCGACCCCGC CTATCCCCCC
GAGCGCATCG CCCTGCTCAT CGACGACAGC GGCATGAGCG CCCTCGTCAC CCGTCACCCC
GACAGCGACG CCCTCCCCGA CCACCTCGCC ACCGGCGACC TCCCCTGCGT CCTCCTCGAC
CAACACGCCG ACCAGCTCGC CGCCCTCCCC GACGCGCCCC CGCCCTGCGC CGCCGACGCA
GACTCCCTCG CCTACATCAT CTACACCTCG GGCTCCACCG GCCGGCCCAA GGGCGTCCTC
GTCACCCACC GCAACGTGTT GCGTTTGTTC GACAGTACGT CCGACGACTT CGCCTTCTCC
GCCGACGACG TCTGGACCCT CTTCCACTCC TTCTCCTTCG ACTTCTCCGT CTGGGAGCTC
TGGGGCGCCC TCACCTTCGG CGCCCGCCTC GTCATCGTCC CCTGGCTCGT CTCCCGCTCC
CCCGACGCCT TCGCCCAGCT CCTCGCGCGC GAGCGCGTCA CCGTCCTCAA CCAGACACCG
TCCGCTTTCC GCCAGCTCAT CCACACCGAC TCCCTCACCC CGCTCCCCGC CCTCCGCTAC
GTCATCTTCG GCGGCGAGGC CCTCGATCCC ACCGCTCTCC TCCCCTGGGT CGAGCGCTTT
GGCTTCGACC AGACCACCTT CGTCAACATG TACGGAATCA CCGAGACCAC CGTCCATGTT
ACCTACCGCC CCCTCTCGCG CGCCGATCTC GATCACCCCT CCAGCCGCAT CGGCCGCCCC
TTGCGCGACC TCGACATCTA CCTCCTCGAC GAGCGCATGC AGCCCGTCCC CCTCGGCGTC
CCCGGCGAGA TCTACGTCGG CGGCCCCGGC CTCGCCCGCG GGTATCTCGG ACGCCCCGAG
CTCACCGCCC TGCGCTTCCC CGACCATCCC TTCCGCGACG ACGAGCGCGT CTACCGCTCC
GGCGACCTCG GCCGCTGGAC CCACGACGGC GACCTCGAAT ACCTCGGCCG CAACGACGCG
CAGGTCAAGA TCCGCGGCTT CCGCATCGAA CTCGGCGAGA TCCAGGCCGC CATCGAGGCA
CACCCCGCCA TCCGCAGCGC CGCCGTCATC GCCCGCTCCA TCGGCTCCGA CCAGCGCCGC
GCCCTCCTCG CCTATCTCGT CCCGACGTCC GACGACATCC CCTCGGTCGA CGCGCTCCGC
GCCTTCCTCG CCCAGCGCCT CCCCGACTAC ATGCTCCCGG CCTCCTTTCA CTTCCTCGAC
GCCTTGCCCC TCACCGTTCA CGGCAAGCTC GACACCCGCG CCCTCCCCGA GGTCGACGCC
ATCGCCGCGC ACGCTCGCAG CTACGAGCCG CCGCGTCCCG GCGCCGAGAG CGACATGGCC
GCGCTGTGGT GCGAGGTGCT CGATGTCGAG CAGGTGGGAC GCAGCGACGA CTTCTTCGCC
CTCGGCGGCC ACTCACTCGT GGCCACACGG CTGCTGTCGC GCATGCGAGC CACCTTCGGC
GACCACGTGG CGCTGGCCGA TATCTTCGCG CATCCCAGGC TGGCAGCGTT GGCCGAGGCC
GTCGGCGCAG CTCACTCGAA CGCGTCGCCG CTGCTGACGC CGCCGCCCCT GGTGGCCCAG
CAGCGCCCCG AGCGCCCGCC GCTGTCCTTC GCCCAACAGC GGCTGTGGTA TCTCGACCAC
CTCGAAGGCA CGGGCGCATA CAATATCTCG TCGCTGCTGC GCCTGAGCGG CGCCGTGGAT
CGCGACAAGC TGGCCCAGGC GCTCAACGCG CTGATACAGC GGCACGAGAT CCTGCGCACC
GTGTTCCCGG CCGAGCAGGG TGTCCCCTAC CAGCGGGTGC TGGAGCACGC TCCGATAGAG
ATCGCGTGCC ACGCGGCCGC GGATGACGAG CTGGTGCGCC AGCAGGCCGC AGCCCTGGCG
GCCGAGCCCT TCGACCTCAG CCACGGGCCG CTGCTGCGCG TTGCCATGTG GCCGGTGCGC
GGTGGCGAGC ACGAGCATCT GCTGCTGCTG GTCGTGCACC ATATTATTTC GGACGGTTGG
TCGATGTCGC GGCTCACGGA CGAATTCACG GCCATCTATC GAGCCGTGCT CGCCGGTCGC
GACCCGGAGC ACGCGGGACT GCCGGCGCTT CCGGTGCAGT ACGCGGACTT CGCCGCCTGG
CAGCGCGCGT GGCTCGACGA TGACACAGTG GCCGCGCAGC TCGCCTTCTG GCGCGACGCC
CTCGCCGATG CACCGCCGGT GCTCGAGCTG CCCACGGACG CGCCGCGCCC CACCGCGCAG
AGCTTCCGCG GCGGCAGCTA CGAGTTCCAG GTGCCAGCGG TGCTCACCGA GCGCCTGCGG
ACGCTGGCCC GCGAGCAGGG AGCCACGCTG TACATGGTGC TGCTCGCCGG CTTCCAGCTT
CTGCTCAGCC GCTACAGCAA CCAGCGCGAC ATCGTCGTCG GCACCCCGGT AGCCGGACGC
GTGGTCGCCG AAACCGAGCC GCTGCTCGGC TGCTTCGTCA ACACCCTGGC CATCCGCGCC
CGCGTCGACC AGGCGTCCAG TTTCCGCGCG CTGCTCGATC AGGTGCGCGC GCACACGCTC
GACGCCTTCG AGCACCAGGC CCTGCCCTTC GAGAAGCTGG TCGACGCGCT CCAGCCCGTG
CGCGAGCTCG GCGTCACGCC GCTGTTCCAG GTCATGTTCG TCCTGCAGAA CGCGCCTGAG
GCCGTGGCCG AGCTGCCCGA GCTGCGCGTG GAACCGGTGC CCTTTGAGCG CAGCAGCGCG
CAGTTCGACC TCACGCTGTC CATGGAGGAA CGCGACGGCG CGCTGTCGGC GCAGGCCATC
TACGCCCGCG ATCTGTTCGC CGCACGCAGC ATCGCCCAGA TGATGGGGCA TCTGTGTCAT
CTGCTCGAGA TCGCGGTCGC CGAGCCCGAT CGCGCCCTCG ACGAGCTGCC GCTGCTGTCC
GATAACGAGC GCGCCGAGCT GCTCGCCCGC GGCTGCGCCC AAGAGCACTT CCCCGCCGAG
GGCACCCTCC ATCAGCGCGT CCTCGACCGC GCCGCCGCCG CCCCCGACGC CGTCGCCCTC
GTCTGCGGCG ACGACTCCCT CACCTACGCC CAGCTCGTCC GCCGCTCCGC CCAGCTCGCC
CATCGCCTCC AGGCCCTCGG CGTCGGCCCC GAGAGCCGCG TCGGCCTCTG CCTCCAGCGC
TCCATCGACA TGGTCGTCGC CATCCTCGCC ACCCTCCAGG CCGGCGGCGC CTACGTCCCC
CTCGACCCCG CCTATCCCCC CGAGCGCATC GCCCTGCTCA TCGACGACAG CGGCATGAGC
GCCCTCGTCA CCCGTCACCC CGACAGCGAC GCCCTCCCCG ACCACCTCGT CACCGGCGAC
CTGCCCTGCG TCCTCCTCGA CCAGCACGCA GGCCAGCTCG CCGCCCTCCC CGACGCGCCC
CCGCCCTGCG CCGCCGACGC GGACTCCCTC GCCTACATCA TCTACACCTC GGGCTCCACC
GGCCGGCCCA AGGGCGTCCT CGTCACCCAC CGCAACGTGT TGCGTTTGTT CGACAGCACG
TCCGACGACT TCGCCTTCTC CGCCGACGAC GTCTGGACCC TCTTCCACTC CTTCTCCTTC
GACTTCTCCG TCTGGGAGCT CTGGGGCGCC CTCACCTTCG GCGCCCGCCT CGTCATCGTC
CCCTGGCTCG TCTCCCGCTC CCCCGACGCC TTCGCCCAGC TCCTCGCGCG CGAGCGCGTC
ACCGTCCTCA ACCAGACACC GTCCGCTTTC CGCCAGCTCA TCCACACCGA CTCCCTCACC
CCGCTTCCCG CCCTTCGCTA CGTCATCTTC GGCGGCGAGG CCCTCGATCC CACCGCTCTC
CTCCCCTGGG TCGAGCGCTT TGGCTTCGAC CAGACCACCT TCGTCAACAT GTACGGAATC
ACCGAGACCA CCGTCCATGT TACCTACCGC CCCCTCTCGC GCGCCGATCT CGATCACCCC
TCCAGCCGCA TCGGCCGCCC CTTGCGCGAC CTCGACATTT ACCTCCTCGA CGAGCGCATG
CAGCCCGTCC CCCTCGGCGT CCCCGGCGAG ATCTACGTCG GCGGCCCCGG CCTCGCCCGC
GGGTATCTCG GACGCCCCGA GCTCACCGCC CTGCGCTTTC CCGACCATCC CTTCCGCGAC
GGCGAGCGCG TCTACCGCTC CGGCGACCTC GGCCGCTGGA CCCACGACGG CGACCTCGAA
TACCTCGGCC GCAACGACGC GCAGGTCAAG ATCCGCGGCT TCCGCATCGA ACTCGGCGAG
ATCCAGGCCG CCATCGAGGC ACACCCCGCC ATCCGCAGCG CCGCCGTCAT CGCCCGCTCC
ATCGGCTCCG ACCAGCGCCG CGCCCTCCTC GCCTATCTCG TCCCGACGTC CGACGACATC
CCCTCGGTCG ACGCGCTCCG CGCCTTCCTC GCCCAGCGCC TCCCCGACTA CATGCTCCCG
GCCTCCTTTC ACTTCCTCGA CGCCTTGCCC CTCACCGTTC ACGGCAAGCT CGACACCCGC
GCCCTCCCCG AGGTCGAGTT CGCGGCCGCG GCTACCGGCG ACAGCTACGA GGCCCCGCGC
TCGGACGCTG AAGCCGCGCT CGCCGATATC TGGAGCACCG TGCTCGGCAT CGAACGGCCG
GGGATCCACG ACGACTTCTT CGCGCTCGGC GGAGACTCGA TCCTCAGCAT CCAGATCATC
GCCCGCGCCA GCCAGATCGG CCTGCACCTC ACGCCGCGCG ACATCTTCGA GCACAGCACC
ATCGCCGCGC AGGCCACGGC CGCCAGTCGC ACGCGCCGCA GCCGCGCCGA ACAGGGCCCG
GTGACCGGGC CGGCGCCGCT CACGCCCATC GAGCACTGGT TCTTCGAGCA GCCGCGCGAG
CGTCGCGGGC ACTGGAACCA GTCGATGCTC CTGCGCGCGC GCCAGCGCAT CGACGGCGAC
GCGCTGGCCG CAGCCGTGCG CGCCATCGCC CGCCACCACG ACGCGCTGCG CCTGCGCTTC
TACGAGCAAG CCGACGGCAC CTGGGAGCAG GTACACGCAG CGCCCAGCCA CGAGGCGCCG
GTGACGCTCA TCGATCTCGT CGCGCTCGGC GCGCTCGCCG CAGAGCCCGC CGAGGCCGAG
GCCGACGCCG ATGACGCCGC AGCGGTGGCC GAGGCCGTGC GCGTGCATGC CGATGAGGTG
CAGCGCAGTC TCGACCTCAG CACCGGACCG CTGCTGCGGG TGGCGCTCTT CGAGCTCGGC
CCGCAGCGCG AGCAGCGCCT GCTCATCGTC ATCCATCACC TGGTGGTCGA CGGCGTGTCC
TGGCGCATCC TGCTCGAGGA TCTACAGCAG GCCTACGCGC GCTGCGCCGC CGGCCGCGAG
CCCGAGCTGC CGGCCAAGAC ATCGTCGTTC CAGCAGTGGT CCCAGGCCCT GCTCGCGCAC
GCGCAGCAGC CTGCCCTGCG CCAGCAGCTC GAGTACTGGA GCGCGCCCGA GCGCGCCCGC
GTGCCGGCGC TGCCGGCCGA TCATCCCCAA GGCGAAAACC GCGAGACCGC CCAGGCCTCG
CTGGCCTTCG CGCTCGACGC GGAGCTCACG CGCGCGCTGC TGCACGAGAC CGCCGCCGCC
TACCGCGCGC GCGTCGACGA GTTGCTGCTC GCGGCGCTGA GCGCGACGCT GTCCGAGTGG
ACGAAATCGA CGCTGGTACA GGTCGATGTC GAGGGCCACG GCCGCGAGGA TATCGATGAC
GAGGTCGATG TCACGCGCAC GGTCGGCTGG TTCACGACCA TCTACCCGGT GCTGCTGTCG
CTCGACCCGG CCGGGGACGC ACTCGCCGAC AACGCCGCGA GCGATGACGA CGCCCTGCTG
GCCATCCTGC GGGCGGTCAA AGAGCAACGC CGCGCGGTGC CCGGCCACGG TCTCGGCTAC
GGTTTACTGC GCTACCTGGG TGACGACGAC GCCCGGGCGG CGCTGCGGGC GCTGCCGGGG
TCGCAGGTGG CGTTCAATTA TCTCGGACAG TTCGACCAAT TGCAGCAAGC GGCCAGCGTA
TTCGCGGTCG CGAACGAACC CACCGGCGCC AACCGCGACC TCAGCGCCCA GCGCCGCTAC
GAGCTAGAAC TCAACGGCTA CGTCGGTGAC GCTCGGCTTC AGCTCACCTG GTTCTACGGC
AGCGAGCGCT ACGAACGCGC GACCATCGAG CGCCTGGCCG CGCGCTTTGT CGAACACCTC
GCCGCGCTGG TCGCGCGCAG CGGCGAGGCC AACGCCTACG TGCCCTCGGA TTTCCCGCTG
GCTATGCTCT CGGCGAGCGC GCTCGAGCGC CTGGCCGGGC GCTGGCCTGC GCTCGAGGAC
GTCTACCCGC TCACGCCGCT GCAAGCCGGT ATGTTGTTTC ACTCGCTGTA CGCTCCCGAG
CGCGGCGAGT ACCTGGGTCA ATTCGCCTGG TATCTGCACG GCCCACTGGA CGCCGACGCC
TTCCAGCGCG CCTGGCAGGC CGTCGTCGAT CTCCACCCGG TGCTGCGCTC GGCCTTCCTG
TGGCAGGATC TCGACGAACC GCAACAGGTC GTGGTCCCGG CCCAGGTGCG CATCGAGCAG
CACGATTGGC GCGACCGAAC GTCCGAAATC AACCCCGCCA TCGACGCGTT CTTGCAGCGC
GATCGCGCGC GCGGCCTCGA CCTCGAGCAG CCCCCGCTCA TGCGCCTCAG CTTGGTTCGC
CTGGACGACG AGAAGAGCTG CTTCGTGTGG ACGCATCACC ATCTCCTGAT CGACGGCTGG
TGTCTGTCGA TCCTCATGGG ACAGGTGGTC ACCGCCTACG AGGCGCTGCG CCGAGGACAC
CCCGCGCAGC TCGAGCGGCC GCGGCCGTAT CGCGACTACA TCGCCTGGCT GCGCGGGCGC
GACCCGGCCG AGGCCGAGGC GTTCTGGCGC GCGGCGCTGG CCGGCTTCAG CGCCCCCAAC
GTGTTGGCCG TGGACCGCGG CGCGCGCTCC GGGCAGGCCT CGCAGCACCG CATCGTGCAC
TTCGAGCTGC CCGAGCCCAC GCGCGAGGCG CTGATCGCCA TGGCGCGCCG CCATCAGCTC
ACGCTCAACA CCCTGGTGCA AGGCGCCTGG GCGCTGCTGG TGGCCCGCTA CAGCGCGGCC
GACGATGTGG TCTTCGGCGC CACGGTCTCG GGCCGTACGC CCGCGCTGGC GGGCGTCGAG
TCGATGATCG GCCTGTTCAT CAACACCCTG CCGGTTCGCG TCCAGATGTC CGAGGACATG
CCGGCCGCGG CCTGGCTGCG CGCGCTCCAG CAGCAGCAGA GCGAGACCCG CGCCTTCGAG
CACACGGCCC TGGTCGACAT CCAGCGCTGG AGCGAGCTCG GCCACGGCGA GCCGCTGTTC
GAGACCCTGC TGGTGTTCGA GAACTACGCG GTCGACGACA GCGCTGGCGC GGTCGAGACC
AGCCTCGACA TCGAGCACTT GCACGCGCAC GAGCGCACCA ACTATCCGCT CGCGCTCACC
GTCGGCCGGC GCCTGGGCAT CGAGCTGGCC TACGATCAGA GCCGCTTCGA CGACGATGTC
GCCGCGCGCC TCCTGCGCCA CTTCGCCTCG CTGCTTGGCC AGCTCGCCGA GGCGCCGCAG
CGGCCGCTGT CGGCGCTGTC GCTGGCCGAT CGCGCCGAAC AGCAGGCGCT GATCGCGAGC
TGGAAGACGA GCGCGCGCGA CTACCCGGCG GCGACGAGCA TGCACGCGCT GGTGGCCGAG
CAAGCCGCGC GCGCGCCCCA GGCCGTGGCC GCGGTGTGGG GCGAGCAGCG CATCAGCTAC
GCGGAGCTGA TGGCCCGCTC CAGCCAGCTC GCCCATTACC TGCGCGCGCG CGGCGTCACC
GCCGACGTCC CTGTGGGCGT GCACGTCGAG CGCTCGCTCG ACCTGGTGGT CGCCGTGCTC
GCGGTGCTGC AGAGCGGCGG CGCCTTCGCC CCGCTCGACA CCGCCCTGCC GCGCGAGCGT
CTGCGCACCA TGATCGCGGG TCTGCGCGCC CCCGTGCTGC TCACGCAGGC GGCGCTGCTG
GCCGACTTTG GCGCCATCGT CGAGGAGGCG GGCGACGACG CGTCCGCCGG CAGCGCGGCG
CCGGCCCTGC TGGTGGCCAT CGACGAGCCG GCCACGCGCG CGGCCATCGC CGCGCTGCCC
GAGACGCCGC CGCCGAGCGA GAGCGAACCC GACCACCTGT CCTATATCAT TCACACCTCG
GGCTCCACGG GGACGCCCAA GGGCGTGATG GTGACGCATC GCAACTGGGT CAACGCGTTC
CACGCCTGGG CCGAGGACTA CCGGCTGGGC ACGGACGCGC GCTGCCACCT GCAGATGGCC
AGCTTCTCGT TCGACGTGTT CGCGGGCGAT TACGCGCGCG CGCTGGCCTC GGGCGGCACC
CTGGTGCTGT GCCCGCGCGA GCTGCTGCTC GACCCGCCGG CCCTGCTCGC GCTGCTGCAA
CGCGAACGCG TGGACTGCGC CGAGTTTGTG CCCGCGGTGT TGCGCGGCCT GGCCCAGCAC
TGCGAGGACA GCGCGCAGAC CCTCGCGGGC ATGCACACGC TCATCGCCGG CTCGGATAGC
TGGCACATGA GCGAGTATCG GCGCTTCCGC GCGCTCATCG GCGCCGACGC CCGGCTGATC
AACTCCTACG GCATCACCGA GACGACGATC GATACCACCT ACTTCGAGGT CACGGCCGAC
GCCGACGTCA GCCCTGCAGG CGAGGGCGAG GGCGAGGGCG ACGAGCGCGG CCTGGTGCCC
ATCGGTCGCC CGTTCGGCAA CAGCCGCGTG TACGTGCTCA GCCGCGATCT CACGCCGCAG
CCGATCGGCG TCCCCGGCGA GGTGTACATC GGTGGCGCCG GCGTGGCGCG CGGCTACCTC
GGCCGCCCCG AGCTCACGGC CGAGCGCTTC GTGCCCGACG CTTTCGGCGA CGAGCCCGGC
GCGCGCATGT ACCGCACCGG CGACCGCGCG TTCTACCGCC CCGACGGCAA CATCGCGTTT
CTCGGACGCG TGGACACACA GGTCAAGGTG CGCGGCTACC GCATCGAGCT GGGCGAGATC
GAATCGGTGC TCGTGCGCCA CCCGGCGGTG CAGCAGTCCG CCGTGCTCCT GCGCAGTGAC
GGCCCCGGGC AGCCGCGCCT GGTCGCCTAC GTGGCCGCAG CCTCGGGCGC GGCGCTCGAG
CTGGTCGAGC TGCGCGCCTT CCTGGGCGAG CGCTTGCCAG ACTACATGGT ACCGGCGTTT
TTCGTGGTGC TCGACGCGCT GCCGCTCACG GCCAACGGCA AGGTCGACCG CCGCAATCTG
CCCGCCGCCG ACGCCAGCCA TCGCGTCGGC GTCGAGGAGC GGGTGGCGCC GCGCGACGAC
ATCGAGGCCG CGATCCTGAG GCTGTGGCAG CAGGTGCTCG CGGTCGACGA GCTCGGCGTG
GGCGATGACT TCTTCGCCGC TGGCGGGCAT TCGCTGCTGG CCACGCAGCT CATCTCGCGG
GTCAACGCGG CCTTTGCCAT CGCCCTGCCG CTGCGCGTGG TGTTCGACGC GCCGACGGTG
GCGGCCATGG CCACCGAGGT GCGCGCGTGC GCTGGCGACG CGCCGTCCGA TATCGCGCCG
CGCGAGCGCA TCGCCGCGCG CGCGCAACAG GGACCGGCGC CGCTGTCCTT TGCCCAGCGC
CGGCTGTGGT TCCTCGACCA GTTCGAGCCC GGCAACCCGG CGTACAACAT CTCGGAGTTC
GTCCACCTGC GCGGTGCGCT CGACGTCGCC GTGCTGCGTC GCAGCCTGAG CGAGGTGGTG
CGCCGCCACG AGGTGCTGCG CACCCGCTTC GCGGCCGCGG GGCCGGACGG CCAGGGCCCG
GTACAGATCA TCGATCCACC GGCGGCCGAG CCGCTCGCGC TGCCGATCAT CGATCTCGGT
CATCTGTCCG GCGACGAGCG CGCGCAGCGC TGCCAGGAGC TGGCGCTCGC GGCTGTGCGC
GCGCCCTTCG ATCTGAGCAC GGGTCCGCTC CTGCGGGTCC AGCTCGTACG CCTGGACGAA
GGCGAGCACG TGCTCTTGCT GGTCATCCAC CACATCGTCT CGGACGGCTG GTCGACCGGC
GTGCTCACGC GCGAACTCGG CGCGCTCTAC CGCGCCTTCT CGCGCGGCGA GGATTCACCG
CTGGCGCCGC TGAGCCTACA GTACGCGGAT TTCTCCGCGT GGCAGCAGGC CTGGCTGGAG
AGCCCGGCGC TGAGCGCGCA GATGGACTAC TGGCGCCAGC AGCTCGCCGG CGACGACGAG
CCTCTGAGCT TGCCGAGCGA TCGCCCGCGT CCAGCGATCC GCACCGACCG CGGCGGCACC
ACCACCTTCG CGATCGAGGC CGAGGTCCTC GCCTCCCTGC GCGCGCTCGG CGCCCGCGAG
CACGCGTCGC TGTTCATGGT CCTGCTGGCT GCGCTCGACG TGCTGCTCGT GCGCTACAGC
CGGCAGGAGG ACATCCGCGT CGGCACCTAT ATCGCCAACC GCAACCGGCC CGAGCTCGAA
GACCTGATCG GCTTCTTCCT CAACACCCTG GTGCTGCGCA GCGATTGCAG CGGCGACCCC
GGCTTCCGCG AGCTGCTCGG CCGGGTGGCA GCGACCACGC TCGAGGCCTA CGCCAACCAG
GATGTGCCCT TCGAGAAGCT GCTCGAGACC CTGCAGCCGA CGCGCGATAT GCGCCATACG
CCGCTGTTCC AGGTGCTGCT GGTGCTGCAA AATACGCCCG CGCCGCAGAG CGAGGACGGC
GCGCTCGAGC TTTTGCCCTA CGAGCTGGCG GGCGAGGCCC ACGCGCACTT CGACCTCACG
CTGTGGGTCA CGGAGAAGGA CGGCGGGCTG CTGGCCACGC TGGAGTACAA CGCCGATCTC
TTCGACCACG CCACGGCAGA GCGCATGGCC GCGCACTATC ACACCCTGCT GCGCGGCATC
GCGGCCAACC CCGAGCGCCG CATCAGCGAG CTGCCGCTGC TGCCCGAGGC CGAGCGCGTG
CCGGTGCTCG AGACCTTCGC CGTGCGCCGC CGCGAGCGCG ACATCGAGCG CGGCATCCAC
ACCTTGTTCG AGGCCCGCGC CGCGGCGACG CCCGAGGCCT GCGCGGTGGT CGATGCCGAG
CAGCGGCTCA GCTACGGGCA GCTCGACGCC CGCGCCAACC AGCTCGCCCA CTACCTGCGC
GCCCGCGGCG TGGTCGCCGA AACCCGGGTC GGCATCTGTC TCGACCGCTC GGTCGAATTA
CTCGTGGCCG TCCTCGGCGT GCTCAAGGCC GGGGCCACCT ACGTGCCCCT GGCGCCCGAC
TATCCGGCCG AGCGGCTGGC CATCATGGCC CATGACAGCG ACATGCGCGC GCTGCTCACC
AGCGCAGATC TGGCCGAGCT GGCCGGCTCG CTGGGGGTAC CCGCGCTGCT GCTCGACCGC
GAGCGCGCCG AGATCGCGGC CCAGCCCACG GCCTCGCCCG CGCTCAGCGT GGCGCCGAGC
AGCCTGGCCT ACGTGGTCTT CACCTCGGGT TCGACCGGTC GTCCGAAAGG CGTGATGATC
GAACACCGCA GCCTGGTCAA CGCCTACTAC GGCTGGGAGG AGGACTACGG GCTGGACGGG
CTGCGCTGCC ATCTGCAGAT GGCCAGCTTC TCCTTTGACG TCTTCGCCGG TGACTGGGTG
CGTGCGCTGG GTTCGGGCGC GGCCCTGGTG CTGTGCCCGC GCGAGACCCT GCTCGACCCC
GCGGCCCTGC ACGCGCTCAT CGAGCGCGAG CGCGTGGACT GCGGCGAGTT CGTGCCCGCG
GTGGTGCGCC TGCTCATGGA GCACCTGCGC GCGCGCGGCG CCACGCTCGA GACCATGCGC
CTGGTCGCGG TCGGCTCGGA CGTCTGGGAC ATGCGCGAAT ACCATCAGCT CGCGGCGCTG
TGTCCCGCGG GTACGCGCGT GCTCAGCTCC TACGGGCTGA GCGAGGCGAG TATCGACAGC
ACCTTCTTCG AGAGCGCGGA GCCGACGCCG AGCGAGCAGG TGGTGCCCAT CGGTCGCCCG
TTCCCCAACA CCGAGGTCTA CGTGCTCGAC GCCCACGGAC AGCCGGCGCC CATCGGCGTG
CCCGGCGAGC TGTTCCTCGG CGGCCCCGGC CTGGCCCGCG GCTACGCCGG TCAGCCCGAG
CTCACGGCCG CGCGCTTCGT CCCCAACCCG ATATCGCGCG AGCCCGGCGC CCGGCTGTAC
CGCAGCGGCG ATCTGGTGCG CTGGATGGCG AGCGGCGACC TGGCCTTTCT CGGACGCACG
GACACGCAGA TCAAAATCCG CGGCCACCGC ATCGAGCCCG ACGAGATCAA GGCCGTGGTG
CTCGAGGATG AGGCCGTGCG CGAGGCGGTG CTGATCGGCC GCGGCGAGGG CGAGAGCAAG
CAGCTCGTGG CCTACCTGAC CCTGAGCGCG CCCGCGGCCA CGAGCGCGGA GGCGGTGCGT
GCGCGACTCG CGGACAAGCT GCCGCCGTTC ATGGTCCCGA GCGCGCTGAT CATCCTCGAG
ACGCTGCCGC TCACGCCCAA CGGCAAGGTC GACCTGCGCG CCCTGCCCGC GCCCAGCGCC
GAGGACCGCG TGGCCGCCGA CGAGCACACG CCGCCGCGCA CGGCCACCGA AGCCCGCCTG
GTCGAAATCT GGGAGCAGGT GCTCGAGCTC GCGCCGGTGG GCGTGTTCGA CGACTTCTTT
CAGCTCGGCG GCCACTCGCT GCTGGCGGTG CGGTTGATGG CCGAGATCCG CGATCGTCTG
GGCCAGTCGC TGCCGCTGGC GACGCTGTTC CAGGGGGCCA CGATCGAACG CCTGGCGCGC
GCGATCGACG GTGGCGCGGC CGGTCCCTGG TCGCCGCTGG TGCTGCTGCA GAGCGGTGAC
GCGTCCACGC CGCTGTTCTG CGTCCCGGGT GCCGGTGGCA ACGTGCTCTA CTTCCGCGAC
CTGGCGCGCT CGCTCGGCGG CGAGCGGCCC ATCTACGGTC TGCAGGCGCG CGGACTCGAC
GGCGTGAGCA CGCCGCACGA CTCGGTGGAA GACATGGCCG CGTGCTATGT CGAAGCGCTC
CGCACGGTGC AACCGCACGG GCCCTACGCG CTGGCCGGCC ACTCCTTCGG GAGCTGGGTG
GCCTTTGAGA TGGCCCAGCA GCTCGTCCGC GCGGGTGAAG AGATCGCGAT CGTGGCCATC
TTCAATACGC CCATCCCGCG CATGACGCCG GGTTCGACGC CGGACTTCGA CGACGCCACC
TGGATGGCCG CGTTGGCCGG CTCGGTCGGC CGCTTCTACG GCGCCGACTT GGGCATAGAC
GCCGAGTCGC TGCGGCCGCT GTCGATGGAC GCGCGCTATC GCCAGCTCAC CGAGCGCCTG
GTCGCGGCCC GCATCCTGCC CGCGGGAGCC GCCGAGGCCA TGGTGCGCGG TCTGGTCCAG
GTGTACAAAG CCGCGTATCT CATCGACTAC GAGCCGGGCG ACGCCACGGC CGTGCCCATC
GCGTTCTTCC GCGCCGACAC CTGGCACGAG GAGGACGGCG AGGTACCGGC CGATCTGTTC
GAGCAGCCGG CTTGGGGCTG GACGCGCTTC GCGAGCGACG ATATCACCAT CGTCCAGGTG
CCCGGCGACC ACATGACCAT GCTGGCGCCG CCGCATGTCG ACCAACTGTC CGAGATTCTT
CGCGCGCTGC TCGGCGAGAG CCTGAGGAAG CGGAGGTGA
 
Protein sequence
MPAMSTDTFV FPSSFAQQRL WFLEQMNPGT GAYHIAGAVR IAGPLERDTL QAALDDLVAR 
HESLRTTFAL EQGEPVQVIA EDGRVSLTRT DLRADGVAVT PELVNQQALD EVRKPFDLAR
GPLLRVHLLQ SRDSEVLLLL TIHHIVADGW SMDLLIRELS VLYNARLSGE PAALAPLDIQ
YADYTEWQRE WLATPGVLDE QIDYWRRQLA HAPVLQLPAD HARPPVPTHR GATLPVAVSP
ALAASLRSLA GDEGATMFMV LLSGFQALLA RYTGQRDILV GAPMHERSRV ELENMIGCCL
NTLVLRTEVD AAESFRALLS RVRAITLDAY AHRDLPFERL VDELRPERNP AYTPYFQAVF
NFRPAARTEA HLAGLAITPV EVHTASAKFD ITLDLQDTGA ELVGVIEYRT DLFATATIEG
LRNCWLTLLE AVVATPDCPV GALPLLTEPQ RAELLARGCA QERFPAEGTL HQRVLAAAAA
APDAVALVCG DDSLTYAQLL RRSAQLAHRL QALGVGPESR VGLCLQRSID MVVAILATLQ
AGGAYVPLDP AYPPERIALL IDDSGMSALV TRHPDSDALP DHLATGDLPC VLLDQHADQL
AALPDAPPPC AADADSLAYI IYTSGSTGRP KGVLVTHRNV LRLFDSTSED FAFSADDVWT
LFHSFSFDFS VWELWGALTF GARLVIVPWL VSRSPDAFAQ LLARERVTVL NQTPSAFRQL
IHTDSLTPLP ALRYVIFGGE ALDPTALLPW VERFGFDQTT FVNMYGITET TVHVTYRPLS
RADLDHPSSR IGRPLRDLDI YLLDERMQPV PLGVPGEIYV GGPGLARGYL GRPELTALRF
PDHPFRDGER VYRSGDLGRW THDGDLEYLG RNDAQVKIRG FRIELGEIQA AIEAHPAIRS
AAVIARSIGS DQRRALLAYL VPASDDIPSV DALRAFLAQR LPDYMLPASF HFLDALPLTV
HGKLDTRALP EVEFAAAATD DSYEAPRNMV EEVLCAVWAH ALSVPRVGIR DNFFALGGDS
ILSIRVVTMS EKRGIRYTAA QLFSNQTVAE IAAVAQDALD DDDDAREELY TEPFSQVSED
DLARLRAAHP DVVDAYPLAR VQAGMLYHME LAPNSNIYHN TDSFHLRSRI PFDADCFRAA
VQVVVGRHAV LRTSFDMASY SEPLQLVHED VTLPFELSDL RHLDEAAQEA VIEELIRSEM
AAHFDLRTPP LLRFFVHLRS DDTFQFTLTE CHAIIDGWSL HSVLVEIFNH YFALMHGGEL
PAYEAPRLSY RDFVALERRV LAAPAHRAFW SEQLDGAAAV RLPRHMSPLP GREPVIAMIS
RHIPEQLEAG LRARAQALAV PLKSVLLAAH LKLLSSVSGR DDIITGVALH GRPEVEDGAR
LRGLFLNTLP FRLRLRPGSW ASLIQQTFAA ERALHPFRQY PMAEIQRAHG RETLYEVMFN
YLHFHVLRDL AGSVADLEPL STRRSEGTNV ALAVDFQTDP YDYSLALDLD YDSNLFQHEQ
IAAIADAYLH ILHCLAFEAE APHDAFSALP GSERALLLHE FNETARPLPQ GASLAALVAE
HAARTPDAIA ARDRAHSLSY RALCDRAMRL GHLLRATGVG PGATVGVLLD RSVDILVAIL
GAHAAGAAYL PLDPVYPSER LRYMLDDARV AAVLTEEALR ASLPPLTAPV LCIDAPAHHA
ALEQQPATPL PMPSGDALAY VLYTSGSTGH PKGAMVTQRS LANYLRWAAG FYPLSEGSGA
PVSTSIAFDG TVPSLLGPLA VGKCVHFLPE EQDVVHLAEA LSTADEPYSL IKLTPAHLEA
LRHRLPDDAP VRAHAFVLGG EVLPPSLAAW WQQRAPQVRI YNQYGPTETT VACTAHRIAD
SVSETLPVPI GRPIANVRVY VLDAFLQPLP QGAVGELYVA GAGVSQGYWA RPKLTAERFL
PDPFASEHGA RMYRTGDLVR VGCDGTLDYV GRSDAQIKLR GYRIELGEIE AVLAQQPGVR
EAAATVHQSA QGHGQLVGYL VFDSDSADDD DSADGDARAQ RLDEMRRALR TRLPAYMVPA
SLVVLDALPR TPNRKLERRA LPAPTAESAP SAAVHEAPRT AVEKLLASIW TQTLGVEEPG
IHDDFFALGG DSILSIQVTA RAGRAGVRIT PRQVFEHPTI HELAAVADQS SDLLGDVAAE
QGPVTGAAPL TPVQARFFAL DLARPQHWNQ SMLLRARQPV DADALAAAVR AVLGHHDALR
LRFERGDDSN AAWRQEHAAP VTTAPLSRWD LGDLPAEQRD EALRTRASEL QASLDLATGP
LVHVALFDLG PDAEQRILIA VHHLVVDGVS WRILLEDLQS AYEHAAAGRA PQLPAKTSSF
QQWARALRDH AHSLAVRQQL DYWSDPTRAQ LAALPVDHGH GSNREADHAA VTIELDADTT
SALLQQATSS YRARVDEILL APLVATLSAW TKSPTIQLDL EGHGREEVGT PLDLSRTVGW
FTAVYPVRVD LGAVAAPGSG ALAALRAVKE QRRAVPAHGL GYGLLRYLGD DQTRATLAAL
PDSQVAFNYL GQLDASFAQD AMFVPADESA GPDRDPAAPR RYLLEISGYV LQGRLQLTWS
YSAAAHQRAT IERLAADFTA ALQTLIAERT SARAYVPSDF PLAALSQPAL DQITERWPAR
ADGIDTAENP TIEDIYPLSA LQEGMLFHSL QAPGRGEYFE QLAWELGAEI DVDAFERAWQ
STVQCHPILR TAFVWRDVPQ PMQVVLSDAP VHITRLDWSD IDPSSQEQRL RELLAAERAR
PFALDAAPLM RLSYIRLDAA RGYFVWSHHH LLLDGWSLPL VITQTMSAYQ SIAQGRTPQM
PRPRPYRDYI RFLQERDDAG AEAFWRESLA GLSEPARLGL GGSEGESAAA SDGAAPHRER
EHHLSAALSE RLQDLARRHQ LTLNTLAQGA WALLLGRYSN SDDVVFGATV SGRSPTLPGV
EDMVGLFINT VPVRVRLPDD ALLLDWLHQL QRQQVALREY EHSSLVRVQG WSQVPRDTQL
FDTLLVFENY AVAGGFDEVG DQLQARLVHA VERTNYPMTL VIGMGERLTL RLAYDAQRFS
ERDAEQVLGH LAYILSGMTE HIGHTLHELP LLSDNERAEL LARGCAQERF PAEGTLHQRV
FDRAAAAPDA VALVCGDDSL TYAQLVRRSA QLAHRLQALG VGPESRVGLC LQRSIDMVVA
ILATLQAGGA YVPLDPAYPP ERIALLIDDS GMSALVTRHP DSDALPDHLA TGDLPCVLLD
QHADQLAALP DAPPPCAADA DSLAYIIYTS GSTGRPKGVL VTHRNVLRLF DSTSDDFAFS
ADDVWTLFHS FSFDFSVWEL WGALTFGARL VIVPWLVSRS PDAFAQLLAR ERVTVLNQTP
SAFRQLIHTD SLTPLPALRY VIFGGEALDP TALLPWVERF GFDQTTFVNM YGITETTVHV
TYRPLSRADL DHPSSRIGRP LRDLDIYLLD ERMQPVPLGV PGEIYVGGPG LARGYLGRPE
LTALRFPDHP FRDDERVYRS GDLGRWTHDG DLEYLGRNDA QVKIRGFRIE LGEIQAAIEA
HPAIRSAAVI ARSIGSDQRR ALLAYLVPTS DDIPSVDALR AFLAQRLPDY MLPASFHFLD
ALPLTVHGKL DTRALPEVDA IAAHARSYEP PRPGAESDMA ALWCEVLDVE QVGRSDDFFA
LGGHSLVATR LLSRMRATFG DHVALADIFA HPRLAALAEA VGAAHSNASP LLTPPPLVAQ
QRPERPPLSF AQQRLWYLDH LEGTGAYNIS SLLRLSGAVD RDKLAQALNA LIQRHEILRT
VFPAEQGVPY QRVLEHAPIE IACHAAADDE LVRQQAAALA AEPFDLSHGP LLRVAMWPVR
GGEHEHLLLL VVHHIISDGW SMSRLTDEFT AIYRAVLAGR DPEHAGLPAL PVQYADFAAW
QRAWLDDDTV AAQLAFWRDA LADAPPVLEL PTDAPRPTAQ SFRGGSYEFQ VPAVLTERLR
TLAREQGATL YMVLLAGFQL LLSRYSNQRD IVVGTPVAGR VVAETEPLLG CFVNTLAIRA
RVDQASSFRA LLDQVRAHTL DAFEHQALPF EKLVDALQPV RELGVTPLFQ VMFVLQNAPE
AVAELPELRV EPVPFERSSA QFDLTLSMEE RDGALSAQAI YARDLFAARS IAQMMGHLCH
LLEIAVAEPD RALDELPLLS DNERAELLAR GCAQEHFPAE GTLHQRVLDR AAAAPDAVAL
VCGDDSLTYA QLVRRSAQLA HRLQALGVGP ESRVGLCLQR SIDMVVAILA TLQAGGAYVP
LDPAYPPERI ALLIDDSGMS ALVTRHPDSD ALPDHLVTGD LPCVLLDQHA GQLAALPDAP
PPCAADADSL AYIIYTSGST GRPKGVLVTH RNVLRLFDST SDDFAFSADD VWTLFHSFSF
DFSVWELWGA LTFGARLVIV PWLVSRSPDA FAQLLARERV TVLNQTPSAF RQLIHTDSLT
PLPALRYVIF GGEALDPTAL LPWVERFGFD QTTFVNMYGI TETTVHVTYR PLSRADLDHP
SSRIGRPLRD LDIYLLDERM QPVPLGVPGE IYVGGPGLAR GYLGRPELTA LRFPDHPFRD
GERVYRSGDL GRWTHDGDLE YLGRNDAQVK IRGFRIELGE IQAAIEAHPA IRSAAVIARS
IGSDQRRALL AYLVPTSDDI PSVDALRAFL AQRLPDYMLP ASFHFLDALP LTVHGKLDTR
ALPEVEFAAA ATGDSYEAPR SDAEAALADI WSTVLGIERP GIHDDFFALG GDSILSIQII
ARASQIGLHL TPRDIFEHST IAAQATAASR TRRSRAEQGP VTGPAPLTPI EHWFFEQPRE
RRGHWNQSML LRARQRIDGD ALAAAVRAIA RHHDALRLRF YEQADGTWEQ VHAAPSHEAP
VTLIDLVALG ALAAEPAEAE ADADDAAAVA EAVRVHADEV QRSLDLSTGP LLRVALFELG
PQREQRLLIV IHHLVVDGVS WRILLEDLQQ AYARCAAGRE PELPAKTSSF QQWSQALLAH
AQQPALRQQL EYWSAPERAR VPALPADHPQ GENRETAQAS LAFALDAELT RALLHETAAA
YRARVDELLL AALSATLSEW TKSTLVQVDV EGHGREDIDD EVDVTRTVGW FTTIYPVLLS
LDPAGDALAD NAASDDDALL AILRAVKEQR RAVPGHGLGY GLLRYLGDDD ARAALRALPG
SQVAFNYLGQ FDQLQQAASV FAVANEPTGA NRDLSAQRRY ELELNGYVGD ARLQLTWFYG
SERYERATIE RLAARFVEHL AALVARSGEA NAYVPSDFPL AMLSASALER LAGRWPALED
VYPLTPLQAG MLFHSLYAPE RGEYLGQFAW YLHGPLDADA FQRAWQAVVD LHPVLRSAFL
WQDLDEPQQV VVPAQVRIEQ HDWRDRTSEI NPAIDAFLQR DRARGLDLEQ PPLMRLSLVR
LDDEKSCFVW THHHLLIDGW CLSILMGQVV TAYEALRRGH PAQLERPRPY RDYIAWLRGR
DPAEAEAFWR AALAGFSAPN VLAVDRGARS GQASQHRIVH FELPEPTREA LIAMARRHQL
TLNTLVQGAW ALLVARYSAA DDVVFGATVS GRTPALAGVE SMIGLFINTL PVRVQMSEDM
PAAAWLRALQ QQQSETRAFE HTALVDIQRW SELGHGEPLF ETLLVFENYA VDDSAGAVET
SLDIEHLHAH ERTNYPLALT VGRRLGIELA YDQSRFDDDV AARLLRHFAS LLGQLAEAPQ
RPLSALSLAD RAEQQALIAS WKTSARDYPA ATSMHALVAE QAARAPQAVA AVWGEQRISY
AELMARSSQL AHYLRARGVT ADVPVGVHVE RSLDLVVAVL AVLQSGGAFA PLDTALPRER
LRTMIAGLRA PVLLTQAALL ADFGAIVEEA GDDASAGSAA PALLVAIDEP ATRAAIAALP
ETPPPSESEP DHLSYIIHTS GSTGTPKGVM VTHRNWVNAF HAWAEDYRLG TDARCHLQMA
SFSFDVFAGD YARALASGGT LVLCPRELLL DPPALLALLQ RERVDCAEFV PAVLRGLAQH
CEDSAQTLAG MHTLIAGSDS WHMSEYRRFR ALIGADARLI NSYGITETTI DTTYFEVTAD
ADVSPAGEGE GEGDERGLVP IGRPFGNSRV YVLSRDLTPQ PIGVPGEVYI GGAGVARGYL
GRPELTAERF VPDAFGDEPG ARMYRTGDRA FYRPDGNIAF LGRVDTQVKV RGYRIELGEI
ESVLVRHPAV QQSAVLLRSD GPGQPRLVAY VAAASGAALE LVELRAFLGE RLPDYMVPAF
FVVLDALPLT ANGKVDRRNL PAADASHRVG VEERVAPRDD IEAAILRLWQ QVLAVDELGV
GDDFFAAGGH SLLATQLISR VNAAFAIALP LRVVFDAPTV AAMATEVRAC AGDAPSDIAP
RERIAARAQQ GPAPLSFAQR RLWFLDQFEP GNPAYNISEF VHLRGALDVA VLRRSLSEVV
RRHEVLRTRF AAAGPDGQGP VQIIDPPAAE PLALPIIDLG HLSGDERAQR CQELALAAVR
APFDLSTGPL LRVQLVRLDE GEHVLLLVIH HIVSDGWSTG VLTRELGALY RAFSRGEDSP
LAPLSLQYAD FSAWQQAWLE SPALSAQMDY WRQQLAGDDE PLSLPSDRPR PAIRTDRGGT
TTFAIEAEVL ASLRALGARE HASLFMVLLA ALDVLLVRYS RQEDIRVGTY IANRNRPELE
DLIGFFLNTL VLRSDCSGDP GFRELLGRVA ATTLEAYANQ DVPFEKLLET LQPTRDMRHT
PLFQVLLVLQ NTPAPQSEDG ALELLPYELA GEAHAHFDLT LWVTEKDGGL LATLEYNADL
FDHATAERMA AHYHTLLRGI AANPERRISE LPLLPEAERV PVLETFAVRR RERDIERGIH
TLFEARAAAT PEACAVVDAE QRLSYGQLDA RANQLAHYLR ARGVVAETRV GICLDRSVEL
LVAVLGVLKA GATYVPLAPD YPAERLAIMA HDSDMRALLT SADLAELAGS LGVPALLLDR
ERAEIAAQPT ASPALSVAPS SLAYVVFTSG STGRPKGVMI EHRSLVNAYY GWEEDYGLDG
LRCHLQMASF SFDVFAGDWV RALGSGAALV LCPRETLLDP AALHALIERE RVDCGEFVPA
VVRLLMEHLR ARGATLETMR LVAVGSDVWD MREYHQLAAL CPAGTRVLSS YGLSEASIDS
TFFESAEPTP SEQVVPIGRP FPNTEVYVLD AHGQPAPIGV PGELFLGGPG LARGYAGQPE
LTAARFVPNP ISREPGARLY RSGDLVRWMA SGDLAFLGRT DTQIKIRGHR IEPDEIKAVV
LEDEAVREAV LIGRGEGESK QLVAYLTLSA PAATSAEAVR ARLADKLPPF MVPSALIILE
TLPLTPNGKV DLRALPAPSA EDRVAADEHT PPRTATEARL VEIWEQVLEL APVGVFDDFF
QLGGHSLLAV RLMAEIRDRL GQSLPLATLF QGATIERLAR AIDGGAAGPW SPLVLLQSGD
ASTPLFCVPG AGGNVLYFRD LARSLGGERP IYGLQARGLD GVSTPHDSVE DMAACYVEAL
RTVQPHGPYA LAGHSFGSWV AFEMAQQLVR AGEEIAIVAI FNTPIPRMTP GSTPDFDDAT
WMAALAGSVG RFYGADLGID AESLRPLSMD ARYRQLTERL VAARILPAGA AEAMVRGLVQ
VYKAAYLIDY EPGDATAVPI AFFRADTWHE EDGEVPADLF EQPAWGWTRF ASDDITIVQV
PGDHMTMLAP PHVDQLSEIL RALLGESLRK RR