Gene Franean1_5940 details

Gene Information

Locus tag: Franean1_5940
Symbol:
ID: 5674261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7217110
End bp: 7238109
Gene Length: 21000 bp
Protein Length: 6999 aa
Translation table: 11
GC content: 76%
IMG OID: 641244788
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001510190
Protein GI: 158317682
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01720] non-ribosomal peptide synthase domain TIGR01720
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
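
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 7217110
END_BP = 7238109
GENE_LENGTH_BP = 21000
PROTEIN_LENGTH_AA = 6999


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of translated codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons hold for the values listed in the record.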


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGTTCC TGCAGCAGCT CGACCCGTCC AGCGCCGCGT ACAACGTCGC CGCCTCCTTC 
GACCTCACCG GTCCGGTCGA CGCCGACGCG CTGCACGCGG CCCTGCGCGC CGTCGCGCGC
GGTCACGAGA TCCTGCGCAC GACCTACCGG GTGCCCGCCG GTGGCCGGCC CGAGCAGGTC
GTCCACGCCG AGCTGGACCC GCAGTGGTCG AGCCACGACG TCTCCGGGGA GCCCTCGGCC
AGCCGGGAGC GGCTCGCCGA CGAGATTGCG CGGCACGCGG CCCGCCGTCC CTTCGACCTC
ACCAGCTCCT CGCCGCTGCG GGTGACGCTG ATCAGGCTCG CCGCCGACCG GTTCGTCCTC
GTGGTGGTCG CCCATCACGT GGCCTGGGAC GACGCGTCCT GGGAGATCTT CTTCCGGGAG
CTGTTCGCCC ACTACGACAA CCTCCGGGCC GGCGACGGCC CGCTGCCCGC CGACGAGCGG
ACGGCGCAGT ACATCGACGT GGCCGGGCAA CCCAGGGGCG CTGACGCGCA GGACCCCGAC
CCGGCGGCGC TGGACTACTG GCGCGGCCAG CTCACCGCGC TGCCCGAGCC GCTGCGGCTC
CCCGGCGGCG AGCCCGTGGC GGCCGCCACC GCCGCCGGGG CCGACCCGGG CAAACCGGCC
GACCCGGGCA AACTGGCCGA CCCGGGCAAA CTGGCCGACC CGGGCGGCCA GGTCGTCCAC
CAGGCGGCGG CCGGGCTCGG GCGGCGGGTG CGCGCGTTCG CGGCCGAGCA CGGCGCCACG
CCCTTCATGG TGCTGCTGGC GGCCTTCGCC GCGGTGCTGC ACCGGCACAC CGGGGCCACC
GACATCGTGG TCGGCTCTCC CGTGGTCAAC CGGGACCAGC CCGGCGCCGA CCGCCTCATC
GGCTATTTCG GCAACACGGT CGCGCTGCGG GTGCGGGTCC GCCCCGGCGA CTCCTTCCGG
GACCTGCTCG CGCGGACCAG CGAGACCTGC CGCGGCGCCT TCGCCCACCA GCACGTCGAC
CTGGAGGACG TCGTCCGCGC GGTCAACCCG GACCGCTCCG ACGGCGTCTC GAGCGTGTTC
ACCACCCTGT TCGCCGTGCG GACGGCCGCT GCCGAGCACC TCGGCGCGCG CGAGCTGAGC
GGCGCGCGCC GGCCGCTGCA CACCGGTGAC GCCCAGTTCC CGCTGGGAGT CACCGTGGAG
ATCGGGGCCG ACCAGCTCGA TCTGGAGGCC GGGTTCCTCA CCGCCGCGGT CACCGCCGGC
TCGGCCGGCG CCGTGCTCCG CCACCTCGAG ACGTTCCTCG ACCATGCGAC GCGCCATCCG
CGGCGGCCGC TCGCCGCCGT CGACCTGCTC ACCGATTCCG AGCGCCGGCG GACGCTGGTC
GACTGGAACA ACACCGCGGC GCCGAGCCCC GACGCGACCT GGGCCGAGCT GCTCGCGCGG
CGCGCCGCGC GCGCTCCCGG GCACGCCGCC GTCGTCACGT CCGGCGGCAC CCTGACCTAC
GGCGAGCTGG TCGGACGGGC CGACGCGCTC GCCTACCAGC TGCGCGGCCT GGGCACCGGC
CCGGGCGCCA TCGTCGCGCT GGCGCTCCCG CGCACGCTCG ACCTCGTCGT GGCCCTGGCG
GGGGTGACCC GCGCCGGCGC CGCCTACCTG CCGGTGGACC CGGGGTACCC GGCCGACCGG
ATCACGCTCG TCCTCGAGGA CGCCGCGCCC TCGCTGCTCA TCACCACCCG CGAGCTCGCT
GCGACGCTGC CCATCCCCGC CGGCGTCACC GTCGTCCTCG TCAACGGGCC GGCCGCCGGG
CCCGAGGCCG GGACCGAGTT CGGGACCGAG TTCGGGTCGG TGCCCGAACC GCCGGCCGCG
GCCGCGGTCC CGGCGGAGCT GGGAAGCGCC GCCTACGTCA TCTACACGTC CGGCTCGACC
GGGCGTCCCA AGGGCGTGGT GGTGCCGCAC CGCGCGCTGA CGAACTTCCT GGCCAGCATG
GTCGACCAGT TCCAGCTGGG TGACGGCGAC CGGGTGCTGG CACTGACGAC GCTGTCGTTC
GACATCGCGG CGCTGGAACT GCTCGCCCCG CTCACCGCCG GCGCCACCGT CCACCTGGCC
GGCCAGGACG AGGCCCGTGA CCCGACCGCG CTGGCGGCGC TGCTGGCGGG CGGCGGGATC
ACCGTCGCGC AGGCCACCCC CTCGATGTGG CAGTCGATCC TGGCGGCGAG CGACGACCGC
TTCCCCGGGG TGCGCGTGCT CAGCGGCGGA GAGGGCCTGC CGCCGGCGGT GGCGGCCACG
CTCGCCGAAC GGGCCGATGA GGTCGTGAAC CTGTACGGGC CGACCGAGAC GACGATCTGG
TCGACGGCCG GCCCGGTCGC CGGCGACGGA CGGCCGACCG TGGGCCGTCC GATCCGCTCG
ACCCAGGTCT ACCTGCTGGA CACCGCGCTG GCCCCGGTCG CCCCCGGCAT GCCCGGCGAG
CTGTACATCG GCGGGGCGGG CGTCGCGGAC GGCTACCTGC GCCGGCCCGG CCTGACCGCG
TCCCGCTTCG TCGCCGACCC CTTCAGCGCC GGGGGGCGGC TCTACCGCAC CGGCGACCTG
GCGCGCTGGA CCGCCGACGG CGAGCTGGAC GTCCTCGGCC GGGTCGACGA CCAGGTCAAG
GTGCGCGGGT TCCGGATCGA GCCGGGCGAG GTCGAGGCGG TGCTCGGGCG GCATCCCGCG
GTGGCGCGCT GCGCGGTGGT GGCCCGGGAC GACGGCCCGG CGGGGCGGCA CCTCGTCGCC
TACGTGGTGC CGGCCGCGCC GGGGACCGTG GTCGACCCGG CGCTGCTGCG TGAGCACCTG
GCGGCGGCCC TGCCGGAGCA CATGGTGCCG GGAGCGTTCG TCCAGCTGGC GGCCCTGCCG
ACCACGCCGA ACGGGAAGCT TGACCGGCGC GGGCTACCCC GGCCGGACTT CGCCGCGGCG
GCCGGGTCGC GGGCGCCGGC CACCGCCGCC GAGTCGCTGC TCTGCGATCT GTTCGGGACG
GTTCTGGGTG TCGAGCGGGT CGGTGCCGAC GACTCCTTCT TCCACCTCGG CGGCGACAGC
ATCCTGGCGT TCCAGGTGGT CGCCGGTGCC CGCGCCGCCG GCCTGGCCTG CGTGCTGCCG
GACATCTTCC GCCACCCCAC CCCGGCGTCC CTCGCCGCCG CGGCGGAACA GGCCACTGTG
CCACCGCCCG CCGACACAGC CCTCGACGCG GATCTCGCCG GGGTGAGCGC CGTCGACCTG
GAGCGCTGGC GGGCCCGGTA CCCGGATCTG ACCGGTGTCC GGCCCGTGTC GCCGCTGCAG
GCCGGCCTGG TCTTCCACTC GCTGCTCGGC GACGGGTCCG GGACGGTTCC CGGCGCGGCC
GGTTCCGACG CCTACATCCT GCAGTTCGTC GTCGACCTGC GTGGACGGCT CGACCCGGCC
CGCCTGCGCG CGGCGGGCCG GGCCCTGTTC GAGCGGCACG AGAACCTGCG CACCGCCTTC
CTGTACGACG GCGACGCGCC CGTCCAGATC GTGCTGGCCG ATCTCGCGCC GGCCGCCTTC
GACGAGGTGT GGACGGAGGT CGACCTCGCG GGCTGGCCGC GGGCCGAGGC GCTGGCGGAG
GCCGAGCGGC TGGGACGGGC CGACCGGCAG ACCCCCTTCG ACCTCACGGC GCCGCCGCTG
GTGCGCTTCC TGCTGCTGCG CACCGGCCCG GACGCCTGGC GGCTCGTGCT GAGCCCGCAC
CACGTGGTGC TCGACGGCTG GTCCGTCCCG CTGCTGCTGC GTGAGCTGTT CCGCCGGTAC
GAGCTCGCAC AGTCACAGTC ACAGGCACAG GCGGACGGGC CGGCCGCCGC GCTGCCGCCG
GCGCCGTCCT ACTGGGAATA CCTGCGCTGG CTGGCCGGCC GGGACCGGGA CGCCTCCCGG
CGGGCCTGGG CCGCCGCGCT GGCCGGCGTC ACCGAGCCGA CGCTGCTGGC CGCGGCCACC
GCCCGCCCGG GCACCCCGGC CCAGACCGCG GCCCACGCCC CGGACCAGGT CCCCGAGCAG
GCTCCAGATC AGGCCGTGGA TCAGTGGGAC ACGGTGGACG GCGGGGTGAC CGCCGCGGTG
GAACGTCACG CCGACGCCGC CCTGACCGCC GGCCTGGGAA CCTTCGTCCG CGAGCGCGGG
CTCACCCTCA ACACCGTGCT GCGCGCCGCG TGGGCGGTCG TGCTCGGCCG CGCGGTCGGG
CGCGACGACG TCGTGTTCGG GACGTCGGTG GCCGGCCGGC CGGCGGAGCT GCCCGGCGCC
GCGGAGATGA TCGGACTGCT GCTCGCGACG GTGCCCGTCC GCGTCACCCT CGACCCGGCC
GAGAGCTTCG TGGGCCTGCT CGACCGGGTG CGCGCCGACC AGACGGCGAC CCTCGAGCAC
CACCACCTGG GCCTGGCCGA CATCCACGCG GCCGTGGGGC TGCCCGAGCT GTTCGACACC
CTGTTCGTGC TGGAGTCCTA CCCGTTCGAC CCGGCCGGCC TGCTCGGGCC GGACAGCGGC
CTGCGCCTGG AGGGCCTGGA CGGCCACGAC GCCACGCACT ACCCGCTGAC GATCCGGGTG
ATCCCGGGGG AGCGGCTGCG TATCACCTTC GGCTACCGGC CGGAGCTGCT CTCCGGGGCT
ACCGTCACCG CGCTCGCCGA CCAGCTCCTG GACCTGCTCG CCGCGGCCAT CGCAGATCCG
GACAAGCCCG TCGGCGCCAT CGGCGCGCCC GCCGGGTCCG TGGCGCCCGC CGGCACTGTA
GGCGCCGCCC CGGCCGCGAC CCGTCCGGAG GAGGACGAGC CGACCGGTGC GTGGGCGACC
GACACGACAC TTCCCGAGCT GTTCGAGGCC CAGGCGGCGC GGGTTCCCGA CGCCGTCGCG
GTCACCTGGG GCGAGCGGCG GCTGACCTAC GCCGAGCTGG ACGCCGCCGC GAACCGCCTC
GCCCGCCTGC TGGCCACCCG CGGGGTGGAG CCGGAGTCGC TCGTCGCCGT GGCGCTGCCG
CGCTCGATCG ACCTGGTCGT CGCCCTGCTC GCGGTGCAGA AGGCGGGCGC GGCCTACCTG
CCGCTCGACA CCGCCTACCC CGCCGACCGG CTCGCCTTCA TGCTTTCCGA CGCGGCACCG
GTGTGCCTGG TGACCTCCGC CGAGGCTCTG CCGGCCCTGC CCGCGCGCAC CCGGGTGCCG
ATGATCGCGC TGGACGCGCC GCCGGTCGTG GCCGCGCTCG CGGAGCAGTC CCCGGCCCGG
CTGCCCGCCG CCGGCCGGGC CCGGCCGGAG AACGCGGCCT ACGTGATCTA CACCTCGGGC
TCGACGGGAC GTCCCAAGGG CGTTGTCGTC CCGCACCAGA CGGTCACCAG GCTGTTCGCG
CACACCCAGC CGTGGTTCGG CTTCGACGAG ACCGACGTGT GGACGATGTT CCACTCCGCG
TCGTTCGACT TCTCCGTCTG GGAGCTGTGG GGCCCGCTGC TGTACGGCGG CCGCCTGGTC
GTCGTCGACC ACCACGTCTC GCGGTCGCCG GAGCTGTTCC TGGACCTGCT CCGGCGGGAG
TCGGTGACCG TGCTCAGCCA GACCCCGTCG GCGTTCAGCC AGCTCATCGA GGCCGACCGG
GCGGGCGGGG AGGACCCGGC GGAGCTGGCG CTGCGGTACG TCGTGTTCGG GGGCGAGGCG
CTCGACCTCG GCCGGCTGCC CGCCTGGTAC GCCCGCCACC GCGACGACGC CCCGGTGCTC
GTCAACATGT ACGGCATCAC CGAGACGACG GTGCACGTCA CCTACCTGCG GCTGGACGAG
GCCGTCGTCG CGGCGGCGCG GGCCAGCCTG GTCGGCGGGC CCATCCCGGG GCTGCGGGTG
CACGTCCTCG ACCAGCACCT GCGGCCGGTG GCGCCGGGCG GCCTCGGCGA GCTGTACGTC
AGCGGCGGGC AGCTCGCCCG CGGCTACCTG GGCCGGCCCG GTCTCACCGC CACCCGCTTC
GTCGCGGACC CGTTCGGAGG GCCGGGTGCC CGGATGTACC GCACCGGCGA CCTCGCCCGC
CGAACCGCCG ACGGCGGCTT CGAGTACCTC GGCCGGGCCG ACGACCAGGT CAAGGTGCGA
GGCTTCCGCA TCGAGCTCGG CGAGATCCAG GCGGCGATCG CGACCCACCC GGCGGTGGAG
CAGGCCGTGG TGCTGGCCCG GGAGGACCAG CCGGGGCAGC GGCGGCTGGT CGGTTACGTC
GTGGCCGCGC CGGGGCGCCG GGTCGACTCC GAGGAGCTGC GCCGGCACGC CGCCGCCATG
CTCCCCGAGT TCATGGTGCC GGTTGCCGTG CTGGCGCTGG ACGCCTTCCC GCTCACCGGC
AACGGCAAGC TCGACCGCGC CGCCCTGCCG GCGCCGGATC TGGCGAGCGT CTCCACCGGC
ACCCGCGCCC GCACCGAGCG GGAGCGAACG CTGTGCGCGG TGTTCGCGGG CGTTCTCGGC
CTGGACGAGG TGGGGGTCGA CGACGACTTC TTCACCCTCG GCGGCGACAG CATCAGCGCG
ATCCAGCTCG TCAACCGTGC CCGGCGCGAA CAGGTCGGCT TCACGCCCCG GGACGTCTTC
ACCCACCGCA CCCCGGCCAA GCTCGCGGCC CTCACCGAGC CCGCCGCCTC CGCCGTTTCC
ACCGCGCCCG ACGGGCCGGC CGCATTCGCC GGACCTGGCG CCGCGCCTGA CCGCCCTGGC
GGAGCGGGAA CGGGCCTGGG TGCCGATCCC GAGGCGATCG GCGTGTTCAC CCCGCTCCCG
ATCGTCTCCC GGCTGGCGAG CTGGGGCGGG CCGGTGCGCC GTTTCAACCA GGCCATGCTG
GTCTCGACGC CGGCCGGCGC CACCGTGGAC CTGCTGCGGG GCGCCGTCCA GGCCCTGCTC
GACCAGCACG ACGCGCTGCG GGCCCGGCTG CTGCGCCCGT CGCCCGCGCT GTGGCTGCTG
GAGACCCTGC CGGTCGGTTC GGCGCGTGCC GGCGACCTGC TGCGTCGGGT GGACATCGCC
GGCCTCGACG AGGCCGGCCT GCGCGCCGCC GTCGCGGCCG AGTCGGCCGC CGCGGCCGAC
GGGCTCGACC CCGAGGCCGG CGCGATGCTG CGGGTGAGCT GGCTCGACGC CGGCTCCGAC
CAGCCCGGGC GGCTGCTGCT GGTCGCACAC CACCTCGTCG TCGACGGGGT CTCCTGGCGG
GTCCTGCTCG CCGACCTGCG GACCGCCTTC GCGGCCGTGC TCGCCGGGCG AGCTCCCCAC
CTGGACCCGG TCGGGACCTC GCTGCGCGCG TTCGCCCGGA TCACCGCCGA ACGTGCCCAG
GACCCGGCCC GCCTCGCCGA GCTCGCGCAC TGGACGGCGA CCCTCGCCCC GGGCGGTCGG
CTGCTGGGCG GCGCCCCGTC GACCGGCGCC GCGGCTGCGA CCGGCGCCGC GGCGGCTCCC
GGAGCGGCCC CCGCCGCGGC CGACACGATC GCCGACACCG ACCGCCTCGT CGTGGAGCTG
CCCGCGCGAG CCACCGCGCC GCTGCTCACC TCCGTTCCCG CGGCGGCGGG AACCGACGTC
ACCGCGGTGC TGCTCGCCGC GCTGCGTGCC GCCGTCACCG GCTGGCGCCG CGAGCGGGGC
TGGGACGGAT CGTCCGACCT GCTCGTGGAT CTCGAGCGCC ACGGCCGCGA CCCGATCGCG
CCCGAGGTCG ACCTCTCCCG GACGGTCGGC TGGTTCACCG CCATCTCACC CGTCCGGCTG
CCGGCCTCCG GCGATCCGGC CTCCGGCGAT CCGGCCTCCG GCGGGCTGGC GTCCGGCGGG
CTGGCGGGCA CGCTCGAGGC CGTGGCGGGC CGGCTGCGCG AGATCCCCGA CGGCGGGGCC
GGGTTCGGGC TGCTGCGCTA CGCCAACGCG GCCGCGGCGC CGGTGCTCGC CGGGGCGGAG
CACCCGGAGG TCCTGTTCAA CTATCTCGGG CGGTTCGGCG CAGACGCGGC CGACGCCTGG
GCGCCGGCAC CGGAGCTCGA CGCCCTCGCC GCCGAGCCCG ACCCGGCCAT GGGCGTCGCC
TACCCGCTGA CCGTGGACGC CGTCAGCGTC CAGACGCCTG ACGGGCCCGT GCTGCGGACC
ACCCTCACCT ACCTCACCAC GGTAATCGAC CGGACCGCGG CCGGCGCCCT GGCCACCGCC
TGGCTGGCCG CGCTCGACGG CCTGGCCGGC CTGGCCGACG ATGCCGTCCT GGCTGGCGCC
GCTGCCCAGG CTGACACCGC TGGTCCGGCC GCCGTCACCG GTGCGGACGG CGCCGCCGGC
CCGGCAGCGA CCGGTGACCT GGTCGAGCTG TCGCCGGCCG AGCGGGCGCG GGTCGAGCGG
GTGAGCCCGG GGCCCGTCGC CGAGATCTGG CCGCTCTCGC CGTTGCAGGA GGGCCTGTTC
TTCCACTCCG CCTTCGACCG CTCCAGCGAC GCCTACACCG CCCAGTTCGC GCTCGACTTC
GGCCACCGTT TCGACGTAGA CCGGCTGCGC CGCGCCTGCG AGACGTTCAT GCGGCGCAAC
CCGACGCTGC GCGCCGGTTT CCTCGGCGAC GGGCTGCCCG CGCCGCTGCA GTTCGTCGTC
ACCGAGCTGC CCGCGCCGCT GGAGGAGACC GACCTCACCG GCCTGCCCGC TGCCGAGCGG
GCCGAGCAGG CCGAGCTGCT GGCGCGCCGC GACCGGGAGC GCCCCTTCGA CATCTCCAAC
CCGCCGCTGT GGCGGCTCAC CCTGCTGCGC CTCGGCCCCG ACCACGACCG GCTGGTCTTC
AACCGGCAGG TGCTGGTCTG GGACGGCTGG TCTGGCGCCC TGGTCATCGA CCAGCTCCTC
GGGCTCTACG AACGTGGCGG CGGTGACGAG GGGCTGCCCG TCCCGGCCGC CTCCTACCGC
GACTTCCTGG TCTGGCTGCG GGCCCGGGAC ACCGCGGGCG CCGAGGAGGC ATGGCGGGCC
GCGCTCGCCG ACCTCGACGG CCCGACGCTG GTGGTGCCCG AGGCCCGCGG GCTGCCGCCG
ATCGAGCCCG AGCGGATCAC CACCGAGCTG GCCGAGTCGA CGGCGCGCGC GGTGCGGGAG
CTCTCCCGCC GCCACGGCAT CACCGTCAAC ACGGTGTTCA ACGCCGCGCT GGCGCTCGTC
CTCGGCAACG CCGTCGGCTC CGACGACGTC GTGTTCGGCA CCACGGTGGC CGGTCGGCCC
ACCGACGTCG AGGGCATCGA CGAGGTGATC GGCCTGTTCC TGAACACCGT GCCCGTCCGG
GTGCGGCTGG ACCCGCGTGA GTCGGTGCTG GACCTGCTCC GCCGGGTCCA GGACCAGCGG
GTCGACGTGA TGGACCACGA GTACCTGGGG CTCGGCGACA TCCAGCGCGC CAGCGGCCGC
ACGCAGCTGT TCGACACGCT GTACGTGCTG CAGAACTTCA TCGACGAGGT GGCCACCGAG
CAGAGCAGCG ACCGGTTCGG GATCACCGGC GGCACCAGCA TCGACCACAC GCACTACCCG
CTGACGTTCG TGCTGTTCCC CGGCACGCGG ATCACGATGC GCCTGGAGTA CCGGCCGGAC
GTCGTCGGCG CGCAGCGGGC TGCGGCGCTG TTCGACCGGT TCCGCGGGCT GCTCGACGAG
CTGGTCCGGG ACGTCACCGT CCCGGTCGGG TCGGTCGAGG TGCTGCTGCC GGCCGAGCGG
GCGGAGCTCG CGGCGAGGTG GGCCGAGCCG TTCCTCCCGG TCGGCACCGA GACCGTCGCC
GACATGCTCG CCGCGCAGGT CGCCCGCACG CCCGATCTCA CCGCGCTGGT GTTCGGCGAC
GAGCGCGTCA GCTACGCCGA CCTGGACGCG CGGGTCAACT GGATGGCCCG GCTGCTGCTC
GCGCGCGGCG CCGGCCCGGA GACGGTCGTC GCGCTGGGGC TCGGCCGCTC GGTGAACATG
GTGGTGGCGC TGTTCGCGGT GCTGCGCACC GGGGCCGCCT ACCTGCCCCT GGAGCTCGAC
CACCCGCCCG CCCGCCTGCT CGGCATGGTC GCCGACGCCG GGGCGGCGCT GCTGGTGGCG
ACCGATGCCA CCGCCGCCTA CCTGGACGGC GCCGGCGCCG GGCCCGACGA GCCGGTACCG
AGACTGCTGC TCGACGACCC CGCGATGGCC GCGGAGCTCG CAGCCACCGG CGCCGGAGAG
CTGTCCGACG CCGAGCTCGG GCTGTTCGCC CGGGACCGCG CCGACCGCCT CGACCACCCC
GCCTACGTCA TCTACACCTC GGGGTCGACC GGGCGTCCGA AGGGAGTGGT GACCCCCTAC
CGGGGACTGA CCAACATGCA GCTCAACCAC CGCGAGGAGA TTTTCGCCCC CGCGGTGGCG
GCGGCCGGCG GGCGCCGGCT GCGGATCGCG CACACCGTCT CCTTCGCCTT CGACATGTCC
TGGGAGGAGC TGCTCTGGCT CGTCGAGGGC CACGAGGTGC ATGTGTGCGA CGAGAACCTG
CGCCGTGACG CCGAGGCCCT CGTCGCCTAC TGCGACGCCC ACCAAGTGGA CGTGGTGAAC
GTGACGCCCA CCTACGCCCA CCACCTGTTC GAGCTCGGCC TGCTCGACCG CGCCGAGGAC
GGCCGGCACC GGCCGCCGCT GGTCATGCTC GGCGGCGAGG CGGTCTCGGA GGCGGTGTGG
AACCGGCTGC GCGACACCGC GGACACCGCC GGGTACAACC TGTACGGCCC CACCGAGTAC
ACGATCAACA CTCTGGGCGC CGGCACCGCC GACAGCCCGA CGCCGACCGT CGGCCGGCCC
ATCCGCAACA CCCGCGCCTA CGTCCTGGAC GGCTGGCTGC ACCCGGTGCC CGACGGGGTC
CCGGGCGAGC TCTACATCGC CGGCGACGGC CTGGCCCGGG GCTACCTCGA CCGCTTCGCC
CTGACCGCCA CCCGCTTCGT CGCCGACCCG CGGGTCCCCG GCGGGCGGAT GTACCGCACC
GGTGACCTGG TCGTGCGCGG CGCCGCTGAC GGGAACCTCG ACTTCCTCGG CCGCACCGAC
GAACAGGTCA AGATCCGCGG CTACCGGGTG GAGCTCGGGG AGATCACCGC GGCGCTCGAC
CGTCATCCGC GGGTCAGCCA GGCCGCCGTC GTCGCGGCCG ATGACACCGG GGCGCCGGGA
ACCCGGCGGC TGGTCGCCTA CGTCGTGCCC GCGGAGCTGA CCGCCGCCGA CCGGGCCGCC
GTCGAGGCCG ACCAGGTCGG TGAGTGGCGC CAGGTCTACA GCGACGAGTA CGTGCAGATC
CCGACCGCGG TGCACCGCGA GGACTTCGCC GGCTGGGACT CCAGCTACGA CGGGCAGCCC
ATCGCGCTGG AGCACATGCG GCAGTGGCGG GCGGCGACCG TCGAGCGCAT CCGCGAGCTC
GCCCCGCGTC GCATCCTGGA GATCGGCGTC GGCACCGGTC TGCTGCTCGG CCAGCTCGCG
CCGGAGTGCG ACAGCTACTG GGGGACGGAC TTCGCCGCCC CGGTGATCGA CAAGCTCCGG
GCCGAGACCG CCGGTGACCC CCGCTTCGCC GGCCTGGAGC TGCGCTGCCA GCCGGCGCAC
GTCACCGACG GCCTGCCGGC CGGGCTCTTC GACACCATCG TGATCAACTC GGTGGTGCAG
TACTTTCCCA GCCCCGAGTA CCTGCGCGGG GTGCTGCGCG CCGCGCTGGA GCTGCTCGCC
CCCGGCGGCG CCCTCTTCGT CGGGGACGTA CGCAACCTCG CCAGGCTGCG GGCCTTCCAC
ATCGCGATCG AGGTGAGCCG GCCCGGTCCC GGCGGGCTCG CCGAGCCCGC GGGCGGCCTG
CGTGACGCCG ACCTCGCCCG GCTCGCGCCG GCCGTCGACC GCCGGGTCCG GCTCGACAAG
GAGCTGGTGC TGGCGCCGGA GTTCTTCACC GCGCTCGCCA GCGAGACCGG CGCCGAGCTC
TCGCTGCGGG TCAAGCGGGG CTCGTTCCAC AACGAGCTCA CCCGGCACCG GTTCGACGTC
GTGCTGCGCC GGCCCGCCGC GCCGCCGGTG CGTCTCGGTG AGGCCCCCGT GCTGGTCTGG
GGGCATCAGG TCGCCGGTAC CGACGCGATC GAGGCGCATC TGCGTGACCG CCGCCCGCCG
ATGCTGCGGG TCGCGGCGAT CCCCGACGCG CGGGTGGCCG GCGAGCTCGC CCTCGCGGCC
CGGGTCGCCG GGCGGGACGT CCCGGGCGGT GCCGCCGCGC CCGAGTGCGC GGTCGACGCT
GAGCACACGG TCGACCCCGA GGAGCTGTTC GCGCTCGCCG ACCGGCTCGG CTACGAGCAG
CACCTCACCC TGTCCGCGGC CGGTCCCGGC CTGTTGGACG CGGTGTTCGT GCGCGCGGTC
GACGTCGGCG CGGGCCCCCG CGTGGACGGC TACCTGCCGG ACCCGGCGGG CGGCGAGGCC
GCGCCGCCAC TCGCCAACGA CCCGACCGCG GCCCGCGGTG CCGCGGCGCT GGCGGCGCTG
CTGCGCGCCG ACCTCACCAC CGCCCTGCCC GACTACATGG TGCCGGCCGC CTACGTCACG
CTGAGCGAGA TCCCGCGCAC CGCCAACGGC AAGCTGGACG TCGCCAGCCT GCCCCCGGCC
GATCCCGCCG TGGCCCTGGT GGCCTCGCGC CCGGCGTCGT CACCGGCGGA GGTCACGCTC
TGCGAGCTGT ACGCCGAGGT GCTCGGGCTG CCCGAGGTCG GCGTCGAGGA TGACTTCTTC
GCGCTCGGCG GCCACTCGCT GCTGGCCACT CGCCTGGTCA GCCGGGCCCG CGCCGCGTTC
GGCGCGGACC TGGCGATCCG TGACCTGTTC GAGGCCCCGA CGGTGGCCGC GCTCGCGGCC
CGGGTCGGCG CCGCCGGTGA CGAGCCCGCC GTGTCGAGCC GGCCGCCGCT GGTCCCCGCC
GACCGCCCGG AGCGCATCCC GCTCTCGCCG GCCCAGCGCC GGCTGTGGCT GGTGGACCGC
GTCGCGGGCG GGACGAACGC GTACAACTAC CCGTTGGTGA TCCGGCTGGG CGGCGAGATC
GACCTGGACG CGCTGCGCGC CGCCGCCGCG GACGTCGCGG CCCGGCACGA GATCCTGCGG
ACCGTCATCG CGGAGCACGA CGGCGAGGAC TACCAGCGGA TCCTCGACCC CGCCGAGGCA
GGTCCCGAGG TCCGGTTCGT CGACGTCGCC CCCGCGTGCC CGGGCGCTCC GAGCGCCGGC
GCGGAGACGG GCCCGGCCGC CGAACCGAGC CTGGACGGCC CGGAGCGCCC CCTTGAGCCG
GATGAGCTTG CGCGCCTCGT CGAGCGGTTC GTCGCCGAAC CGTTCGACCT GCGGACCGAC
CCGCCCCTGC GGCTGGCCGT GTTCCGGACC AGCCCCGCCG AGTCGGTGCT TGCCGTGGTC
CTGCACCACA TCGCGACCGA CGAGTGGTCG GACCGGCCGT TCCTGGAGGA CCTCGACGCC
GCCTACGCCG CGCGCCGTGC GGGCCGGGAG CCGGACTGGG ATCCGCTGCC GGTGCAGTAC
GCCGACTACA CCCTCTGGCA GCGGGAGCTG CTCGGCGAGC CGGCCGACCC GTCGTCGCCG
GCGGGCCGCC AGCTCGCCTA CTGGGAGCAG GCGCTGCGGG GCATCCCGGC GGAGATCGAG
CTGCCGCTGG ACCGGCCGCG CCCGGCGGTG CGCGACGGCG CCGGCGGCCG GGAGGCCCGC
GAGCTGCCCG CCGCCACCGT GACGGCCCTG CGCGGCCTGT GCGCGCGGAC CGGCGCCAGC
ATGTCGATGC TCGCGCACGC CTCGACGGCC ACCCTGCTGC ACCGCCTCGG CGCCGGTGAC
GACATCCCGC TCGGCGTGCC GATCGCCGGC CGCACCGACA CCGCGCTCGA CCACCTCGTC
GGGTTCTTCG TCAACACGCT GGTGCTGCGC TCGGACCTCT CCGCCGACCC GACGTTCGCC
GAGCTGCTGG CCCGCACCCG CGAGACGGAC CTGGCCGCGT TCGACCACGC CGACCTGCCC
TTCGAAGACG TGGTCGCGGC TGTCAACCCG CGCCGCTCGG CGGGCCGCAA CCCGCTCTTC
CAGGTGATGA CCGGCTACCA CCACCTGGCC GACGGCGAGC ACACCCTGCT CGGCCTGCCG
ACGTCCTGGC TGCGAACCGA GGCGGGCACG GTGAAGTTCG ACCTGGACGT CACCTTCGTC
GACCGCGGCG GGAGCGACCA GATCACGCTG CTCGTCGAGT ACGCCCGCGA CGTGTGGGAC
GCCGACACCG CGCGGCGGCT GGCGGACCGG CTCGTCGACC TGCTCGGACA GCTCGCCCGG
GACCCGGACC GGCCGGTCAG CCGGCTGGCC CTGCTCGGCG GGGACGAGCG CCGGCGGGCC
CTGCGGTCCG CCACCGGCCC GCGCCGGGAA CCGCCGCAGC CCACCGCGGT GCGCCTGTTC
GCCGCGGCGG CCGCGGCCAG CCCGGACCGC CCCGCCCTGG TGGCCGGCGG GACGAAGCTG
ACCTTCGCCG AGCTGGCCGA GCAGGTCGGG AGCCTGGCCT GGCTGCTCGC CCGGCGCGGG
GCCGGCATCG AGGACGTGGT CGCGCTCGCG CTGCCCCGGG CCCGGATGGT GCCCGCCCTG
CTCGGCGTCA TGACCGCCGG GGCGTGCTAC CTCCCGATCG ACACCGACCA CCCCGCGGAC
CGGCTGGCCT TCCTGCTCAC CGACGCCCGG CCGCGGCTCG TGCTCACCAC CGCGGCCCTC
GCCGGCCAGC TGCCGGCGAC CGGGGCGGAG GTGGTCGTCC TCGACGACCC TGCCGTCCAG
GCCGAGCTGG CGGCCCATCC CGCCGGCCTG CCGATCGCCG GCCTCCCGCC GGCGGGCCAG
GCGCTGCGCG GCGACAACGC CGCGTACGTG ATCTACACCT CCGGCACCAC CGGCCGTCCG
AAGGCAGTGG TCGCCACGCA CCGCGGCGTG ACGAACCTGT TCGCCTCGCA CGAGGTGGAG
CTGATCCTGC CGGCGGTGGC GGCGTGGGGC GGCGACGGGC CGCTGCGGGC CGTCCACGCG
GCGTCGTTCA GCTTCGACGG GTCGTGGGAG CCGCTGCTGT GGCTGTTCGC CGGGCACACC
GTGCACGTCG CCGACGAAGC GACGATGCGC GACCCGGCGG CCCTCGCCGA GTACGTGGTC
AACGCGCGGA TCGACTTCCT CGACGTCACC CCGACATACC TGCGTGAGCT CGTCCACCTC
GGGTTCCTCG ACGGCGCCCA CCTGCCGGGC GTGATCGCGG TCGGCGGCGA GGCCACCCCG
GCGCCGCTGT GGGAGCGCCT GCGCACCCTG CCCGGTGTCG TGGCGCACGA CCTGTACGGC
CCGACCGAGT ACAGCGTCGA CGCCTACGGC TGGCACGGTG ACGGCACGGC CGGCCCCGTC
GCGAACACCC GTGCCCTCGT CCTGGACGCC GGTCTCGAGC CCGTCCCGGA CGGGGTGCCC
GGGGAGCTGT ACCTGGCCGG TGACGGCCTG GCCCGCGGCT ACCTGGGCCG GCCGGGGCTG
AGCGCCACCC GGTTCGTCGC CGACCCGTTC GGCGCCCCGG GCGGGCGGAT GTACCGCACC
GGCGACCGGG CGCGGCGCCG CGCCGACGGC ACGCTGGCCT TCCTCGGACG GGTCGACGAC
CAGGTCAAGG TACGTGGCTT CCGGATCGAG CCCGGCGAGA TCGAGGCCGC GCTGCTGGCG
CTGCCCGGCG TCGCCGCCGC GGCCGTGATC GTCCGGGAGG ACGCCGCGGG AGACCCGCGT
CTGGTCGCCT ACGTCGTCCC GGACGGCCCT TGCGCCACGG CCGATCCCGG CGTGGCCGTT
CCTGGCCCGG CCGATCCCGC CGCGGCCGAT CGCGCGACGG GCGATCGCGC CGTGGCCGAT
CTCGGCGCGC TGCGCTCCGA GCTCGCGCGG ACGCTGCCGT CCCATCTCGT GCCGAGCGCG
TTCGTCGCGG TGGCCGAGCT GCCGCGCACC GTCAGCGGAA AGCTGGACCG CGCGGCCCTG
CCGGCACCCG GCGAGCCGGC GGCGTGGCCC GGCCGCCGCC CGCGCGGCGC GCGCGAGGAG
CTGCTCGCCG AGCAGTTCGC CGCCGTCCTC GGGACGGCTG AGGTTCTCGG GACGGCGGAG
ATCGGCGCCG AGGACGACTT CTTCGCCCTG GGTGGGCACT CGCTGCTGGC GATGCGCCTG
CGCAGCCGGA TCCGCTCGGT GTTCGGCGTG GAGGTTTCCG CCCGTGACAT CTTCGACGAG
CCGACAGTGG CGGGCCTGGC CCGCCGGCTC GACGGGGCGG TGGCCGCGGC CCGGCCCGCC
CTGGCGCCCG CCGACCGGCC GCAGCGCCCA CCGCTGTCGC ACGCCCAGGC CCGGCTGTGG
GTGCTCGGCC AGGTGGAGGG GCCCAGCCCG ACCTACAACA TCCCGGTCAC CTGGCGGCTG
GAAGGGCCGC TCGACGTCGG TGCGCTGCGG GCCGCCGTCG GCGACGTGGT CACCCGGCAC
GAGGCGCTGC GCACGGTCTT CCCCGCCCCG GACGGCGTGC CCCACCAGCG GGTCCTCGAC
CCCGCCGCGG CCCGGGTGGA CGTCGAGCTG ATCGAGCTCG GCTCCGCCGG GGACCTGGCC
CGGCGTCTGG AGAACGCGTC GGCACGGGTG TTCGACCTCG AGCGAGAGCT GCCGGTGCGG
GTCACCGCCG TCCGGCTCGA CCCGCGGCTG CACGTGGTCC AGTTCCTGGT CCACCACATT
GCCGCCGACG AGGGCTCGGA CCAGGCGCTG GCCCGTGACC TGTCGACCGC CTACCGCGCC
CGGCTGCGCG GCGAGGCGCC GCGATGGGCC CCGCTGCCGG TGCAGTACGT CGACTACACC
CTCTGGCAGC GCGCGTTGCT GGGGGACGAG TCGGACCCGG CCAGCCCCGC CTCGGCCCAG
CGCGACTTCT GGCGCCGCGA GCTGGCCGGG CTGCCGGTCG AGCTGGCGGT GCCCACCGAC
CGGTCCCGGC CGGCCGAGCC GTCCCACCGC GGCGGCGTCG TCGAGCTGAC CTGGGACGCC
GAGTTGCTCG ACCGACTGCG GGCCACCGCC CGGGCGCACG ACGTCAGCCT GTTCATGGTG
TTGCAGGCCC TGGTCGCGAC GCTGCTACAC CGGCTCGGGG CCGGCGACGA CATCCCGATC
GGCAGCCCGG TGGCCGGCCG CGGCGACGAG CGGCTCGACG ACCTGGTGGG CTTCTTCCTC
AACACGCTGG TGCTGCGGAC GGACCTGTCG GGTCCGGTGA GCTTCGGCGA GCTGCTCGGC
CGAGTCCGGG CCGCCGACCT CGCGGCCTTC GACCACCAGG ACCTGCCGTT CGACCGGGTC
GTCGAGGCGG TCAACCCGCC GCGCTCCCTC GCCCGGCACC CGCTGTTCCA GGTGATGGTC
GTCCACCTGC CGGCGGCGGG CGCCGCCGCC GGGCTCGACC TGCCGGGCGT GACGGCCCGG
CCCGAGCCGG TGCGCGCCGC CACCGCCAAG TTCGACCTCT CCTTCGACTT CCTGGAGCGC
GTCACCGAGG CCGGCACGAC CGAGCTGGCG GTCGGGCTCG CCTACAGCGC GGACCTCTTC
GACCACGCCA CGGCCGCGTC GTTCGGCCGC AGGCTGCAGC TGCTCACCGA GGCCGTGCTC
GCGGCCCCGA CAGCACCGCT GGCTGCCCTG CCCGTTCTCG ACGACGCAGA GCGGGATCTG
GTCCTGACCG GGTTCAACCG CACCGGGCGC GCGGTCACCG AGCTGACCTG GCCGGCGGCC
TTCGAGGCTC AGGCCGACCG CACCCCGGGG GCGGTGGCCG TGGTCTGCGA GGACGTCGAG
CTGACCTACG CCGAGCTCGA CGCGCGGGCG AACCGGCTCG CCCGGCTGCT GGCCGCCCGC
GGCGCGGGGC CGGAATCGGT GGTGGCCGTC GCCGTGCCGC GCTCGGCCGA CCTGGTCGTC
GCGCTGCTGG GGGTGCTGAA GACCGGCGCC GCCTTCCTGC CGCTCGACCT CGACCACCCG
GCGGACCGCG TCGCCTTCAT GATCTCCGAC GCGGGCGCCC GGCTGCTGGT CTCCACCCGC
GGCCACGCCG AGGAGCTGAT CGCGCTCGGG GCGCTCGGGG CGCTCGGGGC GCTCGGAGTG
CCGGGGCCGC TGGGCGATCC GGGCGCGCCG GGCACCGGCG CGCCCGGGCT CGACCTCGTG
CTGCTCGACG AGCCCGGGAC GGCGGCCGAG CTGGCCACGG GTGACCCGGC CCGGCCCGGC
ACCGGCGTGG CCACGAGCCT CGACAGCGCC GCCTACGTCA TCTACACGTC GGGCTCGACC
GGACGTCCCA AGGGTGTGGT GGTCACCCAC GAGGGTGTGG GCAGCCTGGT CGCGACCGCT
GTCGACCGGC TCGGGGTGAA CGGGACGTCC CGGGTGGCGC AGTTCGCCTC GGTCGGGTTC
GACGTCGCGG TCTTCGACCT GTGCATGGCG CTGTGCGTCG GCGGGCGCGC GATCATCGTG
CCGGCGCAGC GGCGCGTCGC CGGGCCGGAG CTCACCGGGT ACCTCGCCGA CCACGGCGCC
ACGCACATGA TCCTGCCGCC CGCCCTGGTC GCCGCGTTGC CGGCGGACTG CGCGCTGCCG
GCGGGCGCGG TGCTGGTGGT GGGCACCGAG ACCGTGCCGA TCGAGACGGT GCGCCGCTGG
TCGAGCCACC TGCGGGTGGT CGCCGCCTAC GGCCTGACCG AGGCGACGGT CAACTCCACC
CTGTGGCAGG CGGACCCGGA CTGGACCGGC GCGGTGCCGA TCGGCGTCCC CGACCCCAAC
ACCCGCGCCT ACGTGCTGGA TACCGCGCTG CGGCCGGTCG GGGTCGGGGC GGTCGGCGAG
CTCTACATCG GCGGCCGCGG GCTGGCGCGG GGCTATCTCG GACGCCCGGC GCTGACCGCC
CAGCGGTTCG TCGCTGACCC GTTCGGCCGG CCGGGCGACC GGCTCTACCG GACGGGCGAC
CGGGCGCGCT GGCGCCCCGG CGGGGTGCTG GACTTCCTGG GCCGGGCCGA CGACCAGATC
AAGATCCGTG GCTACCGGAT CGAGCCGGGC GAGATCCAGT CTGTGCTGAT GCGTCACCCC
GGCGTGCGCC AGGCGGTCGT GCTGGCCCGG GAGGACCGCC CTGGATCACT CCAGCTCGTC
GCCTACGTCG TGCCGGTCGA CCCGGCGCAG ACTGGCCCGG CGCCCGGCGG TGGCCGGCTC
GACCCGGCGG CGCTGCGCGT GCACGCGGCC GAGTACCTGC CCGAGTACAT GGTGCCGGCG
GCGGTGGTGC TCGTCCCGGG CCGGCTTCCG ATGACCCCCA ACGGCAAGCT CGACCGGGCA
GCGCTGCCGG CTCCCCGGTT CGCGCTCTCG GCAGCCGACC GTGCGCCGGC CTCGGCGACC
GAGGCACGCC TGTGCGAGCT GTTCGCTGAG CTCCTCGGGC TGGGCGACAG CCGTCGCGGG
CCGGCGGCAG AGGAACAGGC GGGGCGGCAG CCCGCCGCCG CCGCCGTCGG CGTCGACGAC
GACTTCTTCG CCCTCGGCGG CGACAGCATC ATCTCGATGC GGCTCGTCAG CCGGCTGCGG
GCCGACGGGT TCGTGGTCCG GCCGGCCCAG GTGTTCCGGC ACCGGACGCC GGCCGAGCTC
GCGGCGGTGC TCACCGGGGC CGGGGCGCTC TCAGGCACCG TCGCCACGAC CGGCGCCGGG
CCGCTTCCTG GTGCCGTCGC CACGGCTGTG GCCAGCGACG GCGTCGGCGT GGTGCCGCCG
ACCCCGCCGC TGCTCGCGCT GCGGGCCGCC GGCGGCGGTG TGGACGGTTT CTCCTCGCCC
ATGCTGCTGC ACGTCCCGGC CTCGGTCACC CTGCCGGCGC TGCGGGCCGT CCTGGCCGCG
GTGGCGGAGC GGCACGAGAT CCTGCGGGCC CGGCTGGTGC GTGCGGCGAC CGGCACCGTC
CCCGACGGCC TGCCGGCGAC CAGCGCCGAC GGCGGTGAGG CGTGGTACCT CGATGTGCCT
CCGCCCGGCC CGGCCGGTGT CGGCTCGTGG GTGCGCCGGG TGGAGGTGGG CGCGGACGAC
GACCTCGACC GGATGCTCGC CGCCGAGGCG CTCCGCGCCA ACCGGGAGCT CGACCCCGAC
GCCGGCGCGA TGGTCCGCGT CGTCTGGTTC GACGCCGGGC ACAGCAACTC GGGCCGGCTG
CTGGTCCTCG TGCACCACCT GGTGACCGAC GGGGTGTCCT GGCAGATCAT CCTGGATGAT
CTCGCCGCGG CCTGGAACGC CCTGACCCGG GGCGGCGTAC CGGCCCTCGC ACCGGTGTCG
ACGTCCTACC GGCGGTGGGC GCTGGGTGTC ACCGAGCAAG CCCGGCGACC CGAGCGGGAG
GCGGAGCTGG CGTACTGGCT CCGGACTGTC GAGGGCGGCG CGCCGCTACC GCGCCCGGAG
ATACCGGAGG TCGAGCGGGC GGCACCGGAG CACGACACGG CGGTGGTCGA GCGCGACGCC
GGGGCGGTAA CCGCGACGAT TCCCGAGACC TGGGCCGGCG ACGTGCTGGT GGTCGCGCCC
GCTGCCTGGG GCGTCGGGGC CGACGAGATC CTGCTCGCCG CGCTGGCGCT GGCGGTGGCC
GACTGCCGCC GCCGGTGGGA GGGCACGCAG GCCGCCGGGC TCGTGGTCGC CCTGCAGAGC
CATGGCCGCG TCGACCCGGA GGTGGACGAG GCGGACCTGA CCCGCACCGT GGGGTGGTTC
GCCGACGTCC ACCCGCTCTG CCTGGATCAG CGGCCGGTCG ACCTTGGCGA CCCGGCGGCC
GGCGCGGCGC TGGACGCCGC GGTCGTCGCC GTCCACCGGC GCAAGGCCGA GACCCCTCGG
GGCGGTAGCG GCTACGGCCT GCTGCGCCAT CTGAACCCGC GCACGGCGCC GGTCCTCGCC
CGCGCCGGCC AGCCGGCGGT CTACCTGAAC TACGAGGGCA GGTGGTCGCG GCCCGAGCCG
GCTGACTGGG ATGTCGCGCG CGAGGACGAG GGTCTCTTCG CCGGCTGGAA CTCCGAGCGC
GCCGACCCCT TCCCACTCAC GGTGATCATC CGGGCGCTCG ACCTGCCCGG CGGCCTGCGG
CTGGTCGCCC GGTGGACGTC CGGCGCCGGC GGGCTCCCGG CATCCGCGGT CGGTGATCTC
GCCACGGCGT GGACGCGTGC CCTGGCCGCG CTCGCTGCGC GGGCCCGGCA CCTGCGCTAA
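
The gene sequence above is laid out in space-separated blocks of ten bases. A minimal sketch of how such a listing might be joined into one string and its GC fraction computed is shown here. The helper names are hypothetical, and the result for the full sequence is not stated here; it could be compared against the GC content field of the record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence listing, copied from the record.
    first_line = (
        "ATGTGGTTCC TGCAGCAGCT CGACCCGTCC AGCGCCGCGT ACAACGTCGC CGCCTCCTTC"
    )
    seq = join_blocks(first_line)
    print(len(seq), round(gc_fraction(seq), 3))
```

The same `join_blocks` helper would apply to the protein listing, since it only strips whitespace.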
 
Protein sequence
MWFLQQLDPS SAAYNVAASF DLTGPVDADA LHAALRAVAR GHEILRTTYR VPAGGRPEQV 
VHAELDPQWS SHDVSGEPSA SRERLADEIA RHAARRPFDL TSSSPLRVTL IRLAADRFVL
VVVAHHVAWD DASWEIFFRE LFAHYDNLRA GDGPLPADER TAQYIDVAGQ PRGADAQDPD
PAALDYWRGQ LTALPEPLRL PGGEPVAAAT AAGADPGKPA DPGKLADPGK LADPGGQVVH
QAAAGLGRRV RAFAAEHGAT PFMVLLAAFA AVLHRHTGAT DIVVGSPVVN RDQPGADRLI
GYFGNTVALR VRVRPGDSFR DLLARTSETC RGAFAHQHVD LEDVVRAVNP DRSDGVSSVF
TTLFAVRTAA AEHLGARELS GARRPLHTGD AQFPLGVTVE IGADQLDLEA GFLTAAVTAG
SAGAVLRHLE TFLDHATRHP RRPLAAVDLL TDSERRRTLV DWNNTAAPSP DATWAELLAR
RAARAPGHAA VVTSGGTLTY GELVGRADAL AYQLRGLGTG PGAIVALALP RTLDLVVALA
GVTRAGAAYL PVDPGYPADR ITLVLEDAAP SLLITTRELA ATLPIPAGVT VVLVNGPAAG
PEAGTEFGTE FGSVPEPPAA AAVPAELGSA AYVIYTSGST GRPKGVVVPH RALTNFLASM
VDQFQLGDGD RVLALTTLSF DIAALELLAP LTAGATVHLA GQDEARDPTA LAALLAGGGI
TVAQATPSMW QSILAASDDR FPGVRVLSGG EGLPPAVAAT LAERADEVVN LYGPTETTIW
STAGPVAGDG RPTVGRPIRS TQVYLLDTAL APVAPGMPGE LYIGGAGVAD GYLRRPGLTA
SRFVADPFSA GGRLYRTGDL ARWTADGELD VLGRVDDQVK VRGFRIEPGE VEAVLGRHPA
VARCAVVARD DGPAGRHLVA YVVPAAPGTV VDPALLREHL AAALPEHMVP GAFVQLAALP
TTPNGKLDRR GLPRPDFAAA AGSRAPATAA ESLLCDLFGT VLGVERVGAD DSFFHLGGDS
ILAFQVVAGA RAAGLACVLP DIFRHPTPAS LAAAAEQATV PPPADTALDA DLAGVSAVDL
ERWRARYPDL TGVRPVSPLQ AGLVFHSLLG DGSGTVPGAA GSDAYILQFV VDLRGRLDPA
RLRAAGRALF ERHENLRTAF LYDGDAPVQI VLADLAPAAF DEVWTEVDLA GWPRAEALAE
AERLGRADRQ TPFDLTAPPL VRFLLLRTGP DAWRLVLSPH HVVLDGWSVP LLLRELFRRY
ELAQSQSQAQ ADGPAAALPP APSYWEYLRW LAGRDRDASR RAWAAALAGV TEPTLLAAAT
ARPGTPAQTA AHAPDQVPEQ APDQAVDQWD TVDGGVTAAV ERHADAALTA GLGTFVRERG
LTLNTVLRAA WAVVLGRAVG RDDVVFGTSV AGRPAELPGA AEMIGLLLAT VPVRVTLDPA
ESFVGLLDRV RADQTATLEH HHLGLADIHA AVGLPELFDT LFVLESYPFD PAGLLGPDSG
LRLEGLDGHD ATHYPLTIRV IPGERLRITF GYRPELLSGA TVTALADQLL DLLAAAIADP
DKPVGAIGAP AGSVAPAGTV GAAPAATRPE EDEPTGAWAT DTTLPELFEA QAARVPDAVA
VTWGERRLTY AELDAAANRL ARLLATRGVE PESLVAVALP RSIDLVVALL AVQKAGAAYL
PLDTAYPADR LAFMLSDAAP VCLVTSAEAL PALPARTRVP MIALDAPPVV AALAEQSPAR
LPAAGRARPE NAAYVIYTSG STGRPKGVVV PHQTVTRLFA HTQPWFGFDE TDVWTMFHSA
SFDFSVWELW GPLLYGGRLV VVDHHVSRSP ELFLDLLRRE SVTVLSQTPS AFSQLIEADR
AGGEDPAELA LRYVVFGGEA LDLGRLPAWY ARHRDDAPVL VNMYGITETT VHVTYLRLDE
AVVAAARASL VGGPIPGLRV HVLDQHLRPV APGGLGELYV SGGQLARGYL GRPGLTATRF
VADPFGGPGA RMYRTGDLAR RTADGGFEYL GRADDQVKVR GFRIELGEIQ AAIATHPAVE
QAVVLAREDQ PGQRRLVGYV VAAPGRRVDS EELRRHAAAM LPEFMVPVAV LALDAFPLTG
NGKLDRAALP APDLASVSTG TRARTERERT LCAVFAGVLG LDEVGVDDDF FTLGGDSISA
IQLVNRARRE QVGFTPRDVF THRTPAKLAA LTEPAASAVS TAPDGPAAFA GPGAAPDRPG
GAGTGLGADP EAIGVFTPLP IVSRLASWGG PVRRFNQAML VSTPAGATVD LLRGAVQALL
DQHDALRARL LRPSPALWLL ETLPVGSARA GDLLRRVDIA GLDEAGLRAA VAAESAAAAD
GLDPEAGAML RVSWLDAGSD QPGRLLLVAH HLVVDGVSWR VLLADLRTAF AAVLAGRAPH
LDPVGTSLRA FARITAERAQ DPARLAELAH WTATLAPGGR LLGGAPSTGA AAATGAAAAP
GAAPAAADTI ADTDRLVVEL PARATAPLLT SVPAAAGTDV TAVLLAALRA AVTGWRRERG
WDGSSDLLVD LERHGRDPIA PEVDLSRTVG WFTAISPVRL PASGDPASGD PASGGLASGG
LAGTLEAVAG RLREIPDGGA GFGLLRYANA AAAPVLAGAE HPEVLFNYLG RFGADAADAW
APAPELDALA AEPDPAMGVA YPLTVDAVSV QTPDGPVLRT TLTYLTTVID RTAAGALATA
WLAALDGLAG LADDAVLAGA AAQADTAGPA AVTGADGAAG PAATGDLVEL SPAERARVER
VSPGPVAEIW PLSPLQEGLF FHSAFDRSSD AYTAQFALDF GHRFDVDRLR RACETFMRRN
PTLRAGFLGD GLPAPLQFVV TELPAPLEET DLTGLPAAER AEQAELLARR DRERPFDISN
PPLWRLTLLR LGPDHDRLVF NRQVLVWDGW SGALVIDQLL GLYERGGGDE GLPVPAASYR
DFLVWLRARD TAGAEEAWRA ALADLDGPTL VVPEARGLPP IEPERITTEL AESTARAVRE
LSRRHGITVN TVFNAALALV LGNAVGSDDV VFGTTVAGRP TDVEGIDEVI GLFLNTVPVR
VRLDPRESVL DLLRRVQDQR VDVMDHEYLG LGDIQRASGR TQLFDTLYVL QNFIDEVATE
QSSDRFGITG GTSIDHTHYP LTFVLFPGTR ITMRLEYRPD VVGAQRAAAL FDRFRGLLDE
LVRDVTVPVG SVEVLLPAER AELAARWAEP FLPVGTETVA DMLAAQVART PDLTALVFGD
ERVSYADLDA RVNWMARLLL ARGAGPETVV ALGLGRSVNM VVALFAVLRT GAAYLPLELD
HPPARLLGMV ADAGAALLVA TDATAAYLDG AGAGPDEPVP RLLLDDPAMA AELAATGAGE
LSDAELGLFA RDRADRLDHP AYVIYTSGST GRPKGVVTPY RGLTNMQLNH REEIFAPAVA
AAGGRRLRIA HTVSFAFDMS WEELLWLVEG HEVHVCDENL RRDAEALVAY CDAHQVDVVN
VTPTYAHHLF ELGLLDRAED GRHRPPLVML GGEAVSEAVW NRLRDTADTA GYNLYGPTEY
TINTLGAGTA DSPTPTVGRP IRNTRAYVLD GWLHPVPDGV PGELYIAGDG LARGYLDRFA
LTATRFVADP RVPGGRMYRT GDLVVRGAAD GNLDFLGRTD EQVKIRGYRV ELGEITAALD
RHPRVSQAAV VAADDTGAPG TRRLVAYVVP AELTAADRAA VEADQVGEWR QVYSDEYVQI
PTAVHREDFA GWDSSYDGQP IALEHMRQWR AATVERIREL APRRILEIGV GTGLLLGQLA
PECDSYWGTD FAAPVIDKLR AETAGDPRFA GLELRCQPAH VTDGLPAGLF DTIVINSVVQ
YFPSPEYLRG VLRAALELLA PGGALFVGDV RNLARLRAFH IAIEVSRPGP GGLAEPAGGL
RDADLARLAP AVDRRVRLDK ELVLAPEFFT ALASETGAEL SLRVKRGSFH NELTRHRFDV
VLRRPAAPPV RLGEAPVLVW GHQVAGTDAI EAHLRDRRPP MLRVAAIPDA RVAGELALAA
RVAGRDVPGG AAAPECAVDA EHTVDPEELF ALADRLGYEQ HLTLSAAGPG LLDAVFVRAV
DVGAGPRVDG YLPDPAGGEA APPLANDPTA ARGAAALAAL LRADLTTALP DYMVPAAYVT
LSEIPRTANG KLDVASLPPA DPAVALVASR PASSPAEVTL CELYAEVLGL PEVGVEDDFF
ALGGHSLLAT RLVSRARAAF GADLAIRDLF EAPTVAALAA RVGAAGDEPA VSSRPPLVPA
DRPERIPLSP AQRRLWLVDR VAGGTNAYNY PLVIRLGGEI DLDALRAAAA DVAARHEILR
TVIAEHDGED YQRILDPAEA GPEVRFVDVA PACPGAPSAG AETGPAAEPS LDGPERPLEP
DELARLVERF VAEPFDLRTD PPLRLAVFRT SPAESVLAVV LHHIATDEWS DRPFLEDLDA
AYAARRAGRE PDWDPLPVQY ADYTLWQREL LGEPADPSSP AGRQLAYWEQ ALRGIPAEIE
LPLDRPRPAV RDGAGGREAR ELPAATVTAL RGLCARTGAS MSMLAHASTA TLLHRLGAGD
DIPLGVPIAG RTDTALDHLV GFFVNTLVLR SDLSADPTFA ELLARTRETD LAAFDHADLP
FEDVVAAVNP RRSAGRNPLF QVMTGYHHLA DGEHTLLGLP TSWLRTEAGT VKFDLDVTFV
DRGGSDQITL LVEYARDVWD ADTARRLADR LVDLLGQLAR DPDRPVSRLA LLGGDERRRA
LRSATGPRRE PPQPTAVRLF AAAAAASPDR PALVAGGTKL TFAELAEQVG SLAWLLARRG
AGIEDVVALA LPRARMVPAL LGVMTAGACY LPIDTDHPAD RLAFLLTDAR PRLVLTTAAL
AGQLPATGAE VVVLDDPAVQ AELAAHPAGL PIAGLPPAGQ ALRGDNAAYV IYTSGTTGRP
KAVVATHRGV TNLFASHEVE LILPAVAAWG GDGPLRAVHA ASFSFDGSWE PLLWLFAGHT
VHVADEATMR DPAALAEYVV NARIDFLDVT PTYLRELVHL GFLDGAHLPG VIAVGGEATP
APLWERLRTL PGVVAHDLYG PTEYSVDAYG WHGDGTAGPV ANTRALVLDA GLEPVPDGVP
GELYLAGDGL ARGYLGRPGL SATRFVADPF GAPGGRMYRT GDRARRRADG TLAFLGRVDD
QVKVRGFRIE PGEIEAALLA LPGVAAAAVI VREDAAGDPR LVAYVVPDGP CATADPGVAV
PGPADPAAAD RATGDRAVAD LGALRSELAR TLPSHLVPSA FVAVAELPRT VSGKLDRAAL
PAPGEPAAWP GRRPRGAREE LLAEQFAAVL GTAEVLGTAE IGAEDDFFAL GGHSLLAMRL
RSRIRSVFGV EVSARDIFDE PTVAGLARRL DGAVAAARPA LAPADRPQRP PLSHAQARLW
VLGQVEGPSP TYNIPVTWRL EGPLDVGALR AAVGDVVTRH EALRTVFPAP DGVPHQRVLD
PAAARVDVEL IELGSAGDLA RRLENASARV FDLERELPVR VTAVRLDPRL HVVQFLVHHI
AADEGSDQAL ARDLSTAYRA RLRGEAPRWA PLPVQYVDYT LWQRALLGDE SDPASPASAQ
RDFWRRELAG LPVELAVPTD RSRPAEPSHR GGVVELTWDA ELLDRLRATA RAHDVSLFMV
LQALVATLLH RLGAGDDIPI GSPVAGRGDE RLDDLVGFFL NTLVLRTDLS GPVSFGELLG
RVRAADLAAF DHQDLPFDRV VEAVNPPRSL ARHPLFQVMV VHLPAAGAAA GLDLPGVTAR
PEPVRAATAK FDLSFDFLER VTEAGTTELA VGLAYSADLF DHATAASFGR RLQLLTEAVL
AAPTAPLAAL PVLDDAERDL VLTGFNRTGR AVTELTWPAA FEAQADRTPG AVAVVCEDVE
LTYAELDARA NRLARLLAAR GAGPESVVAV AVPRSADLVV ALLGVLKTGA AFLPLDLDHP
ADRVAFMISD AGARLLVSTR GHAEELIALG ALGALGALGV PGPLGDPGAP GTGAPGLDLV
LLDEPGTAAE LATGDPARPG TGVATSLDSA AYVIYTSGST GRPKGVVVTH EGVGSLVATA
VDRLGVNGTS RVAQFASVGF DVAVFDLCMA LCVGGRAIIV PAQRRVAGPE LTGYLADHGA
THMILPPALV AALPADCALP AGAVLVVGTE TVPIETVRRW SSHLRVVAAY GLTEATVNST
LWQADPDWTG AVPIGVPDPN TRAYVLDTAL RPVGVGAVGE LYIGGRGLAR GYLGRPALTA
QRFVADPFGR PGDRLYRTGD RARWRPGGVL DFLGRADDQI KIRGYRIEPG EIQSVLMRHP
GVRQAVVLAR EDRPGSLQLV AYVVPVDPAQ TGPAPGGGRL DPAALRVHAA EYLPEYMVPA
AVVLVPGRLP MTPNGKLDRA ALPAPRFALS AADRAPASAT EARLCELFAE LLGLGDSRRG
PAAEEQAGRQ PAAAAVGVDD DFFALGGDSI ISMRLVSRLR ADGFVVRPAQ VFRHRTPAEL
AAVLTGAGAL SGTVATTGAG PLPGAVATAV ASDGVGVVPP TPPLLALRAA GGGVDGFSSP
MLLHVPASVT LPALRAVLAA VAERHEILRA RLVRAATGTV PDGLPATSAD GGEAWYLDVP
PPGPAGVGSW VRRVEVGADD DLDRMLAAEA LRANRELDPD AGAMVRVVWF DAGHSNSGRL
LVLVHHLVTD GVSWQIILDD LAAAWNALTR GGVPALAPVS TSYRRWALGV TEQARRPERE
AELAYWLRTV EGGAPLPRPE IPEVERAAPE HDTAVVERDA GAVTATIPET WAGDVLVVAP
AAWGVGADEI LLAALALAVA DCRRRWEGTQ AAGLVVALQS HGRVDPEVDE ADLTRTVGWF
ADVHPLCLDQ RPVDLGDPAA GAALDAAVVA VHRRKAETPR GGSGYGLLRH LNPRTAPVLA
RAGQPAVYLN YEGRWSRPEP ADWDVAREDE GLFAGWNSER ADPFPLTVII RALDLPGGLR
LVARWTSGAG GLPASAVGDL ATAWTRALAA LAARARHLR