Gene Mlg_1047 details

Gene Information

Locus tag: Mlg_1047
Symbol:
ID: 4270520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1198977
End bp: 1217165
Gene Length: 18189 bp
Protein Length: 6062 aa
Translation table: 11
GC content: 71%
IMG OID: 638125799
Product: hypothetical protein
Protein accession: YP_741890
Protein GI: 114320207
COG category:
COG ID:
TIGRFAM ID:
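
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence below ends in TGA). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 1198977
END_BP = 1217165
GENE_LENGTH_BP = 18189
PROTEIN_LENGTH_AA = 6062


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the reported gene length and protein length.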


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0593961
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACG GGACGCAGGG GACCCGTCGG GGACGGCGCA GGCACAAGCC CGTTCCTGAT 
CACCCGGTGG AGGGACCCGG GTGGCGCCGG GATCCTCTGG CGCTGGCCAT CTCCGCGGCG
TTGGCGTGCG GTGGGGCCTT CCCCTCTGCC AGTGCCGCCG GCAGTAGTAT TACGGTCAAG
GACAACAAGA CCGAGACCGA CATCTCCGAC GCCCCCGGTG TCGATGGCGG CACCCGCTAC
ACCATCACCA CCGAATCGCT GACCGATTCC GGCCGGACCG GCCTCAACGC ATTTGCCGAG
TTCATCCTGG CCAGCGGGGA CCGGGCGGAC CTGGTGTTGC CCGATGGCAC CCTGAACCTC
ATCAACCTGG TCTACGACAG CCGCGCTAAG ATCCACGGGG AACTGTTCAG TCAGCTGGAC
GGCGAGTTCG GCGAAGGGCA CCTGCTGTTC GCCACGCCCC ACGGGATGCT GGTGGGGGCG
GAGGGGGCGA TCAACGCCGG GGCGCTGACC GCCATCGCCC CCCGCTCCTC GCAACTGGAT
CGGATCCTGG ACGGCGATGT CGGCCTCGGC GAACTGATGC GGGGCGAGGT TGAACTGGAC
CCGGAAGCCA CCATCGAGAT TCAGGGGGAG ATCGACGCCG ACCACGTGCG CCTGATCGGT
CATCAGGTGC TGGTCCAGAG CGGCGCCCGG ATCGAGATCG CCGACCCCAT TGACGACCAC
GAGGCGGTGT TCGGCTCGGC GGTGAATATC GACGGCCTGG AGAGCGGGGC AGGGATCGCC
GTCGACGCCG GCGACCTGGT CATCGCCGGT GAGCGCCGCG CCGAGGTCTA TGGTGACCTG
CTGGCCCAGG ATGGCGGCGT GGTGGTGCGC GCCCACAACG TCGGGCGGTC GGACCTCGGG
GTCACCCGGG CCGACGCGGA GGTGGCGATC GGCGGCCGGA TCGAGGCCGA GGACATCACC
CTGTCCGCCC GGGTCGATGC GGAGGCGGAA CTGGACCTGG TGGAGGCGCT CAAGGGCCGG
GTCGAGGCGC TGGTGCCGGA GGGGCTGGAA GAGGTCGCCG AGTCGGCCAT CGACAGCGGA
CTGGAGGCCG TCGACGACGA GCTCGACAAC CAGGATCTGC CGGACTTCGG GCTTGGGCTC
GGTCTGGTGG ACGCCTCGGC CATCGTCACC CTGGAGGACG GTGCGGAACT GGACGCGGGC
GGGAACGTGA ACATCCATGC CGAGGCGCAG CGCACCGCCC GCGCCGAGGC GCATGCCGAC
GGCGACGGCC TCGGTGGCGC CTTCGCCCTG GCCTCGATCA GCGGGCAGAC CGCGGTCACC
GTGGAAGAGG GCGCGAGCAT TGTGGCCGGC GGTGACATCG ACCTGCGCGC GGCCAGTCAC
AATGAATTGG TGGCCGAGGC CGTCACCGAG GTGGGCGACG GTTTTGACGT CGGCCTGGGC
GGCAGTGTGG CCCTGGGCTT CCTACAGTCG GAGCCGGAGG ATGGCGGCAC CACCGTCCAG
GTGCAGGAGG GGGCGACCCT CGCCGCCGGG GGCGATCTCG GCCTGTCGGC GTTCACGGAA
CACGATGTCA CCGTCGACGC CACCACCACG CTCGGCAGCG GGGGCGAGGC GGCTTTTGGC
GCCACGGTGG CCTACCTGGG TCTGGACTGG ACCACCCGCA CGCTGCTCAC CGGCGGGGCG
GTCGCCGACG ACACCGTTGA ACCCCCCGTG GGCATCGTCG CCGGCGGTGA CCTGCGGATG
GTGGCGCGCA CCGACCAGGC GATCCACACC GGGGCCTCGG TGGAGACCAC CGGCGACATC
AGTGTGGGGC TGGCGGCGGC GGTGGCCGAG TTGAACAACA CCACCCGCGT GGTCCTGGAC
GGTACGGTGC GGGCGGGCGG GGACGCGACC CTGCTGGCGG AATCCATCAC CCGGGCGCAG
CACACCGTGG CCACCACCCG GGCCGCGGAG GACGCTGACA ATAACGGCGG TAATGGCAAC
GACGGACCGG ACGAGGCCGC CGACGATCAA GGCACCTCCG ACTACGTGGC CGGTCTGGTC
GGTGACGTGG ATACCGGCGA GCCGGAGGAG ACCCCGGGTG CCGGTGGCAA TGGCGGCCTG
GCCCTGGATA TCGGGGCGGC GGTGGCCTAC ACCGATGCCC GGGACAGCGC CAGCACCACC
CTCGGAGGCA ATACCCAGCT GTTGCCACCC GCCGGCCCGC TGGGACCGGG TGGCCAGGTG
GCCATCGTCT CCCGCCGGCA GTTGGACAAT CTCTCCACCC GCGCCCGCTC CGGCACCGAG
GCCACCGGCC CGGACGGTGA CGACGGGGCC AACAGCATCA ACGCGGCGGT CAGTTACGCC
CGGCTTGACC AGATCGCCCG TACCGAGATC GCCGACGGTG CCCGCATCGA GGCCGGGCGC
ATCGGCGTTG GCGCCGAGGT GGATCTGCCC GCACGGGTGG CGTGGGGTGA CGAGGACAAC
GAGGACGGGC CGGGCGGGCT GGACGACATT GCGGGCGCTG ACAGCGATGA TCTGGACGAC
TGGTTGACCA GCGAGGCCAG CACGTCGTCG GAGACCGGCG GCACGGTGAA TATCGCCGGG
GCGGTGAACC TGCTGGACCT GACCGCCACT GCCGAGGCGC TGGTGGGCAC TGGTGACGAT
CCCACCGATG TGACCACCCT GCACGCGGTG GGTGATGGCG GTGACTGGGA AACCGAGCTG
GGCGGTGACA CCACCTGGCA CTGGGACGAT GCCGTTGCCG TGGCGGCCCG CAGCCGGGTG
GAGACCGTGG ACGTGGCGGG GCAGGCGGCG GCCGCCGGCA CCGTGGGTGT TGGTGGCGGC
GTCAACTGGC ACCTGCGGGA GAGCACCGCC CGGGCGCGCC TGGCCGGCAG CGCGGATATC
AGCGCCGACA CCGGCGGCGT GTCGGTGTCC GCCGATCGTC GGGACAGGGC CATCGTGCTC
ACCCCGATAG CCGGAGAGTC TGATGGTGGT GCCGTTGGCC TGGGGGCGTC GGTGGCCTAT
GCCCGTATCG ACGGGACCAG CCAGGCGCAG CTCGACGAGG GCGCCATGGT CGCCGCCGGC
GGTGCCCTGG CGGTCACCGC TGGCGGCGAC CTGCATGCCG AGATCCTGAC CGCCAGCAAA
ACCAGCGGCG ACGCCACCGC CGGGATCGCC GGCACGGTGG CCTTCGCTGA CATCTCCCTG
GATACCACCG CCGCAGTCCA CGGCCAGCCC GTGGACAATG CCCTGGAGGT GGGCGACCTG
TACCTCCACG CCCTCGACCT GCAGGACTAC CAGGTGCAGG CCTCGGCGGA GGCCCAGGGC
GATGTCGCCA TCGGCCTGGC CGGCGCGGTA CTGGATACCG AGGCCGGCAC CCGGGCCCTG
CTGTACCGGG ATGTCACCAC CGACGGCGAC GTGGTGCTGA TCGCCGACTC CGAGACCACC
GGGCGCGGCG TGGCCGCGAG CACCCGGGCC GAAACCGGGG AGGCGGGTGA CAACGGCAAC
GGAGAGAACA ATACCGGCGA CGATCGCGAC CTCAACACCG TCGACTTTGT CCAGCAGCGG
GGCGAGCGCG GCAACGAGGG GGTCGATGAC GGCCTGCAGG ACGTGGCCGG GGACCCCGGT
GCCGGCCAGT CGGAAGAGGC CGGCTTCGAC GTCAATGTCG GCGCGGCGGT CGCGTTTACC
GGTGCCGACG ACTCGGCCGG GGCGCACATC GGCGAGGCGG TGACCATCAG CGGCCCGGGC
GGGGGTGCCG CGGGTGACGT GGCCGTGCTC GCGCGGCGCG CCGAGAGCGG CTATCGCGGG
CGCGCCCAGA CCGAGGTGGA GACGGCCCCC GAGGATGACG GCGTCGGTGT CGGGGTCGGC
GCCGCACTCA CCCTGACCTT CATGGACAAC AGCGCGGAGG CACTTATCGG CGCCTCGGCG
GTGGTTGACG CCCATCGCGT CGGGGTGGCC GCGGACGTCA TCCTGCCGGA CCACTTTGAG
GGGCCTGCCT GGGACGGTCT CGAGACCCTC CGCGACCTGG CTCAGGACGA CGACGATGAG
CTGACCTACG ACGAGGCGGA GTCCAATCTG CTGGACATGG CCGACGGTTG GCTCACCACC
TATGCCAACG CCGCGGCCGA ATCGGAGGAC GGGGTCTTCG ACGGCGCCGC CGCCGTCAAC
TTCGCCCGCT ACAACGTCAA CGCCGACGCC TGGATCGGTG AGGGCGCCGA GGTGACGGCC
GGCGGCACGG CTGGCGATAC CGGCTGGAGC ACCGAGCTGC GCCCGGCGCA GGACAATGGC
GATCCGGCCC TGAGCCGTCG GTGGAACGAG GCGATCACCG TCAGCGCCGG GGCCGATGTC
CGCACCATCG ACGTCGCCGG CAACGTCGGT GGCCTGCTCA CCGCCGGCGT CGCCGGGGAC
GCCACCGTGG GCATCGGCGC CGGTTTCGCC TGGTTGGAGC GCGGCGGCCA CGTCAGCGCC
GGTATCGCCG ACCACGCGGT GGTGAGCGCC AGCCACGGGC AGGGCACCAT TGGCGTGCAC
GCCGAGCGCT TCGACCAGTC CATCGGCATT GCACCCAGTT CCGGCCACGG GGCCAGCTTC
GCCGGCAACG GTACGGTGGT GGTCAACCGC CTGGACACCG ATACCGTGGC CGCGGTGAGC
CACGCGGCGG CGGTCGACGC CGCGGCACTG ACCGTCAGCG CCGGCCGCGA TCTGCGCTGG
TGGTCGGTGG CCGGTTCCAT CACCCTGGCC GAAAGTGTCG CTGTCGGCGT CGGCATCTCT
GTCAACGAAC TGCGGACCGA TACCCGGGCC GTGATCGGCG ATGTGGCCGG TCTCCGCCCG
GCGATGGCCG GCGGCGGGGA CGTCACCGAG CTGGATGACA CCGAACAGGG GATTACCGCC
GACCAGGTCA CCGTGGCCGC ACGCAGCGAT GGCCTGGTGG GCAGTGCCTC CATCGCCGGC
GGCGCCGCCG GTGAGACCGG CGAGCCGCCG GCCACCGACA ACTGGACCGA CTCCGGGCTG
ACCAATGACG TGATTGGTGT CACCTCCGAT CGCACCGAAG GGGCGGAAGC CCCCGAGGAT
GAGACCATCG CCGAGGCCGT GCAGAGCGGG ACCGGCGAGC CAGCCGGGGC GCTGGATGAG
CAGGAGGCGT CGGCGGACGA CCTGGACGAG GACGGCGACC TTGCCAACCC GGACCCCACC
CAGCCCGACC TGGAGACCTT CCCCGACGAG GGTGCGGACA TCCCGGACGC GGGCGCCGAT
GCCGACAGCG AGGAGACGGA CCTGGGCGTC GCCGTGGCGG GGTCGGCCTC GGTCAACCTC
AGCAGTCTGA ACACCACGGC GCTCATCGAG GACGTGTCGC TGGCGGCGCG CGATGCCGAC
ACCACCGACG TTACCGCCAG CGCGGTGGCC GCCGTTGACA CCATCGCCGT CAGCGGCTCC
GGGGCCCTGG TGCTGGGCGG TAACAGCAGC GACCCTCAGG TGGCCATCGC CGGCACCGTG
GGCGTCAATC TGCTCGGCGA CCACACCCGG GCGCGGCTGG TGGACACCCG GATCGAGCAA
CCCGGCCAGG TGACGGTGGA GGCCCTTCGC GACGGTGAGT TGATCAGCGT CGGCATGGGG
TTGGCGATCA CCGCCAGTGG CTCCACCAAG GCCGCCGCCG TGGCCGGTTC CGTCTCCCTC
AGCGACATCA GTAATGAGAC CGCGGCCACC ATCGAGGGCG GCACCATCGG CGACCCCGGG
AGCGATCCCG AACCCGACGA CGGGAGCGGG GTGCAGGTGC TGGCCTATGA CCGCTCCAGC
ATCGGGACCG GTGGGGGTTC GCTGTTCGGC GGCAAGGGCG GTGGCTTCGG CGCCGCGGTG
ACTCTGGCCC AGGTGCGTAA CACCATCGAT GCGGGCATTC TCGGTACGCG CATCACCGAC
GTGGCCGAGG TGACCGTGGA TGCCCTCAGC GCCACCCGCA TCATCAGTGC CGGCGCGGTG
CTGGGTTACG GCGGAAAAGG GGCGGTGGGC GGCGCGGTGG TGTTGAACCG CATCGGCAAC
ACCACCAAGG CCGCGATCGC CGACCGGGAC GACGGCGACG ACGTGGAACG CGCGGAGATC
ACCGCCAATG AGCGGGTGCG GGTGCGGGCG CGCAGCGCCA GTGACGATGA ACGCGTGGCG
CTGGACGACC GCATCGATGC CGCCGGTGGC GGTCACGTTT ACAACTTCAG TGGTGAGGGC
ACCGCGCTGG CCGAGCCCGA GGCCGAAGGT GATCGCGACT GGGGCGATGA CAGCTACGAA
ACCGGCGGCG ATGAGTTCGC CGGCGAGGGC GGCGACCAGG CGGAGATCGA CAGCGGCTAC
GATGACGCCG CCGAGGATGC CCGCTTCGAC GGTGGCCCGC TGGAGGGTGA TGCCATTGTC
GGTGTGGCCG GCAGCATCAG CGCGAGCGGC AAGGCCAGCA TCGGGCTGGC CTTCAGTGGC
AACCAGATCG ACAACGACTA CATCGCCGAG GTGCGCGGCG CCCGGATCGA CGGCGTGGAT
GGCGAGCTGA GCGTCGATGC CGCCGACCGG TCGCGGGTCA TCGGCCTGGG GGTGGGCGGC
GGCGCCTCGG GCAAGGTGGC CATCGCGGGC TCCGGCGCCG CCAATCTGAT CGGTGGCGAG
GCGCGTGCCA CCATCGGCGG CAGCCGTGCG CAGGCGGACG ACGGCCCGGT CCACCTGGCG
GAGATCGACG CCGGGGCGGT GAATCTCGGC GCCGATCGCG GTAGCCGCAT CGATGCCCTG
GCCGGCAACG TCGCCTTCTC CGGCAAGGCC GGCATCGGTG CCGCCGTGGC CTACAACGCC
ATCGACACCG CGGTGGCGGC CGAGATCAAC CACGCCGATC TCGCGCTGTC CGGGGGCGAC
CTGAGCATCG ACGCCGGCAG CTCTTCCGAC ATCTACGGGG TGGCCGTCTC CGGCGGGGGC
GGTGGCAAGG TGGCGCTCAA CGGCAGTGCC ATCATCAACT TCATCGATCT CGATGTCAGC
GCCGGGCTGG GCAGCAGCCG GGTGCGCGAC ACGGGTGCCG TGCGAATCAC CGCCGGCGAC
CATGGCGCCG GTGGCCAGGC CGCCATCTGG AGCCTGGCGG GAGCCATCAA CGGTGCCGGC
AAGGTCGCGC TGGGTGCGGC GGTGGCCTAC AACGAGATCG AATCCCGCTT TGCCGCGGAC
ATCACCGGCG CCGACATCGA AGCGCTGGGC CCGGTGGATG TGACCGCAGA GGTCAGCGGT
GACATCAACA CCCTGGGCGC CGCCGGCGGA GGCGCCGGCA AGGTGGCCCT GGGGGGCGCC
GCCACCGTGA GCCGGATCGA CAACACCGTC ACCGCAAGCC TGACCGGGAG CCGGCTTTAC
GCCCCGGCGG CGCTGGTCAC CGTGGCGGCG TCGCAGGATA GCCGTATCCG CGCCCTGGGC
GCTGCCATTC AGGGTGGCGG CAAGGTGGGC GGGGGCGCGG CAGTGACCGT TAACCAGATC
GGCTCCGGGG TGACTGCCGA GGTGACCGGC GGTGCGCCCG GTCTGCCGGC CGCCATCGAT
GATCCCGCCG GCGGCGGCGA CGCCGAGGCC CACTACCATC TCGGTCACCT GGTGGTGGAT
GCCCGCTCGG ACAACGAGAT TCAGACCATC GCCGCCGGTG CCGCCGCCGG GGGTGTCGGC
GGCGTGGCCG GTTCGGTCTC CACCAACCTG TTCGGCAACC GGACCAAGGC GCGGATCGCC
GACGGTGCCG ACGTGCTGGC CGAGGGCCAC GTGCTGGTGG ACGCCGCCAG CAGCGACAAC
GTGGCGTTGG TGGCCGGCAG CCTCGGCTTC GGCGGCAAGG CCTTTGGCGC TGCGGGCACC
GTCGCGGTCA ATATCGTGGA GAGCGAGACC CACGCCTGGA TCGGCGGTGA GGACCCCGGC
GACGCCACCG TGGTGGTCGC CCGGGCGCAA CAGCCGGGCA CGGTCAGTGG CCGTGACCAC
CGGTTGCACG CCATCCCCGG GCTGGTGGAC AAGTCCGACG GCGAGAACAC GTTTGATGAC
GAGGACGGTG TCGAGATCTA CGACAGCGAG GTCGACGACG CCGCCGAGGA CCGCGACGGC
ACCGGGGGCG GTAGCACCGG AGGCTTCTCG GTGGAGGACG AGGCGGGCGA TTACCTGCTC
AGCCGCCTGG TCCGGGATGA TGATCGCGGG GTGGACGGCG TCCGGGTGAG TGCCCGCTCC
GATCAGACCA CCAGCGCCGC CCTGGCCACC GCCGGCATCT CCGCCAACGT GGTGAAGGGC
GGGGTGGGGT TGGCCGCCAC CGCGTTGGTC AACCGCATCG CCGGCGAGAC CACCGCCGGT
ATCCGCAACA GCGAGGTCAG CTCGGAGACC GATGTGGCGG TCTCTGCCGG AGGGCATGTG
CATACCCGCG GCCTGGTCAT CGGCGCTGCC CTGGGCAGTG TCGGTGCCTC CGGTGCGGCG
GTGGTGGATG TGGTCACCCG CCGGACCCGC GCCGGGATCG ACGACGCCAC GGTGACCGCC
GGGGGGCGGA TCGACGTGGA CGCCGCCGGT TCTCAGAGCA CCAGCGGCCT GGCCATCGGG
GCCTCCGGCG GCACCTACGC CAGCCTGGCC GGCGGCGGGG TGGTCAGCCG GCTGGGGGCG
CAGACCCTGG CGGAGGTCAC CGGCAGTCGC CTGGAGGCGG GCGACGTGGG CGTGCAGGCG
GACAGCGACA CCGGGGTGAC GATGCTCACC GGCACCGTCT CGGTGGGTGC CGCGGCCGCC
GCCGGCTCCT TCAACGTGGC GGTGGTGGAC GCCGTCACCG TGGCCCGCAT CGCCGACAGC
GATCAGGACC GTTCCCGGCT CTCGGTGGAG GGCGAGGTGG ACGTGGCTGC GGAGTCGTTG
AACCGCTTCG GAACCATTGC CGCCAGCGGT GCCGCCGGCG GCACGGCCAT TGCCGGCACC
CTGGGCCTGA GCCTGCAACA GAGCGTCACC CAGGCCCGGA TCGAGGGCGC GGCCATCGGC
CAGGGGGACG ACGATGTCCC CGCGCAGGTT AACGTCCGCG CCCGCGACCA CCTGGAGGTG
GGGGCCTTTG CCGGTGGTGC CTCCCTGGGC ATGGGGATGG GCCTGGGGGC CGCCGCCAAT
GTCTTCAGCG GCCAGGCCTC CGTCCTCGCC GAGGTCAGAG ACGGCGCGGC GATCGAGGCC
GGCGCGGTGT CGGTCTCCGC CGAGCGCAGT GCCGAGCTGA GCCTGTATAC CGCCACCCTG
GGTGCCGGCC AGACCGGCTT CAGCGCCGGG GTCGGCGTGA TCCTCCTGGG GGTGGGCGCC
GGCGAGGTGA CGCCCACCCG GGGCGAGGAT GATATCGCCG AAGACGACGT GGAGGGCGGT
CTCGAGCGGG AGCTCAATGG CAACGGCGGC GGCACTCTGG ACACCGCCAA TACCTTCGCC
AACTTCAGTT TCGAGGGGGA TCCGGACAAT GACGCCAACA CCGAGGACGA TGTCGGCGGC
AGCCTCAGCG ACGATCAGCG GGACAGCCTG GAGAGCGACC TCGAGTTCAA CCTGACCGAG
TCGGTGATCG ATGGCGATGA CCACCAGACG GTGGCCCGCA TCGATGCCAG CGATCTCCGC
GCCGACCGGG TGGACGTGAC CGCCGATGAC CGGGTGGCCA CCCGCAACTA CGTGGGCTCC
GGTGCGTTGG GGGGTGTCGG CTTCGGCGCC GCGGTGGGCT TCACCCGGGT GGGTAACGGG
GTGGTGGCCG AAATCGGGGG TGACGGCACC CTGGATGTCG GCGAGCTCAC CATCCGAGCC
GGTCACGACG ACCTTGGTGG TGGCACTGCG GCGGAGACCC GCGCCTGGGC CGGGGCCGCA
GGCGGCATCG GCCTGGCCGC CGCCTATGCC GATGCCGATG TGACCACCGG GGTGCGTGCC
ACCCTGGGCC CGGTGAACCT CCAGCGCGAC GCGGCCGACG ACGCCGGCGT CGAGCTCACC
ATCGACGCCT TCGACCACGG CGGGACCTAT GCCGAGACCA TCGGCGTGGC CGCGGGCTTC
GTGGCCGGTG TGGGGGCGGC GATCTCCCGC GCGGCGCGAA CCTCCACGGT AATCGCCGAG
GTGGCCGACG ACACCGCCGT CGGCAGTGAC GACGAGGATT CGGCGGACTT CGATCTCTCC
CTCAGCGCCG TCTCCGACGG TGGCGTGGCG GTCGATGGCA TCGCCCTCGG GGCCGGCGTG
GTGGGCGGCG GCGCCGCCGC CCTGGCGCTG GGCGAGGAGC GCTCGACGGT GCAGGCGCGC
GTGGGCAACG GCGCTGACCT GCGGCTGGGC GACGGCGCGC TGACCCTGGA TGCCCTGGCC
CGGCCGGATG TGACCGCGGA CGGGGTCGGG GTGGCGGCAG CGGTGACCGG GGCCATCGCT
GCGGCGGTTT CCCGCGCCTA CTCTTCGGCC ACCGTTGAAG CGGCCGTCGG TGACTTAGAC
GGCCCCGGTG GCGCCGCGGT GCAGGCGCGC GCGGTGGATG TCCGGGCACG CAGCCTGCCG
GTGGGCAGCT ACGGTGCCGA GGCCGAGGCC ATCGCGGGGA GTATCGCCAA GGGCGTGGCG
GTCTCCGGGT CCTTCGCCTT CGCCACCGAC ACCGCCACCG TAAACGCCGG CCTGGGGCCG
GGGGCGGACC TGACCCTGGG CGGCGACGGG CTGTCGGTGT GGGCCGAGGC CGATCCCGCG
GCCCGGGCCC GGACCCACGC CCGCACCTTC GCCGGCGGGG TCGCCTTCGG TGCCAACCTC
TCGCAGGCCG CGTCCAATGC CGAGGTGGCA GCGGTGGTGG GTGACCAGGC CTCGGTGCGG
CTGGCCGACG GGGTGGACGA GGCCGATTTC ACCATCCGCG CCATCGCCAA TGACGACGGC
AGTCGTACCG CCTACGCCAC CGGCGCGGTG AGCGGTGGCG CGCTGCTGGT GTCCGCCAAC
GGCGGCTATG CCCGGGCCTA TGAGCAGACG GCGGTGGACG CCCGCATCGG CACCGGCGCG
ACCCTGGACG CCGGTGCCGG TGAGGTGCGG CTGGGCGCCG ACGCCACGCC CCACGCCCGG
GCCGTGCTCG CCAGCCAGTC TTACGGTGGC GCGTTGGCCA TCGGGGCCGC CATTACCGAC
GCCCGGGTGG CGGCCGACGT GACCGCTGCC ATCGGTGACG GAACCGCCAT TCTGGGCGGC
GGTGACTTGA CGGTGCACGC CCACGTCGCC CGCCCGGCGG GTGCCGACAG TGCCTTTGCC
CGCTCCGAGG CCAGCAGCGG CGCGCTGGCA TCGGCCAATG CGGCCCTGGC GGAGGCCCGC
AATCACGCCC GGTCGCGGGC GCTGACCGGG GCCGGTGTCA CCCTGCCCGG TGGCGTGGTG
AGCCTGCGGG CGGAGAACCA CAGCCGGCAG CACGGCCACA GTGCGTCGGA ATCCTACGGC
GCCTTCGCGG CCGGGATGAC CGTGACCCGG GTGGAGTCCG ACACCCTGAC CCACGCCATC
CTCGGCACCG ACAACCAGGC CAGCGACAAC GATGATCGGC CGGACGTGGT GGATGTCACC
ACCAACGCCA GCAGCCACGA CCACGGCTTT GGCCAGACTG CCACCGGGGG CGCCATCGAC
GGGGCCGCGG CGGAGGTGCA TACCTTCAGC ACCGCCGATA ACCGGGCCCA GGTGCTGGGC
CACCACAGCG ACCCGCTGCA AGTGGGTGTG CTGAATGTCC GGGCGCTGCA GGACACCCGC
TTCTCCGGCG TGGTGGACAC CAGCCGCGCG TCGGTGGCCG GTGCCAGTGG TGCCCGCCTG
TACCATAACG CCAACCGGGT CACGGGCATC GTCAACGTCG ACGAGGAGCG GGAGGCCTTC
TACGGGGAAA CCAGTGTAGT GGAGGCCGGG ATCGGCGACC ACGCCGATAT CCGCGCCCTC
TACGTGGACG TGATCGCCGA GCAGGCGGTG GAGCGCGTCG CCGACGCCGG TAACGACATC
CGGGTTGCGG GCGGCGGGGT GCTGGGTGCC TCCGCCGGCC GCGGCGCCAC GTATATTGAG
ACGATCACCC GGGCGGGTAT CGGCACCGGG GCCACGGTCA ACGTGGAGCT GGACCTGCTG
GTCCGCGCCA TCGAGGCGAT CACCGCCGAA CAGGTCGCCC GTCTGAACAC CGGCAACGCC
ATCGATATCG CCCGGGCCGA GTCGGTGATG CACACCGATC ACCAGAACGA CGTGATCATC
GGCGGAAACG CCAACATCAC CGCCGGTGGC GATGTGGTGC TCTCCAACGC CACCGAGTTG
GACCTGGTGG CTCATACCCG CGCCCGCACC TCCGGGGTGG CCAGTTTCGC CCGCGGCAAC
AGCGAGGCGG AGGCCGATGT GCGTGAGGGG ATCACCGTGG GCAACGGCGC GCGGCTGGCC
TCCGCCGGCA ACGTGGTCAT GCGCACCGGC CTGGGCGACG GCGGTTTGCC GGCTCGCAAG
GGCGTGGAGG CCGATACCTA CCTGTACAAC AAGGGGGCCC TGCCGCTGGA GAACGACCCT
CGCGTGCATG CGGACCTGGT GGTGGACAGC CGTATCGACG TGGCCGACAA CGCCGTGGTC
GAGGCCGGAC AGGACGCCTA TCTGATCGCG CCCGTGGGTT TCCGGTACCG GGCCCGGGGC
GAGGGCGAGG GCCAGGACAT CTACCGGCGC GCCGCCGAGT CCATCGCCAA CTTCTTCCGC
CGGCTGGTCC GTGCCGATGA GGTCTCGCTG GCGAAGGAGG TGCGTAGCGA GGACCTGGTC
CGGGGTAGCA CCATCACCAT CGACGGCCAG GTGGTGGGCG CCGGCGGCAT GCACCAATAC
CTGTGGGTGC GCCAGGGCGA CGATGACGAC GAACGCATCC TGGCCAGCCC GGGTGTCGGC
TACTTCTTCG AGGAACGCGA CGTGCGCGGT GACCTGGAGG CGCTGGTCGG TGCCTACCGG
GATCTGCTGG GGGAATACGT GGAGCCGGAT CCGGACGATC CGGAGGGCTC CGGTCTGCTG
TTCTGGCGCG CCGCCGCCCA GATCGAGACC CTCGCCGGCC GCCTGGCTGC CCTGGGGGAT
GAGGATGACG GGCCCTATGA CGGCGGCCGG GCGGTGCCCT TCATCGAGGT GGAGGACATC
ACCGCCACCG GTGGTGATGT GGTCGTCCGG GCGGATGCGG TGGAGGGTGG TGGCGAGGTC
CGGTTGGGCC CGGACCTGGA GGCCATCTTC AGCGACTTCG CCGGTGGGGA TCTCAACCAG
GTGCCGGCCG AGTTCCTGGA GCCGGGCGGG CCGAGCATTC GCATCCTGGT GGAGTCCAAC
GAGTGGCTGC AGGTCAACGA CCTGATCATC GACGACCGCC GTGACGGCAG CATCACCATC
GCCCAGGAGA GCGGCGCCTT CGGCGGTATC GCCGTGCCGA CCCGGGTCTC CGGGGTGGGT
GAACTGCCAA GCGGGCTTCG CTACGACACC GGCGGGGTGG AGGCCACCCC GGAGATCCGC
GTGGTGCACC AGTTCGAGGT CCCGCCCCAG GAGGACCACC TCTACACCAA CCCGGAGATC
CGCCTGGGCC ACGAGCCGGA CCCCGCCGAC CAACTGGAAG GCGAGTCGGT GACCGCCCGC
ATCATCAACC CGCGCGGTCT GGTGCAGGTG GGCAACCAGG CCGGCAGCAT TATCTCCTCG
GCCAGCATCG AAGCCGCCGC GGTGGAGATC GTCGCCGGTG GTAATTTCGT GCAGAACTGG
ATTCCCGGGT TCTTCCACGC CGGCGGGCAG CCCGAGGCCG CCGGCGAGGC CAGCCGCATC
CTGGCCGGGG GCGACATCTA CATCAGCGCC GAGCACATCA ATGTCAATGG CCTGATCCAG
AGCGGGTTCG ACGAGTGGAC GCTGGATTTC TCCTCCGACC TGCTGGGGAA TGCCGGGACC
ACCCTCGATT TCGACGCCGA CAAGGGCCAC AGCCTGGGTG ATTACCTGGA GTGGTACGGC
AAGGGTTATC GTGCCACCAA GGAGGTGGAG GAGGCCCCGA TACCCTTCTG GGCGCAGTTC
TTGCCGCCCT CCATTTGGCA CCTGCTGGGC TATGGCTTCG CCCCCGACGT CGACGCGGCG
GATGTGCCGG ACGATTTCCG CACCGCCTAC AACGCCGGCC GTAGCGGTGA GCAGTTCCTG
CAGCTGACCG GCGGCGAGGG CTTCGACGTG GAACTCGGCA CCGTGGCCAG CTTCTACGAT
GCCGAGGCCG ACCGCATCGA CATCCAGCGC GCCGCCGTGG GCGGTGGCTA CATCTTCCTG
GCGGGCGACG TGGTGAGCAC CGGCGGAGGC GAGATCCGGG CCTTGTCGGG TTTCGGTGAG
ATCCGGGTGG ATACCACCGG CGGCGGGGAG GACGCCTTCC GTACTCACCA GGATGTGGTG
CTCTATGACC TGGACGCCGG GACCCAACCC CATGCGCCCA GTTTCGACCT GCCCGAGGAG
GAGCTGATTT CTGCGGAGGG CCTAGCGCCC TGCCGGCCCT GCGTGAGGAT CTGGGATACC
GCCAAGCGTG ACGGCGAGGG CAACCCCATC GAGACCGTCT TCCAGCACGA CTGGAACCCG
GACCCAGCGG CCCCGGACGG GAGCTTCGGC ACCATCGCCA AGACCGTCTC CCGGCTCGAC
CGCGAGCAGC GTGACGATGA CTTCTTCCTG CTGCGCACCG TGGTCGACGA GGATGACCCC
TTCTATGGCC AGGAGCAGGA CGCAGGCTTC GGCCCCCGTC CGGGGGCCAC CGCGCAGTAC
CTGCCCCGCG AAGGGATGCG CTACCTCTAC GGCGACAGCG TGGATGAAGA CTCCGGGCGG
CTGCTGTTCC CGGACGACGA TGTGCCGTCA GGGCCGGGCT ATCAGTCCAT GCGGGCGGAC
CGGGCTATCG ACGTCGGGTT CATCGGCAAC CCGGTGGGTA CCATCGACAT CAAGAGCATC
GGTGACGTGA TCATCGACGG CAGTGTGCGC AACCCCCAGG GGCTGACCAC GGTGGAGACC
GTCGGTAGCA TCCGGGAGCC GGAGCAGGGC GGGCTGTTGC AGGGTCGCCA GCTCACTCTG
GAGGCCGGTG GCGCCGTCGG CAGTGAGGGC GCCCCGCTGC GGGTGATCGT GGGTGACGGC
GAGGTCACGC CGGACGATGC CTGGCTGGTG GCCCTGGCCG GGGATGGCGG GGTCTTCCTG
GACGGCCTGC GCGGCGATCT GCCGGTGCGC GAGGTGGGGA CGCTCAGCGG CCATACCGTG
GACATCGACG CGGCCCGTCA CTTGATCGAC CGCGGCACCG CCCCCGACCG GCCCACGGTC
TACGGCGCCC GGGTGCATCT GCGTGCCCGG GACGGCTCTA TCGGTGGCAG GGAGCAGGGC
GAATTCGTGC TGCCGCTGAC CATCGCCTCC GGGGTCGGGG TTGTGGTGGA CGATCCGGAC
GCGCTGCCGG ATTGGCTGAA CCTTCCGCTG GGCAGCGCCA ACGTAACCGC CCGCGCCGCC
GGCCATGTCC TGCTCGCGGA AGATGCCCGC GGACTGGCCA GTGCCGGTGT GCCCGTCGAC
GGTGACATTC ACGTGGAGGC GCTCCGCGCC GCCGAAGGCG ATGTCCGGCT CTTTATCGCC
GACGGCAAGC TGATCAGCGC CATCGACGCC GAGCTCGATA CCGACCGCAT GGTCCGCCTG
GGCGAGATGT GGGACTCGCT GGGGATGCTC GACGACGGCT CGCCCAACAG CGGCGTAGCC
CGCTTTGTCG AGGATCAGGT GGAGGCCTTC GAGGCCTCGG TGGAGAGCCG CTACCGTGAC
TGGTGGGAGC TGGACCGCCT GGCCGGGGAG GACGGCGGCT TCGACCTGGC CCCGGATCGG
GCCCGCATTT ACGCCCACCG GGCCGGCGTC GACGGCTCAC CGGAGGCGTG GGACGAGACG
GCGGACCCGC AGGGCGTGGC CGACAACACC GCCGCGGTCA ATGCCTGGCT GGAGGGCCGT
TACGGCGCGC TCAGCGATGA ATTGGGCGAA CTGCTGCTGG AGGGCAACGG CGACCAGTCC
CTGGCGGACC TGGGTACGGT GGTCGAGGAA CCGCTGGCCG AGGATCCGGC CACCGGGGAG
ATGCTGTTCT ATGAACGGGT CGAGTTCGAT TACCAGGCGC CGGACAGTGT CCGTGACCAG
CTCGCGGAAG GGGCGGAGTG GACCGAGGAG CAGTTGCTCA ACGTCCTCAG CGAGGACCGG
ATGCTGGAGA CTGTGGACAC CCAGTACCAG CGGGTGGAGA CCATTATCCA GGGGGAGAAT
GTGGCCATCG AGGCGCCGCG GGGTTCCATC GGCTCGTCTG ACGACCCGCT CTGGATCCCG
GCCCCCGAGG ACGAGGACAG CAACTTTGAT TTCACCGAGG AGGAACGCGC CGCCCTGGTC
AGCGCCCAGC CGGGCGATGT CCGGCTGATC GGCGAGGTGG TGTTGGTGGA TGGCCAGGTC
ATGGACCTGG GCGATTTCCT GCGCCTGCTC GGCACCGGGG GCCCCGGCGG GCTCACCCTG
GAGGATCTCG ACACGGTCGA TTTCGACGGC ATCGAGATCG AGCAGGTACG CTACGCCGGG
GTCATCGCCA CGGAGGGGGG GCAGTCGCAC CATGCCGCCG ATGACCTTGG CGGCGACATC
TTCCTGGGTG TCCCCGGCAC CAGCATCGCC CTGGAGCGGC TCTGGGGTGG TGGCGACATC
CGCGTGCGGG TCGGCGGCGG TATCGAAGCC CACGACGACC CGACACGGGG CGATGATCCG
GAAGATGCCA TCATCCGCGG CGGTGCGGCG GGGGTGCTGC TGGAGGCGGC CGGCGGCGGC
ATCGGCGCCG AGGGGTTGCC GCTCACCACC GCGTTGGACC CCCGCGCCCT GATCACCGCC
CGGGCCGTGG GCTCGGTGCA CCTGCACGCC CTGGGCGGTA ACCTGACCCT GGACAGTGTC
TTCAGCGAGC AGGGCGATGC CGTGCTGGCT GCCGACCAGG GGCATATCGC CCGTTTGGCG
CTGCCGGGCG CGCCGCTGGC GGACTTCGGC ACCCTGGACA TCGCCGCCGA TGAGATCCGG
CTGGATGCCG GCACCTTTGT CGGCGCGGTG GATCGCGACG CCTTCCTGGC CCTGGACAGC
ACGGCTGAGC CGGGGGACCT GGAGGCGGTC ACCGACGCCG CCATGCGCCT GCGCCACGGT
GACGACGCCC GGGTGACCGG TTCTGCCGGG GACCGCTTCC AGGTGTTCGC GCCGGAAGGG
GGGCTGGTGG TCCGCGATGT GGAAGCCGGC GGCCGGGTCG TGGCCGGCGC CTGGGAGGAT
GTCACCCTGG AGCGGGTGAC CAGCACCGCT GGCGACGGCG AGGCGGCCTT CGTCTTCAGC
GAGACGGGCT TCATCATCGG CAGCAATGAG GCCGGCCTGG AGGAACCGCT ACACCTGCGG
GCACTGGGCG GGGACGCCAC CACGCGGTTG ATCGCCCCGC TGGGCATCGG TCTGCCCGAT
GACTTCCTGT TCCTCGATGC CACCCGGATC AGCTCCGCCA CCACCCTGGG CGGTCACGCC
TTCCTCCACG GCACCACCGA TTTGCGAGCC GATCTGATAG ACGTGCCCAC CGGGCGGTTG
GAGGTGCGCG CGCCCCAGGC CATCGAAATC GATCGGTTGC GGGTTCACGA CCGGGTGGAC
CTCAAGGGCG ACGCCATCGA GGCGCACATC GAGCACACCG CCAACCCGGA CCCGCTGCCG
CTGGACGTGG TGGGCCTGGT CGAGCACTAC GCATCGGAGG TGGACCTGTC CGTGGACACC
CCGGCCGATC TCATCGTCGA GCGGCTCTAT GCCCGCCGCG CCCGATTGGA GACCAACACC
ACGCGAGTCG ACATGCCGGA CAACGACATC TTCCACTGGT TGGAGGTGTT CACGCCGGAG
GTCCACCTGT GGGCCGACAA CCAGCGGCCC GACCGGCGCG ACGTGGACGT CCAGCTCTAT
GAGCCGGGGT ACCGCTTCTT TGTCGACCAG GACGGGCGGC ACACCATCAC CAACGCCTTC
AGCGGCGCCT ACCGGCCCGG CTACCGCCTG GAGCTGGTCA ACTATCAACC GGGCCGCGAC
CGCGCCCGTT TCGATGTGGA TGGTCGCAGC ATTGTGCGCG ATGCCGGTCG GCTGGAGCAG
CGCATCGTGC CGATCCCCGA CACCCTGCCG GGGCTGCGGG CCTTCTGGCC GGACGACGAG
GTCTCGGTGC AGGTGGTCGG TGCCGGCCTG CCGGGGGCGC CGCCCGACTT CCCGGTTAAC
CTGGACTGGA CCGGTCTGTT GCAGGGCGAT GACGATCAAT CCGGGGCAGA GGAGAATGGC
GCTCAATGA
 
Protein sequence
MKNGTQGTRR GRRRHKPVPD HPVEGPGWRR DPLALAISAA LACGGAFPSA SAAGSSITVK 
DNKTETDISD APGVDGGTRY TITTESLTDS GRTGLNAFAE FILASGDRAD LVLPDGTLNL
INLVYDSRAK IHGELFSQLD GEFGEGHLLF ATPHGMLVGA EGAINAGALT AIAPRSSQLD
RILDGDVGLG ELMRGEVELD PEATIEIQGE IDADHVRLIG HQVLVQSGAR IEIADPIDDH
EAVFGSAVNI DGLESGAGIA VDAGDLVIAG ERRAEVYGDL LAQDGGVVVR AHNVGRSDLG
VTRADAEVAI GGRIEAEDIT LSARVDAEAE LDLVEALKGR VEALVPEGLE EVAESAIDSG
LEAVDDELDN QDLPDFGLGL GLVDASAIVT LEDGAELDAG GNVNIHAEAQ RTARAEAHAD
GDGLGGAFAL ASISGQTAVT VEEGASIVAG GDIDLRAASH NELVAEAVTE VGDGFDVGLG
GSVALGFLQS EPEDGGTTVQ VQEGATLAAG GDLGLSAFTE HDVTVDATTT LGSGGEAAFG
ATVAYLGLDW TTRTLLTGGA VADDTVEPPV GIVAGGDLRM VARTDQAIHT GASVETTGDI
SVGLAAAVAE LNNTTRVVLD GTVRAGGDAT LLAESITRAQ HTVATTRAAE DADNNGGNGN
DGPDEAADDQ GTSDYVAGLV GDVDTGEPEE TPGAGGNGGL ALDIGAAVAY TDARDSASTT
LGGNTQLLPP AGPLGPGGQV AIVSRRQLDN LSTRARSGTE ATGPDGDDGA NSINAAVSYA
RLDQIARTEI ADGARIEAGR IGVGAEVDLP ARVAWGDEDN EDGPGGLDDI AGADSDDLDD
WLTSEASTSS ETGGTVNIAG AVNLLDLTAT AEALVGTGDD PTDVTTLHAV GDGGDWETEL
GGDTTWHWDD AVAVAARSRV ETVDVAGQAA AAGTVGVGGG VNWHLRESTA RARLAGSADI
SADTGGVSVS ADRRDRAIVL TPIAGESDGG AVGLGASVAY ARIDGTSQAQ LDEGAMVAAG
GALAVTAGGD LHAEILTASK TSGDATAGIA GTVAFADISL DTTAAVHGQP VDNALEVGDL
YLHALDLQDY QVQASAEAQG DVAIGLAGAV LDTEAGTRAL LYRDVTTDGD VVLIADSETT
GRGVAASTRA ETGEAGDNGN GENNTGDDRD LNTVDFVQQR GERGNEGVDD GLQDVAGDPG
AGQSEEAGFD VNVGAAVAFT GADDSAGAHI GEAVTISGPG GGAAGDVAVL ARRAESGYRG
RAQTEVETAP EDDGVGVGVG AALTLTFMDN SAEALIGASA VVDAHRVGVA ADVILPDHFE
GPAWDGLETL RDLAQDDDDE LTYDEAESNL LDMADGWLTT YANAAAESED GVFDGAAAVN
FARYNVNADA WIGEGAEVTA GGTAGDTGWS TELRPAQDNG DPALSRRWNE AITVSAGADV
RTIDVAGNVG GLLTAGVAGD ATVGIGAGFA WLERGGHVSA GIADHAVVSA SHGQGTIGVH
AERFDQSIGI APSSGHGASF AGNGTVVVNR LDTDTVAAVS HAAAVDAAAL TVSAGRDLRW
WSVAGSITLA ESVAVGVGIS VNELRTDTRA VIGDVAGLRP AMAGGGDVTE LDDTEQGITA
DQVTVAARSD GLVGSASIAG GAAGETGEPP ATDNWTDSGL TNDVIGVTSD RTEGAEAPED
ETIAEAVQSG TGEPAGALDE QEASADDLDE DGDLANPDPT QPDLETFPDE GADIPDAGAD
ADSEETDLGV AVAGSASVNL SSLNTTALIE DVSLAARDAD TTDVTASAVA AVDTIAVSGS
GALVLGGNSS DPQVAIAGTV GVNLLGDHTR ARLVDTRIEQ PGQVTVEALR DGELISVGMG
LAITASGSTK AAAVAGSVSL SDISNETAAT IEGGTIGDPG SDPEPDDGSG VQVLAYDRSS
IGTGGGSLFG GKGGGFGAAV TLAQVRNTID AGILGTRITD VAEVTVDALS ATRIISAGAV
LGYGGKGAVG GAVVLNRIGN TTKAAIADRD DGDDVERAEI TANERVRVRA RSASDDERVA
LDDRIDAAGG GHVYNFSGEG TALAEPEAEG DRDWGDDSYE TGGDEFAGEG GDQAEIDSGY
DDAAEDARFD GGPLEGDAIV GVAGSISASG KASIGLAFSG NQIDNDYIAE VRGARIDGVD
GELSVDAADR SRVIGLGVGG GASGKVAIAG SGAANLIGGE ARATIGGSRA QADDGPVHLA
EIDAGAVNLG ADRGSRIDAL AGNVAFSGKA GIGAAVAYNA IDTAVAAEIN HADLALSGGD
LSIDAGSSSD IYGVAVSGGG GGKVALNGSA IINFIDLDVS AGLGSSRVRD TGAVRITAGD
HGAGGQAAIW SLAGAINGAG KVALGAAVAY NEIESRFAAD ITGADIEALG PVDVTAEVSG
DINTLGAAGG GAGKVALGGA ATVSRIDNTV TASLTGSRLY APAALVTVAA SQDSRIRALG
AAIQGGGKVG GGAAVTVNQI GSGVTAEVTG GAPGLPAAID DPAGGGDAEA HYHLGHLVVD
ARSDNEIQTI AAGAAAGGVG GVAGSVSTNL FGNRTKARIA DGADVLAEGH VLVDAASSDN
VALVAGSLGF GGKAFGAAGT VAVNIVESET HAWIGGEDPG DATVVVARAQ QPGTVSGRDH
RLHAIPGLVD KSDGENTFDD EDGVEIYDSE VDDAAEDRDG TGGGSTGGFS VEDEAGDYLL
SRLVRDDDRG VDGVRVSARS DQTTSAALAT AGISANVVKG GVGLAATALV NRIAGETTAG
IRNSEVSSET DVAVSAGGHV HTRGLVIGAA LGSVGASGAA VVDVVTRRTR AGIDDATVTA
GGRIDVDAAG SQSTSGLAIG ASGGTYASLA GGGVVSRLGA QTLAEVTGSR LEAGDVGVQA
DSDTGVTMLT GTVSVGAAAA AGSFNVAVVD AVTVARIADS DQDRSRLSVE GEVDVAAESL
NRFGTIAASG AAGGTAIAGT LGLSLQQSVT QARIEGAAIG QGDDDVPAQV NVRARDHLEV
GAFAGGASLG MGMGLGAAAN VFSGQASVLA EVRDGAAIEA GAVSVSAERS AELSLYTATL
GAGQTGFSAG VGVILLGVGA GEVTPTRGED DIAEDDVEGG LERELNGNGG GTLDTANTFA
NFSFEGDPDN DANTEDDVGG SLSDDQRDSL ESDLEFNLTE SVIDGDDHQT VARIDASDLR
ADRVDVTADD RVATRNYVGS GALGGVGFGA AVGFTRVGNG VVAEIGGDGT LDVGELTIRA
GHDDLGGGTA AETRAWAGAA GGIGLAAAYA DADVTTGVRA TLGPVNLQRD AADDAGVELT
IDAFDHGGTY AETIGVAAGF VAGVGAAISR AARTSTVIAE VADDTAVGSD DEDSADFDLS
LSAVSDGGVA VDGIALGAGV VGGGAAALAL GEERSTVQAR VGNGADLRLG DGALTLDALA
RPDVTADGVG VAAAVTGAIA AAVSRAYSSA TVEAAVGDLD GPGGAAVQAR AVDVRARSLP
VGSYGAEAEA IAGSIAKGVA VSGSFAFATD TATVNAGLGP GADLTLGGDG LSVWAEADPA
ARARTHARTF AGGVAFGANL SQAASNAEVA AVVGDQASVR LADGVDEADF TIRAIANDDG
SRTAYATGAV SGGALLVSAN GGYARAYEQT AVDARIGTGA TLDAGAGEVR LGADATPHAR
AVLASQSYGG ALAIGAAITD ARVAADVTAA IGDGTAILGG GDLTVHAHVA RPAGADSAFA
RSEASSGALA SANAALAEAR NHARSRALTG AGVTLPGGVV SLRAENHSRQ HGHSASESYG
AFAAGMTVTR VESDTLTHAI LGTDNQASDN DDRPDVVDVT TNASSHDHGF GQTATGGAID
GAAAEVHTFS TADNRAQVLG HHSDPLQVGV LNVRALQDTR FSGVVDTSRA SVAGASGARL
YHNANRVTGI VNVDEEREAF YGETSVVEAG IGDHADIRAL YVDVIAEQAV ERVADAGNDI
RVAGGGVLGA SAGRGATYIE TITRAGIGTG ATVNVELDLL VRAIEAITAE QVARLNTGNA
IDIARAESVM HTDHQNDVII GGNANITAGG DVVLSNATEL DLVAHTRART SGVASFARGN
SEAEADVREG ITVGNGARLA SAGNVVMRTG LGDGGLPARK GVEADTYLYN KGALPLENDP
RVHADLVVDS RIDVADNAVV EAGQDAYLIA PVGFRYRARG EGEGQDIYRR AAESIANFFR
RLVRADEVSL AKEVRSEDLV RGSTITIDGQ VVGAGGMHQY LWVRQGDDDD ERILASPGVG
YFFEERDVRG DLEALVGAYR DLLGEYVEPD PDDPEGSGLL FWRAAAQIET LAGRLAALGD
EDDGPYDGGR AVPFIEVEDI TATGGDVVVR ADAVEGGGEV RLGPDLEAIF SDFAGGDLNQ
VPAEFLEPGG PSIRILVESN EWLQVNDLII DDRRDGSITI AQESGAFGGI AVPTRVSGVG
ELPSGLRYDT GGVEATPEIR VVHQFEVPPQ EDHLYTNPEI RLGHEPDPAD QLEGESVTAR
IINPRGLVQV GNQAGSIISS ASIEAAAVEI VAGGNFVQNW IPGFFHAGGQ PEAAGEASRI
LAGGDIYISA EHINVNGLIQ SGFDEWTLDF SSDLLGNAGT TLDFDADKGH SLGDYLEWYG
KGYRATKEVE EAPIPFWAQF LPPSIWHLLG YGFAPDVDAA DVPDDFRTAY NAGRSGEQFL
QLTGGEGFDV ELGTVASFYD AEADRIDIQR AAVGGGYIFL AGDVVSTGGG EIRALSGFGE
IRVDTTGGGE DAFRTHQDVV LYDLDAGTQP HAPSFDLPEE ELISAEGLAP CRPCVRIWDT
AKRDGEGNPI ETVFQHDWNP DPAAPDGSFG TIAKTVSRLD REQRDDDFFL LRTVVDEDDP
FYGQEQDAGF GPRPGATAQY LPREGMRYLY GDSVDEDSGR LLFPDDDVPS GPGYQSMRAD
RAIDVGFIGN PVGTIDIKSI GDVIIDGSVR NPQGLTTVET VGSIREPEQG GLLQGRQLTL
EAGGAVGSEG APLRVIVGDG EVTPDDAWLV ALAGDGGVFL DGLRGDLPVR EVGTLSGHTV
DIDAARHLID RGTAPDRPTV YGARVHLRAR DGSIGGREQG EFVLPLTIAS GVGVVVDDPD
ALPDWLNLPL GSANVTARAA GHVLLAEDAR GLASAGVPVD GDIHVEALRA AEGDVRLFIA
DGKLISAIDA ELDTDRMVRL GEMWDSLGML DDGSPNSGVA RFVEDQVEAF EASVESRYRD
WWELDRLAGE DGGFDLAPDR ARIYAHRAGV DGSPEAWDET ADPQGVADNT AAVNAWLEGR
YGALSDELGE LLLEGNGDQS LADLGTVVEE PLAEDPATGE MLFYERVEFD YQAPDSVRDQ
LAEGAEWTEE QLLNVLSEDR MLETVDTQYQ RVETIIQGEN VAIEAPRGSI GSSDDPLWIP
APEDEDSNFD FTEEERAALV SAQPGDVRLI GEVVLVDGQV MDLGDFLRLL GTGGPGGLTL
EDLDTVDFDG IEIEQVRYAG VIATEGGQSH HAADDLGGDI FLGVPGTSIA LERLWGGGDI
RVRVGGGIEA HDDPTRGDDP EDAIIRGGAA GVLLEAAGGG IGAEGLPLTT ALDPRALITA
RAVGSVHLHA LGGNLTLDSV FSEQGDAVLA ADQGHIARLA LPGAPLADFG TLDIAADEIR
LDAGTFVGAV DRDAFLALDS TAEPGDLEAV TDAAMRLRHG DDARVTGSAG DRFQVFAPEG
GLVVRDVEAG GRVVAGAWED VTLERVTSTA GDGEAAFVFS ETGFIIGSNE AGLEEPLHLR
ALGGDATTRL IAPLGIGLPD DFLFLDATRI SSATTLGGHA FLHGTTDLRA DLIDVPTGRL
EVRAPQAIEI DRLRVHDRVD LKGDAIEAHI EHTANPDPLP LDVVGLVEHY ASEVDLSVDT
PADLIVERLY ARRARLETNT TRVDMPDNDI FHWLEVFTPE VHLWADNQRP DRRDVDVQLY
EPGYRFFVDQ DGRHTITNAF SGAYRPGYRL ELVNYQPGRD RARFDVDGRS IVRDAGRLEQ
RIVPIPDTLP GLRAFWPDDE VSVQVVGAGL PGAPPDFPVN LDWTGLLQGD DDQSGAEENG
AQ