Gene Haur_1882 details


Gene Information

Locus tag: Haur_1882
Symbol:
ID: 5733771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2253390
End bp: 2270180
Gene Length: 16791 bp
Protein Length: 5596 aa
Translation table: 11
GC content: 56%
IMG OID: 641279026
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001544653
Protein GI: 159898406
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
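
The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2253390
END_BP = 2270180
GENE_LENGTH_BP = 16791
PROTEIN_LENGTH_AA = 5596


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(GENE_LENGTH_BP)

# Both computed values can be compared against the listed fields.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length and protein length are consistent with the coordinates.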


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAGC ATTCAGATGA TTCGTTGCGC AACCAGACTG ATCCTGCATC TCGGCGAGCT 
CCGCTTTCGG CGGCCAAAAA AGCGTGGCTC GAAAAACGCC TGCGCGGTGA TGGCACCACG
CCGCCGCCGC AATCGAGTAT TCCTCGCCTC GCGACCTATG ATCAGGTGCC ATTATCGTTT
GCGCAAGAAC GGCTGTGGTT TCTCAGCCAA TACGAACCAA CGAGCAGCAA CGCCTATATT
ATTCCCTTGG CCGTGCGGAT CGATGGCCCC ATTGATCCAT CGCTGTTGGA GCAGGCTTTG
CAATTGGTGG TGGATCGCCA TGCCAGCTTC CGCACCACGT TTCATGCGCA GAATGGGGTG
CCGTTTCAAC GGGTTGCCCC ACAGCTGCCG CTCAGGTTGC CCGTGCTTGG GCTGGACGTG
GCTGATGCGA GCGATGAGGC GGCGGTGTTG CAGGTCGTGC TGGATCAGCT TGTACCGCTG
TTGCAGCTAC CGTTTGATCT TGAGCATGGG CCGTTGTTGC GAGCGACCTT ACTGCGTTTA
GCTGCCGAAT CGCATGTGCT GCTGCTGATT TGCCATCATA TTATTAGTGA TGGCTGGTCG
ATGGGGGTGC TGCTGCGCGA TTTTGCCAGC TTTTTAGGCG CGTTACGCAC CAATACTGCG
CCGGATGTAC CGCCGTTGGT GGTGCAAGCC CCGGATGTCG CGGTGTGGCA ACGGCAACGC
TTGCAAGGCC ACTATCTGAC GACACTCCAA GATTTTTGGA AGCAGCAGTT GGCGGATCTC
GAACCATTGA ATGTGCCTAC CGATTTTGTC CGACCGGCCC AGCAATCCTA TCGTGGGGCG
ACATTGAGTT TTCAGCTCCC CGCTGCGCTC AGCACCCAAC TTCAGCGTAT GGCGCAACAG
CATGATGTGA CCCCATTTAT GCTGCTGCTT GCCGCATTTC AGGCCTTTTT GGCGCGACTG
AGTGGGCAAC AGGATCTGGC GATTGGTTCC GTGCTAGCGA GTCGGGCCGA TGCTGACCTT
GATCCGGTGA TTGGCTTTTT GGTGAATACG TGGACGTTGC GGAACAACAT AGATGTAGCG
CAGCCATTGG CGCAGCTCTT GCCCACGGTG CGACGTACCG TCCTGGCAGC GTTTGAGCAC
CGCGATTTAC CGTTTGAGCA GGTGGTGCAA CTGGTGCAAC CGGAGCGTGA TCTGAGCCGT
TCGCCCGTGT TTCAGGTGAT GATGACCTAT CAAAACGTGC CGCAACGGCA GATGGAATGG
GGGGATGTTC GGCTGACCCC GATTAGCCTG CCCAGCACGG TGGCGAAATT CGACCTAACT
CTGGCGCTGA GCGAAAGCCC CGAGGGTTTT CGTGGGGTGA TGGAATATCG GAGCGATTTA
TTTCGGCGCA GCACGATTGC CACGATGGTT GCGCGTTGGG AAATGTTTTT ACACGCGATT
GTGGCTGAGG GTTCCACACC ACTTGCCCGA TTGCCGTTGG TCTTGCCTGC GGAACGCAGC
TTATTGCTTG ATACGCTGAA TGCGACCACA ACCGCCTACC CACACGATCA AAGCGTAGCA
AGTTTGTTTG CCGAACAAGC CCGCCTGTGG CCGGAGCGGA TTGCGCTTCG TTTTGGTGAG
CACAGCCTCA GCTATCACGC GCTTGAGCAA CGGGCCAACC AGCTAGCGCA CCATCTGCAA
CTGCTGGGTG TTGGGCCAGA GCATGTGGTT GGTTTGTGTG TTGAGCGCTC GTTGGACTTA
GTGGTGGCGA TTCTGGCGAT TCTCAAGGCT GGCGCAGCCT ATGCCCCGGT CGATCCGAGC
TATCCCGTTG AGCGTTTGGC CTGGATGCTG AGTGATTTAC AGCCAACGGT GGTGATTGCA
CAGCACGGCG TGCTCGACCG CTTACCGTCG GTTGCGTGTT CCGTGGTTGT GCTTGAAACC
ATAGCCGCGC ACCTCGCAGC GTATCCCACG ACTGCGCCAA CCGTGGACAT CAGCCCCGAA
AATTTGGCCT ATGTGATGTA TACCTCTGGT TCAACAGGCC GACCCAAAGG GATTATGATC
AATCAGCGGA ACATTGTGCG ATTGGTCCGC AACACCACGT ATGCGGCATT TGGGCCAGAC
CAGGTTGGGT TATTGCTGGC AACAGTGGCA TTTGATGCTT CGACGTTCGA ACTTTGGGGG
TGTTTGCTGA ATGGTGGACG CTTAGTGATC GCCCCACCGC AGCAACTCAG CCTTGCCGAA
TTGGGCCACT TGGTGGAGCG CGAACAGATT ACGACGCTCT GGTTGACCGC CGGATTGTTC
CATCAAATGG TGGATCATGC GCTGGATCGA TTGGGTTCGT TGCGTCAATT ACTGGCCGGT
GGCGATCGAC TGTCGCCCGT GCATGTACAC AAAGTGCTGG AACGCTGGCC GCAGTGTCGC
CTGATTAATG GGTATGGCCC AACGGAAAAC ACCACATTTA GCTGTTGTCA GCAGCTTAGT
GCAACCACTG ACCTGGCGCA GGGCGTGCCG ATTGGGCAGC CGATTGCGAA CAGCACGGCC
TATATTCTTG ACCGGTTGTT GCAACTGGTT CCCATAGGGG TTGTAGGCGA ACTGTATTTG
GGTGGCGCAG GCTTAGCGCG AGGGTATTTA GCGCGTCCAG ACCAGACGGC GGCGGCATTT
ATCCCGAACC CCATGAGCCA AACGGCGGGC GAACGCCTGT ATCGCTCGGG GGATCTGGCG
CGGTATCGCG ATGATGGGAC GATCGAATTT ATTGGACGAC GGGATCAGCA AGTCAAGGTA
CGCGGGTATC GGATTGAGCT GGAAGAAATC GTTGGCGTGT TGCTGGCACA ACCACAGGTG
GATGATGCGG TGGTGGTGGT GCGGGAGGAT CGGGTTGGTG ATCAGCGCTT GGTGGCCTAT
CTGGTGGGTG ACAATCCGGC GATTGAGCTG ATTGAACAAG CGGTGCAAGG CCAGGTCCCG
AGCTATATGC TCCCGAGTGC CTATGTTGTG CTTGATGCCT TGCCGTTGAC GGCGAATGGC
AAGGTTGATC GGCGGCGGTT GCCAGCGCCG AGCTATGCCG CCATCGCGAA CGATGATCCG
CCACAAACCG ATTTAGAGCA GGCGATAGCG GCGATTTGGG CCGAGGTCTT GGCGGTGCCG
AGCATTCAAC GCCAGACCAA CTTTTTCCAA GTAGGTGGGC ATTCGTTGTT GGCTACCCAA
GTTGTGGCGC GGGTGCGTGA ACTGCTGCCA ACGGTTGATC TGCCTGTGCG CCGCCTTTTT
GAGCTACCGA CCTTGGCAGC GTTTGTGGCT AGTCTCGACC CAACACCGCA TAAAGATGCT
GCTGTGGTGC TACCTGCTCT GGTGGCACAA CCACGACCGG AACATCTTCC ATTATCGTTT
GCCCAAGAAC GTTTGTGGTT TTTGAACCAA CTTGATGCGA CGAGCAATCA AGCCTATACG
ATTCCGCTGG CAGTGCGCAT TGTTGGCGAC CTGCCGCCAG CGGTGATCCA GATGGCCTTA
CAACAGCTGG TAGATCGGCA TTCCAGTTTG CGCACAACCT TTTATGCACT TGATGGTCAG
CCCTATCAAC GGATTGCGCC GACCCTCATG CTAGCGTTGT CGTTACACGA TCTGCGTGGT
CGGAGCGAAG CCGACGTGCA TGCGGCGATT CACGCGGCAA TTGCCGAACC ATTTGATTTG
GAGCGCGGGC CGCTATTGCG GGTAACGTTG CTGCGCGAAG CTGATGATCG CCATGTGTTG
GTGCTGACTT TGCACCATAG CATCACTGAT GGCTGGTCGA TGGAGCTATT GTTACGTGAA
TTTGCCCAAG CCTTGGCCGC TGCTCGCCTT GGTGAACCGA CCCGTTTTGC CCCGCTGGTG
GTCGATGTGG TCGATGTCGT GCTGTGGCAA CGAGCGGTAT TGCGTGGCGA GCGCTTGGCC
CAATTGCAGG GGTATTGGCA GACCCAGTTG CATGGGGTTG CTCCCTTGGC TTTGCCGACA
GATTATCCAC GACCTGCGGT GCAGCAGTTT GCCGGGGCAA CGCTGCGAGT GACAATCCCA
ACAGTGGTGC TTGAACAGTT GCGCGTCCTT GGGCAGACCC ATGGGGCGAC ATTGTTTATG
GTGCTGATGG CGGCCTATCA GGCATGGTTG GCTCGCCTGA GTGGGCAGCG CGATATTGCG
GTTGGTACGC CAATTGCCGG ACGTACCCAA GCTGAAATGG AAGGATTGAT TGGCTTTTTT
GCCAATACGT TGGTTGTGCG GAGTGATGTG CGGCGGGATC TTGGGTTTAC GGCGCTGCTG
GAACAGGTAC GGGCAACACT CTTGACTGGT TATAGCCAGC AAGAATTGCC ATTTGAGCAA
ATGGTGCAGC TGGTCCAGCC AGAGCGCGAT CTGAGTCGCT CGCCGTTATT TCAAGCGATG
TTGACCCTGC ATCATGCCCA ACCGACCAGC ATGGATTGGG GCGAGATTGC GCTCGAACCA
ATTGACTTGG CGGGAACGGT AGCGAAATTT GATCTGTCGT TGGGCTTGGT GGAGACGACG
AGCGGCTTAG TCGCAACATT TGAATATAGC ACAGCCTTAT TTACCCAGGC CACGATCGAG
CGCTGGTCGG GGCACTGGCT GACGTTCTTA ACTGTAATTG CAGCAAATCC AACCCAATCA
ATCGGCGAAT TGGCCCTCTT GACGACAGCC GAACAAGCAC AGGTATTAGT TGGCTGGAAT
CAAACGCGGA CGATCTATCC GCCAGCACAG GGTTTGCATC ATGTGATTGA AGCTCAAGTT
GCACAACATC CTACGGCGAT TGCGTTGCGC TATGAAGATC GAACGCTTTC GTATGCCGAA
CTGAATGTGC GGGCCAATCA ACTGGCGCAT CGCTTGATCG ATTTAGGGGT GCGACCTGAT
ACGCTGGTGG CGATTTGTGC CGAACGATCG ATCGAAATGG TGATTGGCTT GTTGGGGGTT
TTGAAAGCGG GCGGGGCATA TGTGCCTTTG GATGCTAGCT ATCCGATTGA TCGACTTGAG
TTTATGCTGG CCGATGCACA ACCGCTGGTG CTCTTAACCG CGCTGACTGC TGGGCAAACC
AACTCCGAGC TCCAGCAATT AATTGAATTT CAAGCATGTC CACGGCTTGA TCTGATGCAA
TTGCAGCTGC TTGCTGATCA GCCAACCCAC AATCCTGCGG TAGCCATCGA GGGCCATAAT
TTGGCCTACA TGATCTATAC CTCAGGCTCG ACTGGCAAGC CCAAAGGGGC ATTAAATCAA
CACGATGCGA TTATCAATCG TTTATTGTGG ATGCAAGCGG AGTATCGGTT GACGGCAGCC
GATCGGGTGC TCCAAAAAAC TCCGTTTAGT TTTGATGTAT CGGTGTGGGA ATTTTTCTGG
CCATTGCTGG TTGGGGCACA ATTGGTCTTG GCCCGACCGG AAGGCCACAA AGATCCGGAA
TATTTGACCG AGGTCATTCA GGCCGAGCAG ATTACCACAG TGCATTTTGT GCCATCGATG
CTACGCCTGT TTCTGGAGCA TCGGCAAGCA CCTATGTGTA CCTCGTTGCG ACGGACGATC
TGCTCGGGTG AGGCGTTGCC AGCAGATGTA GCTCAGCAGT TTTTGCAACA GCTTCCGCAA
AGTGGCTTGC ATAATTTGTA TGGGCCAACC GAAGCCGCAG TAGATGTGTC GTATTGGGCG
TGTAGGCCCG ATGCAACGGC AAGTAGTGTG CCGATTGGGC GACCAGTAGC CAATACCCAA
TTGTACCTGT TGGATCAGGC GCTGCAACCT GTACCGATCG GCTGTTTTGG CGAATTATAT
ATTGGTGGGC TGCAAGTAGG TCGCGGTTAT TATAATCGGC CAAACTTGAC GGCAGAGCGC
TTTGTGCCCG ATCCATTTAG CCCAATTGCC GGAGCGCGAC TCTATCGAAC AGGCGATCTT
GCGCGTTGGC GAGCAGATGG CAACATCGAA TATGGTGGGC GGATTGACCA CCAAGTCAAG
GTGCGTGGCT TGCGCATCGA GCTGGGCGAA ATTGAGCAGC AATTGTTGGC GTTGCCCGAC
ATAACCGATG CAGTTGTGGT GTTACGCGAA GATCAGCCAG GCGATCAACG CTTAGTTGCG
TATGTGGTTT CCTCAAACGA GACGTTGCTG ATAAGTATAG TTCGTCAGCG TTTGGCTCAG
CACTTACCTG AGTTTATGTT GCCGAATGCC CTGGTACAGA TGGATGCACT ACCATTGTCA
CCGAATGGCA AACTTGATCG TCGTCGCTTG CCTGTGCCGA GCTATGCTGA GCAATTGCTT
GATGATGCAC CACCACAAAC GGAGTTGGAG CAAGCAATTG CCGAAATTTG GGCTTCGGTC
TTGAGGGTAT CAACGATCCA ACGCCAGAGC AACTTTTTTC AGCTGGGCGG GCATTCGTTG
CTGGCGACTC AGGTAGTGGC CCGCGTTCGC GAGCTGCTGC CTCAGCTTGA TCTGCCGTTG
CGACGGTTAT TTGAATTGCC AACCTTGGCA GCGTTTGCCG CGAGTTTGCA GAATGCCCCC
GCTGAGCTGT TGGCTCCAGT GCTACCAGCA CTCATACCAC AGCCGACTCC AGAGCATATT
CCACTGTCAT TTGCCCAAGA ACGCTTGTGG TTTTTGAGCC AACTTGATGC TGAAAGCAGT
CAGGCCTATA CCATTCCAAT GGCAGTACGA ATTGTGGGTG ATCTACCGCC GATGGTGGTG
CAGTCAGCCC TGCAACAGCT TGTGGATCGC CATGCCAGTT TACGCACAAG TTTTTATGCG
CTTGACGGTC AGCCCTATCA GCGGATCGCA CCAACCCTAA CGCTTGGATT GGCGCTCCAT
GATCTCCGTG GTCGTAGCGA GGCTGATGTC CATGCGGCGA TTCAAGCGGC AATTGCTGAG
CCATTTGATT TGGAGCATGG ACCGTTGCTG CGGGCGATGT TATTGCGCGA GGCCGCTGAT
CGCCATGTAT TGGTGCTGAC CTTGCACCAC ACGATTACCG ATGGTTGGTC GATGGAGTTG
CTGGTGCGCG AGCTGGCCCA AGCCCTGGCG GCTGCTCGCG TGGGTGAAGT CGCTCGTTTT
GCACCATTAG TGGTCGATGT GGTTGATGTG GTGGTGTGGC AACGGGCAGT GTTGCGTGGC
GAACGCCTGA CTCGTTTGCA GGCCTATTGG CAATCACAGC TGAGTGATGC CACTCCCTTG
GCTTTGCCGA CGGATTATGC GCGACCAGCG GTGCAACAGT TTGCCGGGGC AACAGTTGGG
GTGGTGATCC CTGATGCTGT AGTGAGGAAG TTGCGGCTGC TGGGGCAGGC CCATGGGGCG
ACATTGTTTA TGGTGTTGAT GGCGGCCTAT CAAGCATGGT TGGCACGGCT GAGTGGGCAG
CGCGATATTG CCGTTGGCAC ACCGATTGCC GGACGCACCC AAGCTGGAAT GGATGAATTA
ATCGGATTTT TCGCCAATAC GTTGGTTGTG CGGAGTGATG TGCGGCGAGA TCTTGGATTT
ACGGCGCTGC TGGAACAGGT GCGCGAGACC TTGTTGGCGG GGTATAGCCA TCAGGAATTG
CCGTTTGAGC AGGTGGTGCA GGTCGTTCAG CCCGAGCGCG ATCTGAGCCG TTCGCCCTTG
TTTCAAACGA TGCTAACTCT ACAACACGCA CCCCAAGCAA GCCTTGATTG GGGTGAGATT
GCCCTTGAAC CAATCAGCCT CACCGGAACG ATTGCTAAAT TCGATTTGAC CCTTGGTTTG
TTGGAAACAC CCCATGGCTT GATCGGCAGT TTTGAGTATA GCACGGTGTT GTTTGCTGAG
ACCACAATCG AGCGCTGGTC AAAGCATTGG CAAACCTTTC TGACCGCACT GGCCGATGAT
CCAGCGCTGC CCGTTGGCCG TTTGCCACTT ATGCAGCCTG CCGAGCAAAC AGCGATTGTG
CAACGCCAAT CGCCAACGCT GGCGGTGACT GGCCCTACGA CCTTGGCTGA AGCCCTCACG
CAACAAGCGG CTGACACACC TACTGCCATT GCCGTGCAGA CAGAGGGATC TGTATGGACG
TATGCGCAAA TGAATACCGC CGCCAATCAG TTAGCGTGGG AGTTGCGTGA ACTAGGCGTT
GGCCCTGAGG TGGTGGTGGG AATGTACCTT AACCGCACGC CGCTCCTACT GGTGACGTTA
CTGGCGATTC TGAAGGCTGG TGGTGCCTAC CTACCGCTCG ACCCTCAGCA CCCGATTGAA
CGAGTAACAT GGATGCTGGC GGATAGTCAG GCACTGGTGG TGGTGACTGA GCAGCATCTA
AGTGCCCGCC TGAACGCATC ACCATGTCAA GTGCTGGATA TCGAAACCGC TTGGTTACGT
ATTACCGCAG AAAATTCTCC TTCCGCACCA CCATTAAGCA GCACTGCCGC CAATTTAGCG
TATGTGATTT ATACCTCGGG TTCGACGGGT GTTCCCAAAG GCGTAGCAAT TCCTCACCAT
ACCGTATTGG CCTTGCTGGC CTCGATGCGC GAACAAATAC TGATGAGAGT AGGTGATGTG
CTGCTGGCGC TGAGCACGAT GGCCTTTGAT ATTTCGGTGA CAGATGTCTT TTTGCCGTTG
ACGAGTGGTG CGACAATCTA TCTGGTTACG AGTGACGTAG CCCGAGATCC TGTCTTGTTG
CGGGATGTGC TGGAGCGCCA GCCGATTCGA CTGGTGCAAG CGACCCCCGC AACATGGCGA
TTGTTGATTT CGGCAGGCTG GCAGGGCGAC CCGACGCTGA CGGCGATTGC TGGCGGCGAA
GCCTTGCACG CGGATCTCGC GGCGGCCATT CGGATGCGGA GCAATGCGTT GTGGAATATG
TATGGCCCAA CCGAAACAAC GGTTTATGCA ACCCGGGCCT TGATTGAGAA CGATGATATT
AGCCTTGGTT GGGGCTTAGA TAATGTACGG TTATATGTGC TCGATCAGGC TGGGCAAGTT
GTACCATTGG GGGTTGCAGG TGAGTTATAT ATTGGGGGAT CAGGGATTGC CCGTGGCTAT
CTTGGGCGAC CGAGCCTGAC AGCGGAGCGC TTTGTGCCAG ACCCATTGAG TGGTGAGGCA
GGTGCACGGT TGTACCGAAC GGGAGATTTG GTGCGGCAAC GGGCCGATGG ACGGTTTGAC
TATGATGGGC GGATTGACCA CCAAGTCAAG GTGCGTGGAT TCCGCATCGA GTTGGGCGAA
ATTGAGCATC GCGTATTGGC GTTGCCCGCC GTGACCGATG CGGTGGTGGT GGTGCGCGAG
GATCAACCTG GCGATCAACG CCTCGTGGCC TATGTGGTGG CACCCCATGC GGCACTCACG
CTGGATGCAG TCCGACAGCA GTTGGCGCAG CACCTGCCCG AGTATATGCT GCCCAATGCA
GTGGTGGTGC TGGAGAGCAT GCCACTCTCA CCGAATGGCA AGGTCGATCG GCGGCGCTTG
CCCGCACCCA GCTATGAACC ACTGACGAGT GATGACCCGC CACAAACAGC GCTGGAGCAG
CTGGTGGCGA CGATCTGGGC GGAGGTGTTG GGGGTACCGA GCGTGCAGCG CCACGCCCAC
TTTTTCCAGT TGGGTGGTCA TTCGTTGCTG GCAACGCAGG TGGTGGCGCG GGTGCGCGAC
CTGCTGCCGA CGGTCGAGGT GCCATTGCGA CGGTTGTTTG AACTGCCAAC CTTGGCAGCG
TTTGCGGCGA GTCTCGACTC AACACTGCAT GAAGCGGCTG CGGTGGTGCT GCCCGCCTTG
GTGCGCCAGC CAACTCCCGA GCATATTCCC CTGTCGTTTG CCCAAGAGCG CTTGTGGTTT
TTGAACCAAC TGGATGCGGC AAGCAGCCAA GCGTACACGA TTCCGCTGGC GGTACGAATT
GTGGGTGCAG TACCACCAGC GGTGGTGCAA GCAGCGTTGC AGCAGGTGGT GGATCGGCAT
GCGAGTTTGC GGACGACCTT TTATGCGATC GATGGTCAGC CCTACCAACG GATTGCCGCG
CAGTTGCGAC TGGATGTGCC GGTGGAGGAT GTGGGTGGGT GGAGTGAGGC GGCGGTGGCA
GCGGCACTGG CGCGGGTGAT TGCGGAACCC TTTGATCTGG AGCGAGGGCC ATTGCTGCGG
GCGACGGTGC TGCGGGAGGC TGCTGATCGC CACGTATTGG TGCTGACCTT GCACCACACG
ATTACCGATG GTCAATCGAT GGAGATTCTG CTGCGTGAGC TGGCGCAGGC GTTGGCGGCG
GCACGGCAGG GAGCGGTGGT GGAGTTTGCA CCGTTGGCGG TGGATGTGGT CGATGTGGTG
GTATGGCAAC GGGCAGTGTT GCGTGGCGAA CGCCTGACCC GTTTGCAGAC GTATTGGCAG
CAGCAGTTGC GTGATGTCGC TCCCCTGACC TTACCGACGG ACTATCCGCG ACCGGCGGTG
CAACAGTTTG CTGGGGGGAC GGTTGGCGTG ACGATTGCGG CGGAGGTGGT GGCCGAGTTA
CGGGCACTGG GGCAGGCGCA TGGGGCGACG TTGTTTATGG TGCTGATGGC GGCGTATCAA
GCGTGGTTGG CGCGGCTGAG TGGGCAGCGC GATATTGCGG TGGGGACACC GATTGCGGGA
CGAACGCAGG CGGGGATGGA GGGCTTGATT GGCTTTTTTG CCAACACCTT GGTGGTGCGG
AGTGCTGTAG CTTGGGATGA CGGGTTCACG ACGCTGCTAG GGCAGGTGCG CGAGACGTTG
TTGGCCGGGT ATAGCCATCA GGAATTGCCG TTTGAGCAGG TGGTGCAGGT GGTGCAACCG
GAGCGTGATC TGAGTCGGTC ACCCTTGTTT CAAACCCTGC TGACCGTGCA GCATGCTCAA
CCAACCAGCC TTGATTGGGG TGCGATTCAG CTGGAACCCA TGCGGCTCGC CGGAACGATT
GCCAAGTTTG ATCTCTCGTT GGGCTTGGTG GAGACACCCC AAGGCTTGAT CGGCAGTTTT
GAATATAGCA CGGCATTATT TGCTGAGACC ACAATCGAGC GTTGGGCAAC CCATTGGCAG
ACCTTCCTCA CGGCCATTGC GGCACAACCA ACCCACCCGC TCGGCCAGCT CACGCTGTTA
ACCAGTGCCG AACGCACCCT CGTGGTGCAG GGTTGGAACC AGACCCAGGT GGACTATCCT
CAGGTTCAGG CATTGCACCA CCTGATTGAG GCGCAGGTGG CTCGCACGCC GACGGCGATC
GCGCTGCATT ATGAGGATCA GGTGCTGTCG TATGGGGCAT TAAATGCGCG AGCGAACCAA
GTGGCACAGC GGTTAATAGC CTTGGGAGTG CAACCTGACG ATCTGGTGGC GATATGTGCC
GAACGGTCAC TGGAATTGGT GATAGGGTTG TTGGGGATCA TGAAGGCGGG TGGTGCCTAT
GTGCCCTTGG ATGCGAGCTA TCCGCAGGAG CGGCTGGCAT TTATGCTGGC CGATGCACGA
CCATTGGTGC TGGTGACGGC GTTGACTGCG ACGCAAACGA CTCCCGCACT CCAGCAGCTT
CTGGAGGCGC AGGTATGCCC ACGCCTCGAT TTGATGGACT GGTCAACGCT CACGCAGGAA
GCAACCGATA ATCCTGTCGT GGCCATGCAT GGCCAGCATC TGGCCTATAT GATCTATACC
TCTGGTTCAA CAGGAACGCC CAAAGGGGCA ATGAATAGCC ATGCTGCGAT CCTGAATCGG
TTGATGTGGA TGCAAGCAGA GTATCGGTTG ACGGCGGCGG ATCGAGTGCT GCAAAAAACG
CCGATGAGTT TTGATGTGTC GGTGTGGGAG TTTTTCTGGC CGTTGCTGGT GGGAGCGCAA
CTGGTGATCG CGCGACCGGA TGGGCACAAA GATCCGGCCT ATTTGGCGGA GGTGATCCAG
ACGGCACAGA TCACGACGGT GCATTTTGTG CCTTCGATGC TACGGTTGTT TTTGGCGCAT
CCGCAGGCAC GAGGATGTCG CTCGTTGCGA TTGACACTGT GCTCAGGGGA GGCATTGCCA
GCAGATGTGG TGTCACAGTT TTTTGCACAC ATCCCGCTGA GTGCGTTACA TAATTTGTAT
GGGCCAACCG AAGCGGCGGT GGATGTATCG TATTGGGCAT GTGCAGCAGA ATCGCTGACA
ACGAGTGTGC CGATTGGGCG ACCAGTGGCG AATACGCAGT TGTATGTACT TGATTCGTTG
CTGGAACCCG TGCCGATCGG CTGTTTTGGT GAAGTGTATA TTGGTGGGGT GCAAGTGGGG
CGCGGGTATC ACCACCGACC GAGTCTGACA GCGGAGCGCT TTGTGCCAGA CCCATTGAGT
GGTGAGGCAG GTGCACGCCT CTATCGAACG GGTGATGTTG GGCGATGGCG TGCGGATGGG
AGCATCGAGT ATGCGGGACG GATCGATCAT CAGGTGAAAC TGCGAGGGTT GCGCATCGAG
TTGGGCGAAA TTGAGCATCG CGTATTGGCG TTGCCCGCCG TGACCGATGC GGTGGTGGTG
GTGCGCGAGG ATCAACCTGG CGATCAACGC CTCGTGGCCT ATGTGGTGGC ACCCCATGCG
GCACTCACGC TGGATGCAGT CCGGCAGCAG TTGGCGCAGC ACCTGCCCGA GTATATGCTG
CCCAATGGTT TGATGCTGCT GGAGAGCGTG CCACTCTCAC CGAATGGCAA GGTCGATCGG
CGGCGCTTGC CCGCACCCAG CTATGAACCA CTGACGAGTG ATGACCCGCC GCAAACAGCG
CTGGAACAGC TGGTGGCGAC GATCTGGGCG GAGGTGTTGG GGGTACCGAG CGTGCAGCGC
CACGCCCACT TTTTCCAGTT GGGTGGTCAT TCGTTGCTGG CAACGCAGGT GGTGGCGCGG
GTGCGCGACC TGCTGCCGAC GGTCGAGGTG CCATTGCGAC GGTTGTTTGA ACTGCCAACC
TTGGCAGCGT TTGCGGCGAG TCTCGACCCA ACGCCGCATG AAGCGGCTGC GGTGGTGCTG
CCCGCCTTGG TGCGCCAGCC AACTCCCGAG CATATTCCCC TGTCGTTTGC CCAAGAGCGC
TTGTGGTTTT TGAACCAACT GGATGCGGCA AGCAGCCAAG CGTACACGAT TCCGCTGGCG
GTACGAATTG TGGGTGCGGT ACCACCAGCG GTGGTGCAAG CAGCGTTGCA GCAGGTGGTG
GATCGGCATG CGAGTTTGCG GACGACCTTT TATGCGATCG ATGGTCAGCC CTACCAACGG
ATTGCCGCGC AGTTGCGACT GGATGTGCCG GTGGAGGATG TGGGTGGGTG GAGTGAGGCG
GCGGTGGCAG CGGCACTGGC GCGGGTGATT GCGGAACCCT TTGATCTGGA GCGAGGGCCA
TTGCTGCGGG CGACGGTGCT GCGGGAGGCT GCTGATCGCC ACGTATTGGT GCTGACCTTG
CACCACACGA TTACCGATGG TCAATCGATG GAGATTCTGC TGCGTGAGCT GGCGCAGGCG
TTGGCGGCGG CACGGCAGGG AGCGGTGGTG GAGTTTGCAC CGTTGGCGGT GGATGTGGTC
GATGTGGTGG TATGGCAACG GGCAGTGTTG CGTGGCGAAC GCCTGACCCG TTTGCAGACG
TATTGGCAGC AGCAGTTGCG TGATGTCGCT CCCCTGACCT TACCGACGGA CTATCCGCGA
CCGGCGGTGC AACAGTTTGC TGGGGGGACG GTTGGCGTGA CGATTGCGGC GGAGGTGGTG
GCCGAGTTAC GGGCACTGGG GCAGGCGCAT GGGGCGACGT TGTTTATGGT GCTGATGGCG
GCGTATCAAG CGTGGTTGGC GCGGCTGAGT GGGCAGCGCG ATATTGCGGT GGGGACACCG
ATTGCGGGAC GAACGCAGGC GGGGATGGAG GGCTTGATTG GCTTTTTTGC CAACACCTTG
GTGGTGCGGA GTGCTGTAGC TTGGGATGAC GGGTTCACGA CGCTGCTAGG GCAGGTGCGC
GAGACGTTGT TGGCCGGGTA TAGCCATCAG GAATTGCCGT TTGAGCAGGT GGTGCAGGTG
GTGCAACCGG AGCGTGATCT GAGTCGGTCA CCCTTGTTTC AAACCCTGCT GACCGTGCAG
CATGCTCAAC CAACCAGCCT TGATTGGGGT GCGATTCAGC TGGAACCCAT ACGCCTCGCC
GGAACGATTG CCAAGTTTGA TCTCTCGTTG GGCTTGGTGG AGACACCCCA AGGCTTGGTG
GGGGGCTTTG AGTATAGTAC GGCATTATTT GCGCCAGCGA CGATGGAACG CTGGGCAAGC
CATTGGCAGA CCTTCCTCAC GGCCCTTGCG GCGCAACCAA CGTTGGCAAT AGGGTATCTC
CCATTGTTGA CACCCATTGA GCGCACCGCA ATTGTGCGCC GCCAAGCTCC AACACTGGCG
GTGACTGGCC CTACAACTTT GGCTGAAGCC CTCACGCAAC AAGCAGCTGC GACTCCGCAG
GGGCTGGCGT TGTATGAAGA CGGTATCGCC TGGTCATATG CCGAACTGGC CGCTGCCACA
AATCGGTTAG CGTGGCACTT GCAGGATATG GGGATTGGCC CCGATATGGT GGTAGGGGTG
TATCTTGACC GTTCGCCGCG CTTGATTGTA GCCCTTTTAG CAATCATCAA GGCTGGAGCC
GCTTACTTAC CCCTTGATCC GAATCATCCG CTGGAGCGCT TGATCTGGAT GTTAGCGGAT
AGCCAAGCCA AACTGGTACT GACCGAACAG CTGCATGCAA TTCAGCTTGC CGATCAGCCA
TGTCCAGTGC TCGATATTGA AACAGCGTGG TCTATAATCG AGGCCACACA GCCTGACAGT
GCTCCTCCAT TGCAGATTGT GGCTGAAAAT TTGGCATATG TGATTTATAC CTCGGGTTCG
ACGGGTGTTC CCAAAGGCGT GGCAATTCCT CATCAGACCG TATTGGCCTT GCTGGCCTCG
ATGCGCCACC AATTGCAACT TGGGCTTGGT GATAGAATTC TAGCCTTGAG CACGATGGCC
TTTGATATTT CGGTGGTGGA TGTCTTTTTG CCGCTGACAA GTGGGGCTAC TATTGCCTTA
ATCTCTAGTG AGGTGGCTCG TGATCCTGTC TTGTTACGTA GAGTGTTAGA GGGTCAGGCG
ATTAACGTGT TACAGGCAAC TCCTGCGACA TGGCGTTTGC TGATTTCTGC GGGCTGGCAG
GGCAACCCGA CGCTGACGGC GATCGCTGGT GGTGAGGCGC TGAGTGTCGA TTTAGCAGCA
GCGATTCGTG CCCGTAGTAA GCAGTTGTGG AATATGTATG GGCCAACCGA AACGACGGTG
TATGCAACCC GGGCCTTGAT CGAGAATGAT GATATTAGCC TTGGTTGGGG CTTAGATAAT
GTGCGGTTAT ATGTGCTCGA TCAGGCTGGG CAAGTTGTGC CGTTGGGGGT TGCAGGCGAG
TTATATATTG GGGGATCGGG GATTGCTCGT GGCTATCTTG AGCGACCGAG CCTGACAGCG
GAGCGCTTTG TGCCTGACCC ATTAAGTGGC GAAGCGGGCG CACGCCTCTA TCGAACGGGA
GATTTGGTGC GGCAACGGGC CGATGGACGG TTTGACTATG ACGGTCGGAT TGACCACCAA
GTCAAGGTGC GTGGATTCCG GATTGAGCTA GGGGAAATCG AGCAGCGATT GATGGCATTG
GCTGAAGTAA ACGACGCGGT GGTGGTGGTG CGCGAGGATC AGCCAGGTGA TCAACGCTTG
GTAGCCTACG TGGTTGCTGC CCCGAACGCG CTGACCCTCA GTACGGTGCG CCAACGTTTG
GCACAGCACC TGCCAGAATA TATGCTGCCG AATCTGCTCA TGGTGCTCGA TGCCTTCCCG
CTTTCGCCCA ATGGCAAGGT TGATCGGCGG CGCTTACCTG TGCCGAGTTA TCAGCACGAA
GCGACACTCG CAGGACTGCC GCGCAATGCC TTGGAATTGC AGTTGGTCAA TCTGTGGGAA
AACGTTTTAG CGCTCCAACC GATTAGCATT AACCAAAACT TTTTTGAATT AGGTGGCCAT
TCGATGCTGG CTGTGCGTTT GATGGCAGAA ATTCGGCGCG AACTTGGCTA TCAAAGCCCA
CTCGCTGCCT TGTTTCAATA TCCGACAGTT GCCGATTTAG CGCACTTTTT AGCTGAACAT
AATGGTGATT TGGCCTATTC GCCGCTGGTA ACGCTCAAAG CTAGTGGCAC GAAAGCCCCA
ATCTTCTTGG TGCATCCAAT TGGTGGTAAT GTCGGTTGTT ATTTCAATTT GGTGCGCGAA
CTCGATGCTG AGCATCCCTG TTATGGCTTG CAAGCGGCGG GCTTAGATGC TCAAACCATT
CCGGTCGCCG ATATTCCAAC CATTGCTAGC AACTATATTG CTGCAATTCA GGCGGTTCAG
CCGCAGGGAC CATATATTCT GGGTGGTTGG TCGTTTGGTG GCTTAGTTGC TTACGAAATG
GCCCAGCAAC TACAAGCGCA GCACCAAGAC ATTGCCCAAG TATTGCTGCT TGATAGTTAT
TTGATTGAGG CTGAGTATGT GCAGGCCGAG CCACATCCAA GCCAAATGCT AACTCAGTTT
ATCCGCGATA TTATGGGGCT TGCTTTGGCA ACCTTTGAAC AAAACCTTGA GTCATCGGTT
GAACAATCAA TCGACGTGCA ATTACAACAA CTCCAACAAC AACTTGCTGA CCGTGGCCTC
AGACTTGAGT TTGAGCAATT ACAGGCGATG TTTAATGTCT TCAAAACCAA TACCCAAGCG
ATGTATCGCT ATCAGGTTCA GCCTTCAAAT GCTAAGGTGA TGTTGTTCCA ATCGACCAAT
AGTCAATCAA TTGAACAACG CTTTGGCGAT TCAACTTCCG GTTGGAGCCG CTATACAGAT
GATCAATTGA CCGTTTATAA ATTGACTGGC GATCATTTTA GTATGATCCA TGCTCCCTAT
GTTCATGAGC TGGCAGCCTA TATCAATCAA GCATTGCACC AGCTCGATTA G
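
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks need to be stripped before any analysis. The following is a minimal sketch of that step, with a simple GC-fraction helper. The function names are hypothetical, and only the first line of the sequence is used as an excerpt, so the result is not expected to equal the whole-gene GC content listed in the record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short excerpt.
excerpt = "ATGTCTGAGC ATTCAGATGA TTCGTTGCGC AACCAGACTG ATCCTGCATC TCGGCGAGCT"
seq = join_blocks(excerpt)
gc = gc_fraction(seq)

print(len(seq), f"{gc:.3f}")
```

To process the full gene, the same `join_blocks` call could be applied to the complete block of sequence lines instead of the excerpt.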
 
Protein sequence
MSEHSDDSLR NQTDPASRRA PLSAAKKAWL EKRLRGDGTT PPPQSSIPRL ATYDQVPLSF 
AQERLWFLSQ YEPTSSNAYI IPLAVRIDGP IDPSLLEQAL QLVVDRHASF RTTFHAQNGV
PFQRVAPQLP LRLPVLGLDV ADASDEAAVL QVVLDQLVPL LQLPFDLEHG PLLRATLLRL
AAESHVLLLI CHHIISDGWS MGVLLRDFAS FLGALRTNTA PDVPPLVVQA PDVAVWQRQR
LQGHYLTTLQ DFWKQQLADL EPLNVPTDFV RPAQQSYRGA TLSFQLPAAL STQLQRMAQQ
HDVTPFMLLL AAFQAFLARL SGQQDLAIGS VLASRADADL DPVIGFLVNT WTLRNNIDVA
QPLAQLLPTV RRTVLAAFEH RDLPFEQVVQ LVQPERDLSR SPVFQVMMTY QNVPQRQMEW
GDVRLTPISL PSTVAKFDLT LALSESPEGF RGVMEYRSDL FRRSTIATMV ARWEMFLHAI
VAEGSTPLAR LPLVLPAERS LLLDTLNATT TAYPHDQSVA SLFAEQARLW PERIALRFGE
HSLSYHALEQ RANQLAHHLQ LLGVGPEHVV GLCVERSLDL VVAILAILKA GAAYAPVDPS
YPVERLAWML SDLQPTVVIA QHGVLDRLPS VACSVVVLET IAAHLAAYPT TAPTVDISPE
NLAYVMYTSG STGRPKGIMI NQRNIVRLVR NTTYAAFGPD QVGLLLATVA FDASTFELWG
CLLNGGRLVI APPQQLSLAE LGHLVEREQI TTLWLTAGLF HQMVDHALDR LGSLRQLLAG
GDRLSPVHVH KVLERWPQCR LINGYGPTEN TTFSCCQQLS ATTDLAQGVP IGQPIANSTA
YILDRLLQLV PIGVVGELYL GGAGLARGYL ARPDQTAAAF IPNPMSQTAG ERLYRSGDLA
RYRDDGTIEF IGRRDQQVKV RGYRIELEEI VGVLLAQPQV DDAVVVVRED RVGDQRLVAY
LVGDNPAIEL IEQAVQGQVP SYMLPSAYVV LDALPLTANG KVDRRRLPAP SYAAIANDDP
PQTDLEQAIA AIWAEVLAVP SIQRQTNFFQ VGGHSLLATQ VVARVRELLP TVDLPVRRLF
ELPTLAAFVA SLDPTPHKDA AVVLPALVAQ PRPEHLPLSF AQERLWFLNQ LDATSNQAYT
IPLAVRIVGD LPPAVIQMAL QQLVDRHSSL RTTFYALDGQ PYQRIAPTLM LALSLHDLRG
RSEADVHAAI HAAIAEPFDL ERGPLLRVTL LREADDRHVL VLTLHHSITD GWSMELLLRE
FAQALAAARL GEPTRFAPLV VDVVDVVLWQ RAVLRGERLA QLQGYWQTQL HGVAPLALPT
DYPRPAVQQF AGATLRVTIP TVVLEQLRVL GQTHGATLFM VLMAAYQAWL ARLSGQRDIA
VGTPIAGRTQ AEMEGLIGFF ANTLVVRSDV RRDLGFTALL EQVRATLLTG YSQQELPFEQ
MVQLVQPERD LSRSPLFQAM LTLHHAQPTS MDWGEIALEP IDLAGTVAKF DLSLGLVETT
SGLVATFEYS TALFTQATIE RWSGHWLTFL TVIAANPTQS IGELALLTTA EQAQVLVGWN
QTRTIYPPAQ GLHHVIEAQV AQHPTAIALR YEDRTLSYAE LNVRANQLAH RLIDLGVRPD
TLVAICAERS IEMVIGLLGV LKAGGAYVPL DASYPIDRLE FMLADAQPLV LLTALTAGQT
NSELQQLIEF QACPRLDLMQ LQLLADQPTH NPAVAIEGHN LAYMIYTSGS TGKPKGALNQ
HDAIINRLLW MQAEYRLTAA DRVLQKTPFS FDVSVWEFFW PLLVGAQLVL ARPEGHKDPE
YLTEVIQAEQ ITTVHFVPSM LRLFLEHRQA PMCTSLRRTI CSGEALPADV AQQFLQQLPQ
SGLHNLYGPT EAAVDVSYWA CRPDATASSV PIGRPVANTQ LYLLDQALQP VPIGCFGELY
IGGLQVGRGY YNRPNLTAER FVPDPFSPIA GARLYRTGDL ARWRADGNIE YGGRIDHQVK
VRGLRIELGE IEQQLLALPD ITDAVVVLRE DQPGDQRLVA YVVSSNETLL ISIVRQRLAQ
HLPEFMLPNA LVQMDALPLS PNGKLDRRRL PVPSYAEQLL DDAPPQTELE QAIAEIWASV
LRVSTIQRQS NFFQLGGHSL LATQVVARVR ELLPQLDLPL RRLFELPTLA AFAASLQNAP
AELLAPVLPA LIPQPTPEHI PLSFAQERLW FLSQLDAESS QAYTIPMAVR IVGDLPPMVV
QSALQQLVDR HASLRTSFYA LDGQPYQRIA PTLTLGLALH DLRGRSEADV HAAIQAAIAE
PFDLEHGPLL RAMLLREAAD RHVLVLTLHH TITDGWSMEL LVRELAQALA AARVGEVARF
APLVVDVVDV VVWQRAVLRG ERLTRLQAYW QSQLSDATPL ALPTDYARPA VQQFAGATVG
VVIPDAVVRK LRLLGQAHGA TLFMVLMAAY QAWLARLSGQ RDIAVGTPIA GRTQAGMDEL
IGFFANTLVV RSDVRRDLGF TALLEQVRET LLAGYSHQEL PFEQVVQVVQ PERDLSRSPL
FQTMLTLQHA PQASLDWGEI ALEPISLTGT IAKFDLTLGL LETPHGLIGS FEYSTVLFAE
TTIERWSKHW QTFLTALADD PALPVGRLPL MQPAEQTAIV QRQSPTLAVT GPTTLAEALT
QQAADTPTAI AVQTEGSVWT YAQMNTAANQ LAWELRELGV GPEVVVGMYL NRTPLLLVTL
LAILKAGGAY LPLDPQHPIE RVTWMLADSQ ALVVVTEQHL SARLNASPCQ VLDIETAWLR
ITAENSPSAP PLSSTAANLA YVIYTSGSTG VPKGVAIPHH TVLALLASMR EQILMRVGDV
LLALSTMAFD ISVTDVFLPL TSGATIYLVT SDVARDPVLL RDVLERQPIR LVQATPATWR
LLISAGWQGD PTLTAIAGGE ALHADLAAAI RMRSNALWNM YGPTETTVYA TRALIENDDI
SLGWGLDNVR LYVLDQAGQV VPLGVAGELY IGGSGIARGY LGRPSLTAER FVPDPLSGEA
GARLYRTGDL VRQRADGRFD YDGRIDHQVK VRGFRIELGE IEHRVLALPA VTDAVVVVRE
DQPGDQRLVA YVVAPHAALT LDAVRQQLAQ HLPEYMLPNA VVVLESMPLS PNGKVDRRRL
PAPSYEPLTS DDPPQTALEQ LVATIWAEVL GVPSVQRHAH FFQLGGHSLL ATQVVARVRD
LLPTVEVPLR RLFELPTLAA FAASLDSTLH EAAAVVLPAL VRQPTPEHIP LSFAQERLWF
LNQLDAASSQ AYTIPLAVRI VGAVPPAVVQ AALQQVVDRH ASLRTTFYAI DGQPYQRIAA
QLRLDVPVED VGGWSEAAVA AALARVIAEP FDLERGPLLR ATVLREAADR HVLVLTLHHT
ITDGQSMEIL LRELAQALAA ARQGAVVEFA PLAVDVVDVV VWQRAVLRGE RLTRLQTYWQ
QQLRDVAPLT LPTDYPRPAV QQFAGGTVGV TIAAEVVAEL RALGQAHGAT LFMVLMAAYQ
AWLARLSGQR DIAVGTPIAG RTQAGMEGLI GFFANTLVVR SAVAWDDGFT TLLGQVRETL
LAGYSHQELP FEQVVQVVQP ERDLSRSPLF QTLLTVQHAQ PTSLDWGAIQ LEPMRLAGTI
AKFDLSLGLV ETPQGLIGSF EYSTALFAET TIERWATHWQ TFLTAIAAQP THPLGQLTLL
TSAERTLVVQ GWNQTQVDYP QVQALHHLIE AQVARTPTAI ALHYEDQVLS YGALNARANQ
VAQRLIALGV QPDDLVAICA ERSLELVIGL LGIMKAGGAY VPLDASYPQE RLAFMLADAR
PLVLVTALTA TQTTPALQQL LEAQVCPRLD LMDWSTLTQE ATDNPVVAMH GQHLAYMIYT
SGSTGTPKGA MNSHAAILNR LMWMQAEYRL TAADRVLQKT PMSFDVSVWE FFWPLLVGAQ
LVIARPDGHK DPAYLAEVIQ TAQITTVHFV PSMLRLFLAH PQARGCRSLR LTLCSGEALP
ADVVSQFFAH IPLSALHNLY GPTEAAVDVS YWACAAESLT TSVPIGRPVA NTQLYVLDSL
LEPVPIGCFG EVYIGGVQVG RGYHHRPSLT AERFVPDPLS GEAGARLYRT GDVGRWRADG
SIEYAGRIDH QVKLRGLRIE LGEIEHRVLA LPAVTDAVVV VREDQPGDQR LVAYVVAPHA
ALTLDAVRQQ LAQHLPEYML PNGLMLLESV PLSPNGKVDR RRLPAPSYEP LTSDDPPQTA
LEQLVATIWA EVLGVPSVQR HAHFFQLGGH SLLATQVVAR VRDLLPTVEV PLRRLFELPT
LAAFAASLDP TPHEAAAVVL PALVRQPTPE HIPLSFAQER LWFLNQLDAA SSQAYTIPLA
VRIVGAVPPA VVQAALQQVV DRHASLRTTF YAIDGQPYQR IAAQLRLDVP VEDVGGWSEA
AVAAALARVI AEPFDLERGP LLRATVLREA ADRHVLVLTL HHTITDGQSM EILLRELAQA
LAAARQGAVV EFAPLAVDVV DVVVWQRAVL RGERLTRLQT YWQQQLRDVA PLTLPTDYPR
PAVQQFAGGT VGVTIAAEVV AELRALGQAH GATLFMVLMA AYQAWLARLS GQRDIAVGTP
IAGRTQAGME GLIGFFANTL VVRSAVAWDD GFTTLLGQVR ETLLAGYSHQ ELPFEQVVQV
VQPERDLSRS PLFQTLLTVQ HAQPTSLDWG AIQLEPIRLA GTIAKFDLSL GLVETPQGLV
GGFEYSTALF APATMERWAS HWQTFLTALA AQPTLAIGYL PLLTPIERTA IVRRQAPTLA
VTGPTTLAEA LTQQAAATPQ GLALYEDGIA WSYAELAAAT NRLAWHLQDM GIGPDMVVGV
YLDRSPRLIV ALLAIIKAGA AYLPLDPNHP LERLIWMLAD SQAKLVLTEQ LHAIQLADQP
CPVLDIETAW SIIEATQPDS APPLQIVAEN LAYVIYTSGS TGVPKGVAIP HQTVLALLAS
MRHQLQLGLG DRILALSTMA FDISVVDVFL PLTSGATIAL ISSEVARDPV LLRRVLEGQA
INVLQATPAT WRLLISAGWQ GNPTLTAIAG GEALSVDLAA AIRARSKQLW NMYGPTETTV
YATRALIEND DISLGWGLDN VRLYVLDQAG QVVPLGVAGE LYIGGSGIAR GYLERPSLTA
ERFVPDPLSG EAGARLYRTG DLVRQRADGR FDYDGRIDHQ VKVRGFRIEL GEIEQRLMAL
AEVNDAVVVV REDQPGDQRL VAYVVAAPNA LTLSTVRQRL AQHLPEYMLP NLLMVLDAFP
LSPNGKVDRR RLPVPSYQHE ATLAGLPRNA LELQLVNLWE NVLALQPISI NQNFFELGGH
SMLAVRLMAE IRRELGYQSP LAALFQYPTV ADLAHFLAEH NGDLAYSPLV TLKASGTKAP
IFLVHPIGGN VGCYFNLVRE LDAEHPCYGL QAAGLDAQTI PVADIPTIAS NYIAAIQAVQ
PQGPYILGGW SFGGLVAYEM AQQLQAQHQD IAQVLLLDSY LIEAEYVQAE PHPSQMLTQF
IRDIMGLALA TFEQNLESSV EQSIDVQLQQ LQQQLADRGL RLEFEQLQAM FNVFKTNTQA
MYRYQVQPSN AKVMLFQSTN SQSIEQRFGD STSGWSRYTD DQLTVYKLTG DHFSMIHAPY
VHELAAYINQ ALHQLD