Gene Haur_1857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1857 
Symbol 
ID5733746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2160990 
End bp2170289 
Gene Length9300 bp 
Protein Length3099 aa 
Translation table11 
GC content53% 
IMG OID641279001 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001544628 
Protein GI159898381 
COG category[H] Coenzyme transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase
[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAACGCT CGCCCGAACT GGTAGCACTG ACGGCTCCGT TTGCTGATCT TGGGCTTTCG 
TCGCGTGAAG CGGTTGGTTT GAGTGGCGAT TTACAGGCAT GGCTGGGGCG CAAAGTCGCT
CCAACTGTGC TCTGGGAGTA TTCAACCATT CAGGCCTTGA GCGATTTTTT GGCGCACGAT
CAAGCGCAAC AATTGCCTTT GCCAAAGCCC AAGCCCAGCC AAGCAAGCGC CACTTCCAGC
TCGGCAATTG CGATTGTGGC GATGAGTTGC CGCTTGCCTG GAGCCGATTC GCCTGAAGCG
TTGTGGCAAT TGTTGCTCGA AGGTCGTAGC GCAATTGGGT TTGTACCTGC CGATCGCTGG
GATGCTCAGG CCTTGTATAG CCCTGAAGCC CGCACTCCTG GCAAAATCAA CACGCGTTGG
GGTGGTTTTC TCGATCAGGT CGATCAATTC GATCCACAGG TGTTTGGCAT TTCAGGGCGC
GAGGCTAGCC GCATCGATCC TCAACAGCGC CTAGCTTTGG AGGTTGCTTG GGAAACCTTT
GAGCGAGCAG GCATTGCGCC TGATCAATTA GTGGGTAGCG CAACCGGGGT TTTCCTAGGC
ATTAGTAGCA ATGATTATGC GCGTTTGCAA TTTGCCCAGC TTGATCAGCT TGATGCGTAT
GCTGGCACTG GCAATGCCCA TAGCATTGCT GCCAATCGTT TGTCGTATGT GTTTGGTTTG
CAAGGCCCAA GCATGGCGCT TGATACCGCT TGTTCATCGT CGTTGGTCGC TGTGCACTTG
GCTAGTCAAA GTTTGCTGGC GGGCGAATGT GAGCAAGCGC TGGCTGGCGG GGTCAATCTG
ATTCTCAATC CTGAGTTAAG CGTCACTTTT GCTCAAGCCC AAATGCTTTC TGGCACGGGC
GAATGCCACA CCTTCGATGC TGCTGCCGAT GGCTATGTGC GCAGCGAAGG CTGTGGCATG
GTGCTGCTGA AGCGACTCGA TGTTGCCGAA GCGGCGGGCG ACCCAATTTT GGCCGTCATT
CATGGCTCAG CGGTAAATCA GGATGGGCGC AGCAATGGGT TGACTGCGCC GAATGGTCAG
GCTCAGCAGG CGGTTATTCG CCAAGCGCTG GCCAAAGCCC AGATTCAGCC TGATCAATTA
AGTTATATCG AGGCCCATGG CACGGGCACA CCCTTGGGCG ACCCAATTGA AGTTGCGGCC
TTGCAGGCAG TGCTGGGCGA ACGCCAACAA CCCTGTTTGC TTGGCTCGCT CAAATCTAAT
CTTGGGCATC TTGAGGCTGC CGCTGGGATT GCAGGCTTGA TCAAGCTGGT TTTGGCGTTT
CAGCAGCAAA TAATTCCCGC CCAAGCCAAT TTTAAGCAAC GCAACCCCCA AATCGAACTT
GGCTCAGCCT TAGAAATCGC TACAACCCCC CAGCCTTGGT ATAGCTTTGG CAGTTATGCG
GGCATTAGCA GTTTTGGCTT TGGCGGAACT AATGCCCATG TGATTCTCGG TGCTGCGCCG
ATTCAACCCA AGCGATTGCC GCAACCAAGC CCAGCTCCAA TCCAGTTGTT GGCTTTGCAA
GCCAATAGCG AAACTGCATT ACGCCAACTT ACTGAGCGTT ATCAGGCTTA TTTGGCGCAA
ACTGAGGTAA ACTTGGCCGA TATTTGCTGG AGCGCTTACC ACCAACGGGC CACGATGCGC
CATCGCCTAA TAGTCTCTGC TACAGACAAA ATACAAATGC TCGAACGCTT GCAGCACGCT
TGGCAAGCTC AAGCCACAGG TAGCATTTAT GCTGAGCAAC CCCAGCCCGC TCCCCGCATT
GCCTTTGTTT GCTCTGGCCA AGGTAGCCAA TACGTTGGTA TGGCGCAAAC GCTTTATCAA
ACCCAGCCGC TTGTGCGCCA AATTCTCGAT CAAGCGAACA GCATTCTCAA TGAGTATTTG
GCGATTCCAC TGCTTGATGT GTTGTATCAG CCGGACCATG GTGCGTTGCT GCGCGATACC
CGTTACACCC AGCCCGCCAT TTTTGTAGTG AGCTATGCCC TTGGCCAACT TTGGCGGTCG
TGGGGGATTG AGCCTGTCGC TTTGCTTGGC CATAGCATTG GCGAATATAG CGCGGCAGTG
TTGGCTGGAG TTTGGAGCTT TGAGCAGGGT TTGCGCTTGG TCGCCCAACG TGCTCAATTG
ATGCATGGCT TGCCCGAACA TGGCGCAATG CTAGCGATTC GCTCCCCGCT CGAAAGCATT
GAGCCATTGC TCGCACAGCA TCAGCTTGAT TTGGCGGCGA TAAACGGGCC AAATGCCGTA
GTTGTCGCGG GCAGCGTGGC AGCCATCAGC CAATGTGCTG TTGAATTGAA TCAACTTAAC
ATAACAAATA AATTGCTTGA TGTCTCACAT GGCTTCCACT CACGTTTGAT GCAGCCGATG
TTGGCCGATT TCCAACAGGT GCTGAGTGCA TATCCCGCCA TGGCTCCTCA GATTCGTCTA
ATTGCCAACC TCGATGGCTC GTGGCACGAA CAAGCTCCTA GTGCTGAATA TTGGGTCGAG
CACACCCGCC AGCCAGTGCA GTTTTATCGC GGTTTGCAAA GCTTGGTTGC TAGCGGCGTG
AGCCATATGC TCGAACTTGG TGGCCATAGC ACATTGATCG ATCTAGGTCG CCAAGCTGGG
CTACCCAATC TGACTTGGCT GGCGAGTTTG CGCCGCCAAC AAGCCGATTG GGAAACGCTC
TACCACGCGG CGGCAACGTT GTTGGCTCAT GGTTGCCAGC TGAATTGGGC CGCCATGAAT
CCCGATTATC AACCACAGGC GGTATTACTG CCAACCTATG CTTTTGATCG CCAACGTTAT
TGGTTTACGG AAGGAACTGG TATGAACCAA GCTAGCGCTC AACCAACGCC TGCTAGCGCC
TCAAGCCGCC ATAGCACGAT TTTGGCTGAA CTTCGCAGCC TGACTGCCAA TTTACTCCAC
GTCAAGCCCG AGCAGATCAA TATCCACAGT TCATTTGTTG AAATGGGCGC AGATTCGTTG
GTGATGATCG ACGCTGGTCG GGCGATTGAA TCACGCTATA ACGTGCATAT TACGATGCGC
CAATTGTTTG AAGATTTGGC TAGCCTTGAT GCTTTGGCGC GTTTTCTTGA TGCGCATGGC
ACGTTTGAAG CTGAGCCAGC CCCAAGTGAG CCAAGCGTGG TTCCGATTGT CGCCCAAGCG
CAACCTGTTG CTCCAGCAGC GGTTACTCCA GTGGCAGCGG CTGGCTTAGA GGCCGTGGTA
CAGCAACAAT TAGCACTCAT GCAACAGCAA TTGGCCTTGT TGGCAGGCCA ACCAGCGGCA
GTAACCCCGA TTCAGCCAGT AACTCCAGCT AGTGCAACCC CTGCGACCCC AGTGATTAAG
CCAGCCTCAA CTCCCGCTGC GGCTGCCCCC AAAGCCTATG TGCCCTATCA ACCAGTTCGG
CCCGGCTCAA TCACGGCCAA CGATCTGAGC AGCCAACAAC AAGCCCATTT GCAACAATTG
ATTAAGGATT ACACCACGCG GACTGCGACT TCCAAGCAGC TGACCCAAGC CTATCGTGCG
CAACTAGCCG ATAATCGCGA ATCAGCGGGT TTTCGCTTCT CGATCAAAGA GATGTTGTAT
ACCTTGATTT GCGAACGTTC GGAAGGTAGC AAAATTTGGG ATGTTGACAA TAATCAATAT
CTTGATTTGA CCATGGGTTT TGGAGTTAAT TTATTCGGCC ATCGACCTGA AATAACCACC
CAAGCTTTGG CGAACCAACT TGCTCAAGGC TATCAGCTTG GCCCACAAAC CCGCTTGGCA
GGCGAAGTTG CCCAATTAAT TTGCCGAATT ACGGGTATGG AGCGGGTGGC GTTTTGCAAT
TCGGGCACTG AGGCAGTGCT CACGGCAATT CGCGCTGCTC GCACGGTGAC TGGCCGCAAA
AAAATAGCGC TGTTTGCTGG CTCGTATCAT GGGTTTTATG ATGCAACCTT GGCGACTGCC
CAAGCTGGAG CCGCCACCCG TTCGGTTCCG CTAGCGCCTG GCATTCCTCA AGGGATGGTC
GATGATATTG TGGTGTTGGA TTATGTCACG CCTGAGAGTT TGGTCACGCT TGAGCAATTG
CTGCCCGAAT TAGCAGCTGT ACTGGTCGAG CCAGTTCAAA GCCGCCGCCC TGATTTGCAG
CCGCAGGCAT TTTTGCAAGC TGTGCGCGAA TTAACCAAGA CTCACGGCAG CTTGTTAATT
TTTGATGAGA TGATTACGGG CTTTCGGATT GCAGCGGGCG GGGCACAGGC TTGGTTCGGC
ATCGAAGCCG ATCTTGCAAC CTATGGCAAG ATCGTTGGTG GCGGGATGCC GATCGGCGTG
GTGGCTGGGC GGGGCGCGAC CCTCGATGCG CTTGATGGCG GCTTTTGGCA ATATGGTGAC
GATTCCTTTC CTCAAGCTGA AACTACCTTT GTCGCTGGCA CCTTCTGCAA ACATCCCTTG
GCTCTGGCGA GTGCTAAAGC CGTGCTGACA GCGATTGAGC ACGCTGGCCA AGGGCTATAT
GACCAGCTGA ATCAACAAAC TGCCAGTTTT GCGGCTGAAA TGAACGCCTA TTTTGCCCAA
GCCGAAGCGC CAATTAGTGT GGTGCATTTT GGCTCGCTCT TTCGCTTTAG CTTCAAGCAA
AACCTTGATT TGTTCTTTTA CCATTTGTTG TTGCAGGGCA TTTACATTTG GGAAGGCCGT
AATTGTTTCT TCTCAACCGC CCACAGCACC GCCGACGTGG AATGGCTCAA ACGGGCGATT
CGCAACGCGG TCGAGGCATT GCAAGACGGC GGATTTTTGC CCAAGCCACA ACGTCAATTG
AACCAACCCC ACAGCTTTGC CCTAAGCGAA GATCAATATC ATCTGTGGGT GCTGGGCCAA
CTTGGGCAGC ACGAGGCGAT TGCCTACAAT CTGCCAACTG GCTTGGAAAT TCGCGGCCAG
CTGGATTTAG TCCGCTTGGA GCAGGCCTTC AATCTGGTGG TGCAACGCCA TGAGAGCTTG
CGCACAATCA TTGCCAGCGA TGGCACGCAG CAAATTGTTC AGCCCCAGCA GCCAATCACG
ATTAATTTCA GTGATTTTTC AAGTGCAGCC AATCAGCAAC AGGCGCTTGA TCAATGGTTT
ACTCAACATA ATCAACAAGT ATTTGATTTG AGCCAAACCA ACCCATTGCG CTGTAACGTG
GCGCAGCTAG GGCCAGATCG CTATGCCCTG AGCCTTGTGG TGCATCACTT GTTGGTTGAT
GGTTGGTCGG TTGGCGTGAT TTTGCAAGAA GTTGCCCAAA TGTATCAAGT TTTGAGCGAC
GGCCAAACCC CGCAATTGGC TCAGCCTTTG CAGTTTCGCG ATTATCTGGC CTGGAAAGCT
GGGCGTGATC TGACCACCCA AGCCAATTTC TGGCAAGCGC TGTTTGCCGA GTTGCCAGCA
CCTTTAGCGT TGCCAACCGA TTACCCGCGC CCAGCGCTCA AAAGCTATGT TGGCCAACGG
GTGATGCAAG TATTGGAGCC AGCCAGTTAT CAAGCCTTGA AACAACTCAG CCGCCAATCA
GGGGCGACCT TGTTTATGGT GTTGTTGGCG GGCTACCAAT TGTTATTGCA TCGCCTGACG
GGCCAAAACG ATCTGGTGGT GGGCATTCCG GCGGCGGCGC GTTCGTTTGA GGGCAGCGAA
ACGATTGTTG GTTATTGTGG CAATTTGCTG CCCCTGCGCA GTCGCTTGCA AGCTGAACAG
AGCTTTAGCG ATTATCTGCA TCTGACTCGC CAGCAACTAT TCGATGCCTA CGAGAACGAA
GATTACTCCT TGGCGCAACT GTTGGCGGTC TTGAACCCAA TGCGTGATGC CAGCCGCTCG
GCGATTGTTG AAACCTTGTT CAACTTTGAG CCGCCGACAC CTGCTCCCAA CTTTGGTGGG
CTGGAAACCA GCTTTGTGCC TCAATCAATT AGCGCAACCG CGCTTGATCT CAGTGTCAAC
GTAATTGAGT TGAATCAACA GCTGGTGGTT TATTGCGATT ACAACAGCGA TTTATTTGAG
CAGGCCACGA TTGAGCGTTG GCTGGGCCAT TATCAAACAC TGTTGTTGAG TGCTGCCCAG
CAACCAACCA GCCCTGCCGA ACGCTTAGCC TTGTTGAACC CCAGCGAAAC AAACGTACTG
CTAGAAACTT GGAATGCGAC AGCGAAACAA GTTCCATTCG AGCAGACCTA TAGCCAATTG
TTTACTGATC AAGTTCAGCG CACGCCCTCC GCAATTGCGA TTAGCGACCA ACACACCAAC
TACAGCTACC AAGCGCTTGA TCAACGGGCC AATCGCTTGG CCAACTATTT GCAAAGTTTA
GCAATTAGTA CAAACCAAGT TGTGGCAATT TTGGCAGATC GTTCGTGTGA TTTTGTGAGT
GCAGTTTTGG GCGTGTTCAA GGCTGGCGCT GCCTATCTCC CGCTCGATTT GGAGCACCCA
CCACGGCGTT TGGCCCAGGT GTTGCAACAA AGCCAAAGTC GTTTGGTGCT GGTTGGTGAG
GCTTGGCAAG CAACCTTGGC CGCAGCGCTC AGCATTTTGC CCAGCGACCA ACGCCCAATC
ATCGTGCTGT TGGAGCAAGC CTTCAACCCT GAATTATCAA GCGAAGCACC GACGATTCAA
TCCCAAGCCA GCGATTTAGC CTATGTGATT TATACCTCGG GCTCAACTGG CTTGCCCAAA
GGAGCCATGA TCGAACAACG GGGCATGGTC AATCATCTAT ATGCCAAGAT TATCGATTTA
CAACTGACAG CGGCTGATCG GGTGGCGCAA AATGCCCGCC AAAGCTTTGA TATTTCAGTT
TGGCAGATGC TGGTAGCGCT GTTGGTTGGC GCGGAAACCC AAATCTACCC TGATAGTATC
GCCCGTGATC CTGAGGTATT GTTGAGCTAT GCCGAGCAGC AAGCAACCAC AATTTTGGAA
ATTGTGCCCT CGTTGTTGGG CGCATGGCTC ACAATTTTTC CCAATCGGGC CAACGATTTG
CCCAGTTTTG CGCAATTGCG TTGGCTCTTG CTGACGGGCG AGGCCTTGCC ACCCGCCGCC
TGCCGTGATT GGTTCACCTG GTATCCTACG ATTCCCTTGA TGAATGCCTA TGGCCCAACC
GAATGCTCCG ATGATGTGAC CCATTATGTG GTGCGTGAGG CTCCAGCAGC GCATGTGGTG
CATATGCCGA TTGGCCGTCC AGTGATCAAC ACACGCTTGT ATATTCTTGA TGGGTTGTTG
CAACCAGTGC CGATTGGGGT GATTGGTGAA CTCTACGTTG GTGGCGTGGG CGTTGGTCGT
GGCTACCTGA ATGATCCTGA ACGAACTCAA GCGGTGTTTG CAGCTGACCC ATTTATGGCT
GGCGGGCGTT GGTATCGCAC TGGTGATTTG GCTCGCTACC GCAGCGATGG CACGATTGAA
TACTTAGGTC GAATCGATCA TCAAGTCAAA GTGCGGGGCT TCCGCATTGA GCTGGGCGAA
ATTGAAGCAG CTTTGGCTCA ACATCAAGCG GTGCATCAAA GTATTGTGAC GGCAACTCCG
AATGCTCAAG GTCAATTGCG CCTGATTGCC TATGTCGTCT CGAAGGCCGC TGATCAGCCT
GCGGAACAAG CAACCAGCGC TCGGTTGGAG CAATGGGATA GCGTTTGGGC TGATACCTAT
GATCAACTGA GTGCTGGTGA TCATGGCACG ATCAACACGA TTGGCTGGAA CGATAGCTAC
ACCCGCCAGC CCTTCTCGGC AGAGGCCATG CACGAATGGA GTTGGGTCAC GGTACGCAAT
ATTTTGGCCC AACAGCCGAG CCGCATTTTG GAGCTTGGTT GTGGCACTGG TATGTTGCTG
TTGCCGTTGG CTCCCTATTG CTTGAGTTAT CGCGGCCATG ATATTGCCGC CGAGGTGTTG
GCTTACGTTC AACAACGGCT TGATCAACAA ATTCATGATT GGCCGCATGT GAGCTTGGCG
CAGTTTCCGG CCCACGATTT TAGCAATATT GCACCCCACA GCGTTGATAC GATTGTGATC
AACTCGGTGG CTCAATATTT CCCAAGTATC GACTATTTGG TGCAAGTGTT GGCGGCTGGG
CTTGAGGCCT TGGTGGCGGG TGGGCGAATC TATCTTGGTG ATATTCGCAA CTTGAGCCTC
AACCCATTGC TGCATGCCTC AATTCAGTTG TTCCAAGCAC CTAACGAATT GGCGGTTGAG
CAAATTGCCC AATTGGCGCA GCAGCAACAT TTGCGCGATC AAGAGTTGGT AATCGACCCA
AGCTTTTTCT ATGCGTTGCA ACAGCAATAC CCGCAAATCA GCCATATCGA ACTGGTGCTG
AAACGTGGGC GCATTCATAA CGAACTGACC CGTTTTCGCT ACGATGTGGT GCTGCATGTG
CAACGCCCAA GCCTTGATCT GCAACCACAT TGGTTTGATT GGCAAGCCGA TGGCTTGAGT
TTGAGTCTCG TGCGCCAGGT GCTTGAGCAA AGCCAGCCCG ATGCGCTAGG CATCGCCAAT
GTGCCGAATA GTCGGCTAAG CGAGGCCTGT GGTTTGTGGC AAGCACTGCA TGTAGCGGAG
CAACCATCAA CTGCTGGCGA GCTGAAACAA GCACTGCAAC CATTGGCTTT GCAAGGCATC
GACCCTGAAG ATTGGTATAA CCTGCATCTG AATGGGCGCT ACCGAATTAG CGTCAGCTTG
GCCCAAAGCG GCGAGCTTGG TTGCTATGAT GTGTTGTTCT ATGCCACTGC CAAGCTGGCA
GATGGCGGTT TGCCGCAGCA AATTCAACGA CTCGCCAGCC CACGCAAAGC ATGGTCGGCC
TATGCCAACA ATCCACTCCA AGAAAGCCAA TCGCTGACCC AGCAATTCCG TCAGCATTTG
CGCCAAGCCT TGCCCGACTA TATGCAGCCC GAAGCCTTTG TGTTGTTGGA GCAATTGCCC
TTGACTCCCA ATGGCAAGGT TGATCGTCGC GCCTTAGCGG CGCTCGAAGC GCCCATCCAA
ACGACGACCT ATTTGGCTCC GCGCAACCCG CTAGAGCAAC AACTGGCTAG CCTGTTTGAG
CAAGTGCTCA ATCTTAACCA AGTTGGCGTT GATCAAAGTT TCTTTGAATT GGGCGGCCAT
TCGTTGACTG GAACCCAACT GATTGGCCTG ATTCGCAGTG AATGCCATGC CGATTTGCCC
TTGCGCACCT TGTTTGAAGC CCCAACGGTT GGCGAATTGG CCTTGCGAGT TGCCGCTGCC
CAAACCGAGC CAAGCGAGAT TGCTAAACCA ACCGCGCTCA AACGTCAGCG CCAACGGGTC
AATTTGAATA CGGCTGGCTT TGGCCAAATT GCTGAAAGCA ACGACGGAGG TGCAGCATGA
 
Protein sequence
MKRSPELVAL TAPFADLGLS SREAVGLSGD LQAWLGRKVA PTVLWEYSTI QALSDFLAHD 
QAQQLPLPKP KPSQASATSS SAIAIVAMSC RLPGADSPEA LWQLLLEGRS AIGFVPADRW
DAQALYSPEA RTPGKINTRW GGFLDQVDQF DPQVFGISGR EASRIDPQQR LALEVAWETF
ERAGIAPDQL VGSATGVFLG ISSNDYARLQ FAQLDQLDAY AGTGNAHSIA ANRLSYVFGL
QGPSMALDTA CSSSLVAVHL ASQSLLAGEC EQALAGGVNL ILNPELSVTF AQAQMLSGTG
ECHTFDAAAD GYVRSEGCGM VLLKRLDVAE AAGDPILAVI HGSAVNQDGR SNGLTAPNGQ
AQQAVIRQAL AKAQIQPDQL SYIEAHGTGT PLGDPIEVAA LQAVLGERQQ PCLLGSLKSN
LGHLEAAAGI AGLIKLVLAF QQQIIPAQAN FKQRNPQIEL GSALEIATTP QPWYSFGSYA
GISSFGFGGT NAHVILGAAP IQPKRLPQPS PAPIQLLALQ ANSETALRQL TERYQAYLAQ
TEVNLADICW SAYHQRATMR HRLIVSATDK IQMLERLQHA WQAQATGSIY AEQPQPAPRI
AFVCSGQGSQ YVGMAQTLYQ TQPLVRQILD QANSILNEYL AIPLLDVLYQ PDHGALLRDT
RYTQPAIFVV SYALGQLWRS WGIEPVALLG HSIGEYSAAV LAGVWSFEQG LRLVAQRAQL
MHGLPEHGAM LAIRSPLESI EPLLAQHQLD LAAINGPNAV VVAGSVAAIS QCAVELNQLN
ITNKLLDVSH GFHSRLMQPM LADFQQVLSA YPAMAPQIRL IANLDGSWHE QAPSAEYWVE
HTRQPVQFYR GLQSLVASGV SHMLELGGHS TLIDLGRQAG LPNLTWLASL RRQQADWETL
YHAAATLLAH GCQLNWAAMN PDYQPQAVLL PTYAFDRQRY WFTEGTGMNQ ASAQPTPASA
SSRHSTILAE LRSLTANLLH VKPEQINIHS SFVEMGADSL VMIDAGRAIE SRYNVHITMR
QLFEDLASLD ALARFLDAHG TFEAEPAPSE PSVVPIVAQA QPVAPAAVTP VAAAGLEAVV
QQQLALMQQQ LALLAGQPAA VTPIQPVTPA SATPATPVIK PASTPAAAAP KAYVPYQPVR
PGSITANDLS SQQQAHLQQL IKDYTTRTAT SKQLTQAYRA QLADNRESAG FRFSIKEMLY
TLICERSEGS KIWDVDNNQY LDLTMGFGVN LFGHRPEITT QALANQLAQG YQLGPQTRLA
GEVAQLICRI TGMERVAFCN SGTEAVLTAI RAARTVTGRK KIALFAGSYH GFYDATLATA
QAGAATRSVP LAPGIPQGMV DDIVVLDYVT PESLVTLEQL LPELAAVLVE PVQSRRPDLQ
PQAFLQAVRE LTKTHGSLLI FDEMITGFRI AAGGAQAWFG IEADLATYGK IVGGGMPIGV
VAGRGATLDA LDGGFWQYGD DSFPQAETTF VAGTFCKHPL ALASAKAVLT AIEHAGQGLY
DQLNQQTASF AAEMNAYFAQ AEAPISVVHF GSLFRFSFKQ NLDLFFYHLL LQGIYIWEGR
NCFFSTAHST ADVEWLKRAI RNAVEALQDG GFLPKPQRQL NQPHSFALSE DQYHLWVLGQ
LGQHEAIAYN LPTGLEIRGQ LDLVRLEQAF NLVVQRHESL RTIIASDGTQ QIVQPQQPIT
INFSDFSSAA NQQQALDQWF TQHNQQVFDL SQTNPLRCNV AQLGPDRYAL SLVVHHLLVD
GWSVGVILQE VAQMYQVLSD GQTPQLAQPL QFRDYLAWKA GRDLTTQANF WQALFAELPA
PLALPTDYPR PALKSYVGQR VMQVLEPASY QALKQLSRQS GATLFMVLLA GYQLLLHRLT
GQNDLVVGIP AAARSFEGSE TIVGYCGNLL PLRSRLQAEQ SFSDYLHLTR QQLFDAYENE
DYSLAQLLAV LNPMRDASRS AIVETLFNFE PPTPAPNFGG LETSFVPQSI SATALDLSVN
VIELNQQLVV YCDYNSDLFE QATIERWLGH YQTLLLSAAQ QPTSPAERLA LLNPSETNVL
LETWNATAKQ VPFEQTYSQL FTDQVQRTPS AIAISDQHTN YSYQALDQRA NRLANYLQSL
AISTNQVVAI LADRSCDFVS AVLGVFKAGA AYLPLDLEHP PRRLAQVLQQ SQSRLVLVGE
AWQATLAAAL SILPSDQRPI IVLLEQAFNP ELSSEAPTIQ SQASDLAYVI YTSGSTGLPK
GAMIEQRGMV NHLYAKIIDL QLTAADRVAQ NARQSFDISV WQMLVALLVG AETQIYPDSI
ARDPEVLLSY AEQQATTILE IVPSLLGAWL TIFPNRANDL PSFAQLRWLL LTGEALPPAA
CRDWFTWYPT IPLMNAYGPT ECSDDVTHYV VREAPAAHVV HMPIGRPVIN TRLYILDGLL
QPVPIGVIGE LYVGGVGVGR GYLNDPERTQ AVFAADPFMA GGRWYRTGDL ARYRSDGTIE
YLGRIDHQVK VRGFRIELGE IEAALAQHQA VHQSIVTATP NAQGQLRLIA YVVSKAADQP
AEQATSARLE QWDSVWADTY DQLSAGDHGT INTIGWNDSY TRQPFSAEAM HEWSWVTVRN
ILAQQPSRIL ELGCGTGMLL LPLAPYCLSY RGHDIAAEVL AYVQQRLDQQ IHDWPHVSLA
QFPAHDFSNI APHSVDTIVI NSVAQYFPSI DYLVQVLAAG LEALVAGGRI YLGDIRNLSL
NPLLHASIQL FQAPNELAVE QIAQLAQQQH LRDQELVIDP SFFYALQQQY PQISHIELVL
KRGRIHNELT RFRYDVVLHV QRPSLDLQPH WFDWQADGLS LSLVRQVLEQ SQPDALGIAN
VPNSRLSEAC GLWQALHVAE QPSTAGELKQ ALQPLALQGI DPEDWYNLHL NGRYRISVSL
AQSGELGCYD VLFYATAKLA DGGLPQQIQR LASPRKAWSA YANNPLQESQ SLTQQFRQHL
RQALPDYMQP EAFVLLEQLP LTPNGKVDRR ALAALEAPIQ TTTYLAPRNP LEQQLASLFE
QVLNLNQVGV DQSFFELGGH SLTGTQLIGL IRSECHADLP LRTLFEAPTV GELALRVAAA
QTEPSEIAKP TALKRQRQRV NLNTAGFGQI AESNDGGAA