Gene Haur_1861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1861 
Symbol 
ID5733750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2186916 
End bp2195567 
Gene Length8652 bp 
Protein Length2883 aa 
Translation table11 
GC content53% 
IMG OID641279005 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001544632 
Protein GI159898385 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01720] non-ribosomal peptide synthase domain TIGR01720
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCGG TTCAAGCATT GCTCGAACAA TTAAATGCAC TCGACATAAA ATTGTGGCTC 
GATGGGGCAA ATCTGCGGGT AAACGCACCT AAAGGGGCAT TAACACCCGA ATTACGGGCT
GCGCTGAGTG AGCAAAAACC GCAATTAATT GCTTGGTTCG AGCACTATCA GGCGACGGCC
AATTCGAATG CAGCGATTCA GCCAAGCCCA CGAACTACGC CTTTACCGCT TTCCTTTGGC
CAAGAACGGC TGTGGTTTCT TGACCAACTC GAAGGCCAAA GCACTGCCTA CAATTTGTCG
GGCTGTTTTG AAATTCAGGG CGCGTTCAAG CCCGAATTAT TGGCGCGAGC GCTCCAGCTG
AGCCTCGAAC GCCATGAAAT TTTGCGTACT AGCTTTCCAA CCGTTGCGGG CCAGCCAATT
CAACAGGTTC ATCCAACGCC AAACTTGAGC GAATTGCCAT TAGAAATTGC CGATTATTCG
AGCGCTGCCG CGCCGCGCCA ACAGGCCTTG CAGTATCTAC GCGAGCAAGC TTTGGCTGGC
TTCGATTTGA GCAACGGGCC ATTATTGCGC GTGGTGGTGG TGCAACTTGG CGCTGAGCAA
GCCATGCTGC TGGTGGTGAT GCACCACACA ATTGCTGATG GTTGGTCAAT GGGCTGCTTG
ATTCATGAAT TAACCACCAG TTACCAAGCC TTTGCCCAAG CCACCCGACC GCAATTTCCA
GCCCTGCCAA TTCAATATGG CGATTTTGCT GCTTGGCAGC GTGATCCGGC GCAAGCGCCG
CTTTGGCAAA AACAGGTGCA ATTTTGGCAC GATACTTTGC TTGGCGCACC AGCCTTGTTG
GAGCTACCCA GCGATTATCC GCGCCCTTCT AGCCAATCGT GGCGCGGCAA AACCTGCCAT
TTTGCGCTTT CAGCTTGCCT TAGCCAGCAA ATTCGCCACG TTGGTCAGCG CTATGGTTGC
ACGCCCTATA TGACCTTGCT GGCAGGTTTC GCTTTTTGGC TCGCTCGCAT GACTGGCAGC
CACGATCTGC CAATTGGCAC ACCGATTGCC AATCGCAACC GCCTAGAAAC TGAACATTTG
ATTGGGTTTT TCGTCAACAG TTTGGTGATG CGGATCAAGC CTGATCGCAA TCTGCCATTT
TCAGCGCTGT TACGCCAAAC CCAAGCGGTC AGTTTGGCCG CCTTTGCCAA TCAGGATGCA
CCATTTGCTC AGGTGGTCGA GGCCTTGCAA CCGGAGCGCA CTTTAGGCTA TAACCCAATT
TTTCAAGTGA TGTTTGATTT GCAAACTGCG CCAACGAGCA GCTTGCAACT TCCCAATCTT
AGTTTTGAGC CATGCTTGCT TGATGAACTT GGCGAAGGCA CGGCGATGTT TGATCTGTCG
TGGACGATGC AAGATTTGGC GAGTGGCTTT ACTGGACAGG TCGAGTTTGC CACCGATCTG
TTCGCACCAA GCACAATCGA GCGCTGGATT GAAGATTTTG AACAGACCTT GAGCTATGTT
TGTGCTGCGC CTGAGCATCC CTTGTATAGT CTGCCGATTT CGCGCTTGGC CGATTGGGGC
TTTCAAGCGC CGATTAGCGC AGCCCAACCG ATTCACCATT TGGTTCAGCA GCATGCCCTG
GTTCAACCAA ATGCTTTGGC CGTAACCTGG CAAGGCCAGC ATTTGAGCTA TGCCCAACTT
GATCAGGCGG CCAATCGCTT GGCGCACTAT TTGATTGAGC AAGGCATTGG CTGTGGCGAT
TTTGTTGGCC TCTGTTTTGA GCGCTCGCTA GCCATGCCAG TTGCTTGGTT GGGTGTGCTC
AAGGCGGGCG CGGCTTATTT ACCGCTTGAT CCAAGTTACC CGCTTGAGCG CTTGGCTTTT
ATGTGCAGCG ATGCCAAACT GCGTTTGGTG TTAACCCAGG CTGGGTTGGC CGATTGTTTG
CCGCTTGAGC AACCGCTCGT CATTTGGGAA CAACTAAGCG ATGCGCTGGG TGATTATCCT
GCGACCGCGC TTGGCGTAGC AATTCACCCG CAACAACCAG CCTATGTGAT TTATACCTCT
GGCTCGACTG GCTTGCCCAA AGGAACCGTG ATTGCCCACG GCCCGCTGGC TCAAACCTAC
CGCGCTTGGG AACAGGCCTA TCAGTTGAGC GACAAGATTC GGGTGCATTT GCAAATGGCG
GCGTTTTCGT TTGATGTCTG CACTGGCGAT TTTGTACGGG CACTCGGTTC AGGCGGACGC
TTAGTGCTTT GCCCTCGCGA TTATTTGCTC TCGCCCGCCG ATTTATATCA ACTGATTGTC
AGTGAACAGG TTGATTGTGG CGAATTTGTG CCAGCAGTGC TACGCGAGCT TTGTCATTAC
TTGGCATTAA CCAAGCAAAA ACTGGCAATG CCCTTGGTCA TCGCTGGCTC GGATATGTGG
TATGGCGAAG AGTATCAGCG TTTTCAAACC GTGTTTGAGC CGAGCACCCG CTTGATCAAT
TCATATGGTG TGACCGAGGC GGTGATTGAT AGCTGCTATT TCAGTGCTGA TCAACCGTTG
ATTGCTGAGC GCAGCGTGCC AATTGGCCGC CCATTTGCCG CCACCGCGAT GTATGTGCTT
GATCAATGGT TGCAGCCCGT GCCAAACGGG GCAATCGGCG AGCTGTATTT GGCGGGCGAA
CGTTTGGCCA GCGCCTATCT TGGCCGCCCC GACCTCACCA GCGAGCGCTT TGTCCCTGAT
CCTTGGGGGC GTTTGGCTGG GGCACGCATG TATCGCACTG GCGATCGCGC CCGCTGGACG
AGCACTGGTC AGTTGGAATT TCTGGGGCGT GGCGATCAGC AAATCAAGCT ACGTGGCTTT
CGGATTGAGC TAGGCGAGAT TGAAACCGCG CTAACCCAAT ATTCCAGCAT TCAGCATGCG
GTGGCTTTGG TGCATACCAC GCCCCATCCG CAGTTAGTCG CCTATGTGGT GACCACCCAA
GCGCTTGACC AACCAGCCTT GCTGCAATGG CTCCAAACTC GTGTCCCCGA ATATATGCTG
CCCAGCGGCA TTGTTGAGCT TGAGCAATTG CCATTAACTC CCAATGGCAA AGTTGATCGC
AAGGCCTTAC TCGGCTTAAA ACCGACCCAA CTCAGCGCAA CTGAACAGAT CGCGCCTGAA
GGTGCAACTG AGCAAGCGCT GGCCGCAATT TGGCAGCAGG TGCTTGGGCA GCCAGTTGGT
CGCCATGCCA ATTTCTTCGG TTTGGGCGGC GATTCGATTT TGGCGATGCA AGTGGTCAGC
CGCTGCCGCG CTGCGGGCTT GGGATTAACG CCACGCTTGT TATTTCAACA TCAAACAATC
GCTGCTTTGG CGCAGGTGTT GCCCACGCTT GAGGCGACGA GCACAATTCA ACAACCAGCT
CGCGCACCAC AGCAGCTTTT GCCAGTTCAA CACTGGTTTC GCGAGCTAGC AGCGCCGAAT
CCGCACCACT ACAACCAAAG TGTGTTGATT GAATTAACCA CAGCGCTTGA TCCTGAACGG
CTGCAAGCCA GCCTGAATCA ATTAACCCAA TTGCATCCAA GTTTACGGCT CGCATGTGAT
GCTCAGTTTC AGCAAAAACT GCATGCAGCT GAACCAACCT TGCATGTGCT CAAGCTTGAT
CCTGCTCAGC CAGCTGCTGC CCAAATAACC GCCTACGCCG CCCAGCGCCA ACAACAATTG
CACTTGCACC AAGCGCCGCT GTGGCATGCC AGCTATTTGC AGCAGAATGA GCAAGCTTGG
CTCTTGTTGA TCGCTCATCA CTGGATTATC GACGGGGTTT CGTGGCGAAT TTTACTTGAA
GATCTCGCTT ATCTATTAAA TGAGCAGCAA CCTCTCCCTG CTAGCACTAG CGTCGCCGAA
TGGACTGAGT ATCTGCAACA GCAAACCAGT CAGCAGTTTT TCAGCCAACT TGGCTATTGG
CAGCAGACAA TCCAACAATT AAAGCCGTTG GTTGCTAGCC AAAACCAACA ACTCAATAAC
GTAGTTGGGC AAACCCTGCG TTATCAACAT ACGCTTAGCC CACAATTAAC CGCAAACCTG
CTCGGTGAGC TGCATCAAGC CTATCGCACG ACGATCGATG ATCTATTGCT CGCGGCGTTG
GTATTGAGCT ATTGCGAATG GAGCGGCGAG CAAGGGCTAA GCCTTGAGCG CGAAAGCCAT
GGCCGCTTTG GCGATCAAGC TGATTTGGAT TTGACACGCA CTGTGGGCTG GCTCACCAGC
ATCTATCCCC AACATGTGAG CTTGCCCGCC AACCCAACCC TTGCCGAAAG CATAATCGCG
GTGAAAGAGC AATTACGCGC GGTGCCAGAT CAGGGCTTGA GCTATGGCGG GCTGCGCTAT
CAACATCCCG ATCCGACGGT TCGCCAAGCA TTAACGCTGA ATCAACCGTT GGCTGTTACC
TTCAACTACC TTGGGCAACT CGATCAAGGC ACGCTCACAG CACCATTCAA GCGGCTTGCC
GAAATTGACC TTGGCGACGA GCAAGACCCC GCCACGCCAC GCAGTAGTAT CATCGAGATC
AATGGCTATA TCAACAATGG TGTACTGACC TTGAATTGGG AATATTGCCG CGATTGGGCG
GCGGCTACGA TGCTCGAACA ATGGGCCAGC AGCTTTGCCA GCACGCTTGA AGCCTTAGTT
GAGCATTGTC GCACTATGCA GCAACCACGC TTAACTCCGA GCGATGTGCC TTATGCCCAA
CTCAATCAGC GCGAATTAGA TGGTTTGGCT CTGCGGGTTA AGCAACCAAT TGCGGATATC
TACCGGCTTA CGCCATTACA AGAGGGCATG CTGTTTCATA GCTTGCTTGC GCCCGAACAG
CAATTTTATA TCGAGCAAGT CGCATGTCGG CTCGATGGCA CGATTGATCC AGGCTTGTTT
GAACAGGCTT GGCAGCAGGT CGTCGAGCGC CATGCCGTGT TTCGGACAGC CTTTTATAAC
GATGGCCTGA AGCATCCCTG TCAGGTGGTG GCGGCACAGG CTAATTTTCA ACTTTGCTAT
CACGATTGGT CTGTTGAACA AATTGAATCA AATCAACTTA ACGACGTTGC CCAAGCTGAC
CGTCAACGTG GCTTTGATCT TCAGCAAGCA CCATTAATGC GGATCAGTCT GATCAAACTG
GCTGAACAGC ATTATCACTG TATTTGGACG CATCATCACT TGTTGCTTGA TGGTTGGTCG
GTGCCGCTGG TATTGGGTGA AGTAGTGGAA TGCTATCAAA ACCTGATCGC TGGTAGCCAA
CCCAATTTAG CGCCAGCCCC AGCCTATCGC GAATACCTTG GCTGGTTGCA AGCCCAAGAT
CAACGTCAAG CCCAGCAATT TTGGCGCGAT TATTTGGCGA CTCAAGAACA ACCAACTGCC
TTGCCCTGCG ATTACACTGG CTTACATCAG GCAAGCCAAA CTTGGGCCAA AGTTCAGATG
CAGCTCACTT CAGCTGAAAC CCAAGCCTTG AGCCAATTTG CCCGCGACCA GCATGTGACC
CTGAGCACCT TGGCTCAAGC AGCTTGGGGC TATGTCTTGG GTCGCTATAG TAGCCAACTG
CAAGTGTTGT TTGGATTAAC GGTTGCTGGC CGACCCGCCA ATTTGCCAGC GGCTGAACAC
ATGGTCGGAA TGTTTATCAA TACCTTGCCC TGCGTTGTGC CGCTGAATCC TGAGCAATCA
GTCGGCGCGT GGCTCCAAAC GCTTCAACAG CAACAGCTTG AAGCCCAACA ATATGCCGCC
AGCAGCCTTG TTGATATTCA AAGCTGGAGT ACTATCGCTC AGCCAACACC ACTATTCGAA
AGTATTTTGG TGTTTGAGAA TTACCCGAGC AAGCAGGCTG ACGATCAGCA AACGAGTTTA
CAGATTAGCG AGATTCAAGC GACTGAACAA ACCAATTACC CCCTGACACT GGTGGTTGCT
CCGGCTGAGC AACTGGTTTG TAGCCTCAGT TATGCGAATG AGCGCTTTGA TTCAGCCCTG
ATTGAAGCAG TATTGACGGG CTTTTGCCAA ACCCTAATGG CGCTCACGCG GCAAACAACC
TTGGCCCAAT TGCCAACCCT CGGCATGCAA CACCAACAGC TAGCGGCTTG GAATGCGACC
GAACAACCGC TTAGTCCATA TTGCTTGCAC GAGCTGTTTC AGCAACAAGC ACAGCGCACA
CCGCAAAACA TCGCGATTAT CACTGCTGAC CAACGCTTGA GTTATGCCGA GCTTGAGCAG
CAATCGAATC AAATTGCCCA CTATTTATGT GGCTTGGGAG TTGGCCCAAA TAGCTTGGTG
GGGATTCATC TCGAACGTTC GGCTTTGATG TTGGTGGCCT TATTGGGGGT GCTCAAGGCG
GGCGGAGCTT ATGTTCCACT TGATCCTAGT TTTCCCTTGG AGCGGCTAAG CTATATGGCC
GAGGATTCCA ATATTCGGGT ATTGCTGACT GCGACCTCAA CGCAAGCGCT AGCCTCAAGC
CTTCAGCATG GCCCATGGGC GGTGGTTGCG CTCGATGAGG TGGCTGATAG CTTGGCACGT
ATGCCAACCA CTGCGCCGTT GCCGAGCGCT CAAACCCACG ATTTGGCCTA TGCCATCTAT
ACCTCCGGCT CAACTGGCAA GCCCAAGGGT GTGTTGCTTG AGCATCAAGC AGTGGTCAAT
TTTGTTCAAT CAATTCAGCA TAAGCCAGGG ATTGCCTCCA GCGATCGGTT GTTGGCTGTT
ACTACCTTGT CGTTCGATAT CGCGGTGCTT GAGTTGTATG GCCCGTTGCT CTGTGGGGCA
ACGGTTGTGC TGGCTAGCCG CGAGGCTGCT GGCGATGCTG AACAATTGAT CAACTTAATT
AATCAACATG ACATAACCAC GATGCAGGCA ACCCCCGCGA CGTGGCGGAT GTTGCTGGCG
GCTGGTTGGC AGGGTAGCAA TCTACGGGCA CTCTGCGGCG GCGAGCCATT ACCACGCGAT
TTGGCTGGAG CGCTACTTGA GCGGGTTGCC CAAGTTTGGA ATATGTATGG CCCAACCGAG
ACCTGTGTTT GGTCAACATG TGCCCAAATT ACCACCGAAC TGCTGCTGAA TTCAACCCAG
CTTCCAATTG GCCGACCATT GGCGAATACC CAATGCTATG TGTTGGATGC GCAACAACAG
CCATTACCAG TCGGCGCATT AGGCGAATTG TATATCGCTG GGACTGGCGT GGCCCGTGGC
TATCACGAAC GGCCTGAATT AACCGAACAG CGTTTTGTCC CCGATCCTTT CAGCCACAAC
CCAACTGCGC GAATGTACCG AACTGGCGAT TTGGCTCGCT ATCGCAACGA TGGCACGCTG
GAATGCTTGG GCCGTATCGA TCAACAAGTT AAAATTCGCG GCTATCGAAT TGAGCTTGGC
GAGATCGAAA CCATTCTACT GGCTCACCCC AGCGTGGCCC AAGCCTTGGT TGTGGTGCAA
ACAACCGCGA CCGATGCTCA ATTGATTGCC TATCTGATTG GCGCAACCCC TGAGGTTGCA
ATTGAGCCGT TGCGCCAGCA CTTGGCCTTG CAACTCCCAC GCTATATGCT GCCAAGCGCA
ATTGTAGTAT TGAATGAATG GCCATTAACC CCCAATGGCA AAATTGATCG TCAGGCCTTG
CCCAAGCCAT GGAGCGAGCA GCCCAACCAG CAAATTGCCC GTGATCCGCT GGAATTGCAG
CTGCAACAAC TTTGGACAAG CGTGCTTGGC CATCAATTGG GTATTCACGA TCACTTTTTG
GAGCATGGCG GCCACTCGCT GATCGCTATT CGATTTATGG CCTTGCTCAA CCCAACGCTT
GAGCAGCCGC TGCCCTTAAC CAGCTTATAT CAAGCCCCAA CGATCGCCGA AATGGCTCAA
TTGCTGCGGC ATCAAAGCCG CCAATGGTCG CCATTAGTGC CCTTACGCCA TGGCGCAGCC
GAACAAACGC CGCTCTTTTT GCTCCCAGGG GCTGGTGGTA ATGTGCTGTA TTTACAACAG
CTAGCCCAAG CAATTCCAAC TGAGCGGGCG ATTTATGCAG TGCAAGCCTA TGGCCTAGAG
CCAAACCAAA CCCCATTGGA GACGGTCGAA GCCATGGCTC AACAAGCTTG GCAAGCAATT
CGCCACGCCT ATCCGCAAGG CCCATATACC TTGATTGGTC ACTCGTTTGG CAGCGATGTG
GCTTGGGCGA TTGCTAGCCT GGCACTTGCC GAGGGCCAGC AAATTTGCCA ATTCTTCAGC
CTTGATAGCG CTGCACCCCA AACTCGCCAG CAACCACGCC AGCTTGAGCC ATGGTCGGAA
TGGATGCGGC GTGGCAAGCA GGTATTGGAG CAGGCATTTG CAGTTAATTT GGTGCTGACT
GAGGCCGATT TAGCCGAATT AAGCCCACTT GAGCAATCTG GCCTGCTAAC TGACCAACTG
ATTACGCTTG GTATTTTGCC AGCCCAAACC GAACCCAGCC TGATCGAACG CTTTTTGAAG
GTCTTTCAGG CCAATCATCA AGCAAGTTTT CAGCCAGCTC AAGGCTTAGC AGTGCCTGTC
GTGCTGATCA AGGCTCGCGA CGAAGCTCCC GAGCCAAGCC TTGATCAACA GCCAGATTGG
GGCTGGAGCA ACTTAACCAG CCTAGCGCTG GAGATTGTGA GTTTGCCAGG CGATCACCAC
ACGATGTTGC ACGAGCCATA TGTTCAAGCC TTAGGTCGTT TGATTGGGGT TGGCTTGGAG
GTCGCCGTAT GA
 
Protein sequence
MSAVQALLEQ LNALDIKLWL DGANLRVNAP KGALTPELRA ALSEQKPQLI AWFEHYQATA 
NSNAAIQPSP RTTPLPLSFG QERLWFLDQL EGQSTAYNLS GCFEIQGAFK PELLARALQL
SLERHEILRT SFPTVAGQPI QQVHPTPNLS ELPLEIADYS SAAAPRQQAL QYLREQALAG
FDLSNGPLLR VVVVQLGAEQ AMLLVVMHHT IADGWSMGCL IHELTTSYQA FAQATRPQFP
ALPIQYGDFA AWQRDPAQAP LWQKQVQFWH DTLLGAPALL ELPSDYPRPS SQSWRGKTCH
FALSACLSQQ IRHVGQRYGC TPYMTLLAGF AFWLARMTGS HDLPIGTPIA NRNRLETEHL
IGFFVNSLVM RIKPDRNLPF SALLRQTQAV SLAAFANQDA PFAQVVEALQ PERTLGYNPI
FQVMFDLQTA PTSSLQLPNL SFEPCLLDEL GEGTAMFDLS WTMQDLASGF TGQVEFATDL
FAPSTIERWI EDFEQTLSYV CAAPEHPLYS LPISRLADWG FQAPISAAQP IHHLVQQHAL
VQPNALAVTW QGQHLSYAQL DQAANRLAHY LIEQGIGCGD FVGLCFERSL AMPVAWLGVL
KAGAAYLPLD PSYPLERLAF MCSDAKLRLV LTQAGLADCL PLEQPLVIWE QLSDALGDYP
ATALGVAIHP QQPAYVIYTS GSTGLPKGTV IAHGPLAQTY RAWEQAYQLS DKIRVHLQMA
AFSFDVCTGD FVRALGSGGR LVLCPRDYLL SPADLYQLIV SEQVDCGEFV PAVLRELCHY
LALTKQKLAM PLVIAGSDMW YGEEYQRFQT VFEPSTRLIN SYGVTEAVID SCYFSADQPL
IAERSVPIGR PFAATAMYVL DQWLQPVPNG AIGELYLAGE RLASAYLGRP DLTSERFVPD
PWGRLAGARM YRTGDRARWT STGQLEFLGR GDQQIKLRGF RIELGEIETA LTQYSSIQHA
VALVHTTPHP QLVAYVVTTQ ALDQPALLQW LQTRVPEYML PSGIVELEQL PLTPNGKVDR
KALLGLKPTQ LSATEQIAPE GATEQALAAI WQQVLGQPVG RHANFFGLGG DSILAMQVVS
RCRAAGLGLT PRLLFQHQTI AALAQVLPTL EATSTIQQPA RAPQQLLPVQ HWFRELAAPN
PHHYNQSVLI ELTTALDPER LQASLNQLTQ LHPSLRLACD AQFQQKLHAA EPTLHVLKLD
PAQPAAAQIT AYAAQRQQQL HLHQAPLWHA SYLQQNEQAW LLLIAHHWII DGVSWRILLE
DLAYLLNEQQ PLPASTSVAE WTEYLQQQTS QQFFSQLGYW QQTIQQLKPL VASQNQQLNN
VVGQTLRYQH TLSPQLTANL LGELHQAYRT TIDDLLLAAL VLSYCEWSGE QGLSLERESH
GRFGDQADLD LTRTVGWLTS IYPQHVSLPA NPTLAESIIA VKEQLRAVPD QGLSYGGLRY
QHPDPTVRQA LTLNQPLAVT FNYLGQLDQG TLTAPFKRLA EIDLGDEQDP ATPRSSIIEI
NGYINNGVLT LNWEYCRDWA AATMLEQWAS SFASTLEALV EHCRTMQQPR LTPSDVPYAQ
LNQRELDGLA LRVKQPIADI YRLTPLQEGM LFHSLLAPEQ QFYIEQVACR LDGTIDPGLF
EQAWQQVVER HAVFRTAFYN DGLKHPCQVV AAQANFQLCY HDWSVEQIES NQLNDVAQAD
RQRGFDLQQA PLMRISLIKL AEQHYHCIWT HHHLLLDGWS VPLVLGEVVE CYQNLIAGSQ
PNLAPAPAYR EYLGWLQAQD QRQAQQFWRD YLATQEQPTA LPCDYTGLHQ ASQTWAKVQM
QLTSAETQAL SQFARDQHVT LSTLAQAAWG YVLGRYSSQL QVLFGLTVAG RPANLPAAEH
MVGMFINTLP CVVPLNPEQS VGAWLQTLQQ QQLEAQQYAA SSLVDIQSWS TIAQPTPLFE
SILVFENYPS KQADDQQTSL QISEIQATEQ TNYPLTLVVA PAEQLVCSLS YANERFDSAL
IEAVLTGFCQ TLMALTRQTT LAQLPTLGMQ HQQLAAWNAT EQPLSPYCLH ELFQQQAQRT
PQNIAIITAD QRLSYAELEQ QSNQIAHYLC GLGVGPNSLV GIHLERSALM LVALLGVLKA
GGAYVPLDPS FPLERLSYMA EDSNIRVLLT ATSTQALASS LQHGPWAVVA LDEVADSLAR
MPTTAPLPSA QTHDLAYAIY TSGSTGKPKG VLLEHQAVVN FVQSIQHKPG IASSDRLLAV
TTLSFDIAVL ELYGPLLCGA TVVLASREAA GDAEQLINLI NQHDITTMQA TPATWRMLLA
AGWQGSNLRA LCGGEPLPRD LAGALLERVA QVWNMYGPTE TCVWSTCAQI TTELLLNSTQ
LPIGRPLANT QCYVLDAQQQ PLPVGALGEL YIAGTGVARG YHERPELTEQ RFVPDPFSHN
PTARMYRTGD LARYRNDGTL ECLGRIDQQV KIRGYRIELG EIETILLAHP SVAQALVVVQ
TTATDAQLIA YLIGATPEVA IEPLRQHLAL QLPRYMLPSA IVVLNEWPLT PNGKIDRQAL
PKPWSEQPNQ QIARDPLELQ LQQLWTSVLG HQLGIHDHFL EHGGHSLIAI RFMALLNPTL
EQPLPLTSLY QAPTIAEMAQ LLRHQSRQWS PLVPLRHGAA EQTPLFLLPG AGGNVLYLQQ
LAQAIPTERA IYAVQAYGLE PNQTPLETVE AMAQQAWQAI RHAYPQGPYT LIGHSFGSDV
AWAIASLALA EGQQICQFFS LDSAAPQTRQ QPRQLEPWSE WMRRGKQVLE QAFAVNLVLT
EADLAELSPL EQSGLLTDQL ITLGILPAQT EPSLIERFLK VFQANHQASF QPAQGLAVPV
VLIKARDEAP EPSLDQQPDW GWSNLTSLAL EIVSLPGDHH TMLHEPYVQA LGRLIGVGLE
VAV