Gene Haur_3961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3961 
Symbol 
ID5735822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4981029 
End bp4990421 
Gene Length9393 bp 
Protein Length3130 aa 
Translation table11 
GC content65% 
IMG OID641281111 
ProductBeta-ketoacyl synthase 
Protein accessionYP_001546721 
Protein GI159900474 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00517] acyl carrier protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATC CCTTGGAACA ACTTCTCAAG AGCCTGACGC CTGATAAGAA AGCCTTGCTG 
GCCGAGTACC TGCGGCCCAA GCCCGAGCCT GTCGCCGTGA TCGGAATCGG CTGCCGCTTC
CCCGGCGGTT TGGTGACACC GGACGCCTTC TGGGAATTCC TAAAGCAAGG CCAGGACTCG
ATCATCGAGG TGCCGTCCGA CCGCTGGGAT ATCGACGCCT ACTACGATCC CAACCCGGAT
GCGCCGGGCA AAATGTATAC CCGCTGGGGC AGCTTCCTGA CCGACGCGCC GATGTTTGAT
GCCAGCTTTT TTGGCCTTTC GCCGCGTGAA GCTCTACGCA TGGACCCGCA GCATCGCCTC
CTGCTGGAGG TCGCCTGGCA GGCCCTCGAA GATGCCGGCC AGCAGATTGA CAGTCTGGCC
GGCAGTCAGA CCGGCGTGTT CATTGGGATG ATCAACAACG ACTATCCCGT GCGCCAGCTC
TACGCCGACG GCGCCGAGTG TTTCAACGAC CCATACTTCA GCACCGGCAG TTCGTCCAGC
ATGGCCGCCG GACGCCTGGC CTACCTGCTT GATCTCCAGG GGCCGACGAT GACCTTGGAC
ACTGCCTGCT CGTCGACGCT CGTGGCAACC CACCTGGCCG TGCAGAGCTT GCAAAGCAAG
GAAAGCAACC TGGCTCTGGT CGGCGGCTCC AACGTGGTAA TCCTGCCCGA CTCGTTTGTC
AGCCTGTGCA AGATGCGGAT GTTCTCCAGT GATGGCCGCT GTAAGACCTT CGACGCCGCC
GCCGATGGCT TTGTGATCGG CGAAGGTTGC GGCTTCGTGG TACTCAAGCG CCTTTCCGAC
GCCATCAAGG ACGGCGATCA GGTTCGGGCG ATTATCCGCG GCTCGGCGGT CAACGAGGAT
GGTCGCAGCA GCAGCATCAC CGCCCCCAAC GGCCTGGCCC AGCAGGCGGT GATCCGCAAA
GCCCTGGCGG TGGCCGGGCT GAAGCCCCAG CAGATCAGCT ATGTTGAGGC TCACGGTTCC
GGGACCTCGT TGGGCGATCC GATCGAGATG GAGTCGCTGC GGGCTGTTTT GGGCAAAGGC
CGATCTCCCG ATCAGCCGCT CTACGTCGGC GCGGTTAAAA CGAATATCGG CCATCTGGCC
GCCGGTTCGG GAGTCGCCGG GCTGATCAAG ACCGTCTTGT CCTTGCAGTA CAAGCAAATT
CCGCCGCATC TGAATTTCAA GACGCTGAAT CCGGGCATTC CCAAGGGCGG CGCGCCGTTT
GTCGTGCCAA CCAGCCTGAC CCCTTGGACG GTCGCCGACG GGCCGCGCCT GGCCGGGGTG
AGCTCGTTTG GCTGGTCGGG AACCAACGCC CATGTGATTT TGGAGGAGGC TCCGCTGGCC
GAGCCGCTGA GCGCCGCCCG CCCGACCCAT GTGTTGCTTT TATCTGCCAG GACGCCGACC
GCCTTGGAAA AGGCGACCGA AAACCTGCTG GGCTATCTCC ACCGCAACCC AGACTGCGAT
CTTGCCGACG TCGCCCACAC GCTGCAACGC CGTAGAAAAC ACTTCGCCCA TCGCCGGGCG
GTGATTTGCC GCGATGTCGC AGAGGCGATC GCTGGCTTGA GCGGCCGACA TGGCCGTATT
CACACCGGCC ATGTCGGCCA GGAACGCCCG GTGGCCTTTG TGTTCGCTGG CGTCGGCGAC
CACTACGCCG GTATCGCCAA GGGCCTCTAC GCCAATGAAG TGGTTTTTCG CACCGCGGTC
GACCATGCGC TGGCCCTGTT GCCGCCGCTG GATGCGGCGG AGCTGCGGGC GGCGCTGTAT
CCGTCCAACC TGCCGGCTGC TCCGGCGGCC GGACATGGAC TGCTAGGGCG CAACGGAGCT
GCGGGCGGGG CGGACGGGCC ACTACACCAG ACGGCGCTGG CCCAGCCGGC GGTGTTTGTG
GTCGAGTACG CCCTGACCCA ATTGATGATG GCCTGGGGCG TGCGACCGCA GGCATTGCTG
GGCTACAGCT TGGGCGAGTA CGTCGCCGCA ACGATCTCTG GCGTATTGAG CCTGGAGGAC
GCCTTGGCGC TGGTCGCCCG GCGTGCCCAG CTGATCCAGT CGCTGCCGAA AGGCGCGATG
CTGGCGGTAG CGGCAGGTGC GGATGCGATC CAGCCGTACT TGGGTAGCGA GGTGTGCCTA
GCCGCGGTCA ACAGCCCGAG CACCTGCGTC CTTGCCGGCC CACACGCCGA GATGAACGCC
CTGGCGGAGC GGCTCAGCCA GGCCGAGATC GCCTGCCGCC CGGTGGAAAC CAGCCACGCC
TTCCACTCGA CGATGCTGGC ACCGGCGCAA GAGGCGCTGA CTGCGTTCGC TACCACATTG
ACCTTCAACT CGCCGGCCAT CCCGTACCTA TCCAATGTTA CCGGCACCTG GATCACTGTT
GAGCAGGCCA CCGATCCTGC CTACTGGGCA CGGCACATGG TCGAGACCGT CCAATTCGCC
CCTGCCCTGG GTTCGCTGCT CGCCGATCCG GCCATGATGC TCGTGGAGAT AGGTCCGGGC
CAGGCGCTGG GGTCGTTCGC TAGGCAGCAT CCCGCCTGTG ATCGCCAGCG CTTCGCCGAT
ATCGTAGCAA CCCTGCCGGC CAAGCACGAG GCCCAGTCCG AGTTGTCGGC GGCGCTGTCG
GCGCTGGGCC GCCTCTGGGT CGCCGGGGCG CGAGTTGATT GGGCCGCGTT CGCCGGCGCT
GAGCGGCGCC GCTCGGTTGC GCTGCCGCCG TATCCCTTTG AGCGCGAGCG CTTCTGGATC
GATCTTGACA GCGTGCCTAC GACCCTGGCG CCGGCGCGGC CGGCCCGCGG CAAGCAGGCC
AATATCGCCG ATTGGTTTTA TCGGCCGGTC TGGGAGCCGC ATGCGCTCGC CTTGCTGGCG
CCTACCCCGA CTAGCGGTAA GCACTGGCTG GTTTTTGTTG ACGAGCGCGG ACTTGGCCAC
GAGCTTGGCG CCCGGCTCGA ACATGCCGGC GACAGCGTCG TGCGCGTTCA GCGCGGCGAC
GGCTTCGCCC GCATCGATCA CCAGACCTTC GCCGTGCGCC CGGACACCCC CGAGGATTAC
TTGCGCCTAT TGCAATCCCT GGCCAGCGAA GCGCAGTCCC ACCTGGTGGT CGCGCACCTC
TGGTCGCTTG ATGCAACGCC TTCCGCGACC GGCGATCAGT TCGCCGCGAC CCAGCAAGTC
AGCTTCTACA GCCTGCTGGC CCTGGCGCAG GCCCTTGGGC AGGTGAGCCT CGCAGGCGCG
CCTGAGATTG CCGTGGTTTG CGCCGCCGTC CACAGTGTTA TCGGCACCGA AACGATCAAC
CCCGATCTGG CGGCGATCCT TGGCCCGGCG CGAGTTATCC CGCAGGAGTT GCAAGGCGTC
GGTTGTCGCA CCATTGACCT CGCGCTGCCG CAGCGAGGCA GCGCGGAAGA GCGCGAACTG
CTCGACCTGC TGGCCCGCGA GCTGCTGGCC CAGTCGGGCG AACCGATCGT GGCCTACCGC
AATAACCAGC GCTGGGTCGC CGCCTACAAG CCGGTGCACC TGGAGGCCCC TGCCCGGACT
GCGCTGCGCG AGCACGGCAC CTATTTGATC ACCGGCGGCC TGGGTGACAT CGGCCTGATA
CTGGCCGAGC ACCTTGCGTC GACCGTGCGG GCCAGGCTGG TCTTGCTGGC CCGCAGCGAG
TTACCCGACC GCGGCGAGTG GCCACGCTGG CTGAGCGATC GCTCCGAGGA CGACCGGACC
TGCCAGCGCA TCCGCGCGGT TCAGCGCCTG GAGGAGCTAG GCGCCGAAGT CCTGACGATC
ACGGCCGATG TCGCCGACGA GAGCGCTTTG CGGGCGGCTG TCGATCGCGC TACTGAGCGC
TTCGGCGACA TCCACGGCGT GATCCACGCG GCCGGGATTG TGGCAACCGC AGCGTTCCGC
TCGGTCCAGG ATAGCGACCC GGCCATCTGT GAGCAGCATT TCCAGCCCAA GGTCCATGGC
CTGTATGCCC TGGAGCGCGT GCTCGGCGAT AGGTCGCTTG ACTTCTGCGT ACTTTTTTCC
TCGCTGTCAT CAGTGCTGGG CGGGCTTGGT TTCAGCGCGT ATGTGGCGGC AAACTCCTTT
ATGGACGCCT TCGCCCACCG CCACAACCAG TCGCATCCGG TTCGCTGGCT GAGCGTGAAC
TGGGATCTTT GGCTGGGCAG CGTCGATAAA ACCATGTCCG GCGGCCTGGG CGCCAGTCTG
ACCGAATACG GCATGACGCC TGCCGAAGGG GTCGCGGCAT TCGAGCGGGT CCTTTCCGTT
CGGGATGCCA GCCAGATTAT CAACTCCACC GGCGACCTTG ATGCGCGCAT CGCCCAGTGG
GTCCGCATGG AAGCCCTTGC GGTTGGCGCC GATGCCCCAA CCCCGGCGGC GAAACTGGCT
CACGCCCGGC CCGAGCTTAA TACCGCCTAT GTTCCGCCGC GCGGCGAATA CGAACAGCGT
ATCGCCGTGA TCTGGCAGGA GGCGCTTGGC CTCGATCAGG TCGGCATTCA GGATAATTTC
TTCGACCTGG GCGGCAACTC TCTGGTAGGC ATCCAGGTGA TCGCCCGATT GCAAAAAGAG
TTCAAAGTGC AGCTTTCGAC GGTGGTGCTG TTCGAAGCCC CCACGATCAG CGCCCTGGCG
ACGTATCTGA TGGAACGGCT GCCGCAGGTC GCTGGCGTGG CCAGTCAGCC GCAGCCGGCC
AGACGCCAGC AGCGCCAAGC CGCCCAAGGC GATATCGCTA TCATCGGTAT GGCCGGACGC
TTCCCGGGTG CCAGTTCGGT CGATGAGCTG TGGCGCAACA TCAGCCAGTC CCACGAAGCG
TTTTCGATCT TCAGCGACGA GGAATTGCTG GCCGCCGGCG TCGCTCCCGC CCTGGTCCGT
GATCCCAACT ACGTCAAACG GCGCCCGATT CTTGGCGACG ATATCGGCCT GTTCGATGCG
GCCTTCTTTG GCTATTCGCC CCGCGAAGCT GAGTTTATGG ACCCGCAACA GCGCCTGTTC
CACGAGTGCG CCTGGGAGGC CCTGGAAACA GCGGGGTACG ACTCTCAGCG CTACGATGGG
CTGGTTGGCG TGTTTGCCGG CGCCAGCGTC AGTACCTATA TGCTTCAGCT GGCGGCGCTG
CCGGTCTTCA ATGACTTCGG CTCCGATCCG TCTGCCTATT TCTCCAATGA CAAAGATGGC
CTGACGACCA ACGTGTCCTA CAAGTTGAAC CTCCACGGGC CAAGCGTGGC GGTGCAGACC
TATTGTTCGA CCTCGCTGGT GGCCACCCAC ATGGCCTGCC GCAGCCTGCG CGGCGGCGAG
TGCGACATCG CGTTGGCTGG CGGCGTGTCA GTCCGCGTCC CGGTCAAAAC AGGCTATCTG
TACCGCGACG GCGATCAGGT CTCGCCCGAC GGACGCTGCC GCACCTTTGA CGCGGAGGCC
GGCGGGGCTA ACTTTGGCGA CGGGGTGGCA ATCGTGGTGC TAAAGCGGCT GGCCGACGCG
CTGGCCGACG GCGACACCAT CCACGCGGTC ATCCGCGGGT CGGCGATCAA CAACGACGGG
GGGCTGAAGG TCGGCTACAC CGCGCCCAGC GTGGTCGGCC AGTCCAAGGC CATCGCCGCC
GCGCTTGACG ACGCCGGCGT GACCGCCGAC TCGATCTCGT ATGTCGAAGC CCACGGCACC
GCCACCAAGT TGGGCGACCC GATCGAGATC GCCTCGCTGA CCAAGGCGTT TCGCGCCAGC
ACCGATAAAG TCGGCTTCTG CGGGATCAGT TCGGTCAAGC CCAACATCGG CCACCTCGAC
CGAGCGGCCG GCGTCACTGG CTTGATCAAA ACGGTGCTGG CCCTCAAGCA CAGCCTGATC
CCGCCGACAC TGCACTTCCA TGCGCCCAAT CCCGAGATCG ATTTCGCTGC CAGCCCGTTT
TATGTGCTGA CCGAGCCGAC CCCTTGGACG CGCAATGGCA CGCCCCGCCG TGCGGTCGTG
AACTCGCTGG GCGTGGGTGG CACCAACGCC CACGTCGTGG TCGAGGAGGC GCCGCCGGCT
CCGGCTGCCA ACCCGTCGCG ACCGACTCAG CTGCTCTTGC TGTCGGCCAA GACTCCCACG
GCTCTCGAAG CGGCTGCGGC GCGGCTGTCC GATCATCTTG GCGGCCAAGA TGTGAATCTG
GCCGATGTCG CCTACACCCT GCAAGTCGGC CGCCGCGTCT TCGAGCACCG GTGCGTGATG
GTCTGCGAGG ATGCCACCGA CGCGCGTTCG CTGCTGACCA AAGGCGACAC CCGGCGCGTT
CTCAGTCGCC AGCAGAAGCC GACCAGTCGC AACCTAGCCT TTGTGTTCGC CGGCGTCGGC
GACCACTACG CCGGCATGGC CCAAGGCCTT TATGAAACCG AGAGTGTTTT CCGCGCCACG
GTCGACGAGT GCTTCCGCAT CCTGACGCCG CTGCTGGGTG CCGACCTCAA GCAGGCTTTG
TACCCGGAAG GCCAGGCACG GCGCAACGGC AATGGCTCCG GCATCGATCT GCGGGCACTG
CTCGGGCGTG ATCGCGCGGC AGGCGGGGCG GCCGGGCCGC TGCACCAGAC GGCGCTGGCG
CAGCCAGCAG TGTTTGTGGT GGAGTACGCC CTGGCCCAGC TCCTGATGAC CTGGGGCATC
CGGCCGCACG CACTGCTCGG CTATAGTCTG GGCGAATACG TGGCGGCCGC CGTGGCCGGA
GTGTTGAGCT TGGAAGATGC CCTGGCGCTG GTTGCCCGGC GCGCCCAGCT GATCCAGGCG
CTGCCGAGCG GCGCGATGCT GGCGGTGGCG GCGGGCGCGG ATGCGGTCCG CCCGTATCTG
GGCGGCGAGG TGTGCTTGGC GGTAGTCAAC AGCCCGAGCA CCTGCGTTCT CGCCGGCCCC
CAGCAGGCCC TCGCGGCCGT CGCGGATCAG CTTGAGGCCC TCGATATCAG CAGCCGCTGG
TTGGAAACCA GCCACGCGTT CCACTCGACG ATGCTGGCAT CGGTGCAGGC GGAGCTCACC
GCGTTCGCCG CCACCCTGAC CTTTCACCTA CCGACCATCC CGTATCTGTC CAACGTCACC
GGTACCTGGA TTACCGCCGA ACAGGCCACC GACCCAGGCT ACTGGGCGGC GCACATGTGC
CAGACCGTGC AGTTCGCCGC AAGCGTGGCA GTCCTGGCGC AGGATGCTAA CCGCGTCATG
GTGGAGATCG GGCCGGGCCA GGCGCTTGGC TCCTTTGTCA AGCAAAGCCC CGCCTGGGGC
CGCGACCGCC TGGATCTTGT GCTGTCGACC CTGCCGTCCC AGCATGAGGG TCTGAGCGAC
TCGACTGCGC TCCTGACCGC GCTGGGGCGC CTGTGGCTGC TTGACGTGCC GATCGATTGG
CAGGAGTTCG CCGTGGGTGA GCAGCGCCGG CGCGTCCCAC TGCCAACCTA TCCGTTTGAG
CGGCAGCGCT ATTGGCTGGA ACCCAGCCGC AAAGTCGGCG GGCTGGCGGA ACAAAGCGAC
AAGCTCGACC TTATGGGCCT GCCGCGCGAT CCGGCCAACG AGTGGTTCTA CCTGCCGGCC
TGGAAGCAAT CGGCGCCGTA CCTGCCGGCA CTAGCCACAT CCTCGGATGA CAGCCCGCAG
TGCTGGGTGA TCTTTGAGGA CGCCTGCGTG GTCGGCCAGC AGATCGGCGC CTGTCTACGC
CAGCAGGGCC AGCACGTCGT GAGCGTCCAC GCCGGCGCAG CCTTCTTCAA AAACGGCGAC
TCCTACACAA TCAATGCGGG GCAGCGCGCT GATTATGATG AGCTGTTCAA CGACCTGTAT
CAGCAAAACC GGCGGCCCAC CAACATTGTC CACCTTTGGA CGGTGACACC GCCGCTACCG
CATCCGCTGC CCGATCGCTT GCTCGGCCCC GTGCTTGATG CCAGCTTCTA CAGTCTGATC
CACCTTGGCC AGGCGCTAGG CGACCTTGAC CTCGAAGCCT GCACCATCAC CGTGGTTTCC
AGCGATATGC AATCGGTCAT CGGCTCCGAG CGCCAGTGCC CGGAGAAAGC CACGCTTATC
GGCCCCTGCA AGCTGCTGCA GTTCGAATAT GCGGCGCTGG GCTGCCGCAG CGTCGATATT
GTGCTGCCCG AGCGCGGCAG CTGGGAAGAA GAAGCGTTGA TCAGCCACCT GCTCGGTGAG
TTGACCGCCG CGACCGCCGA CACGCAGGTC GCCTTGCGCG GCAACCGCCG CTGGGTACAT
TCGCTTGATA GGCTGAAGCT GGCAGCGCAG GAGTCAAGCG CGCCACGGCT GCGCAAACAC
GGTGTGTACC TGATCACCGG CGGTCTGGGC GGCATCGGCC TGGCACTGGC CGAGCACCTG
GCGCGGACGC AGCAGGCCCG CTTGGCGCTG GTAGGCCGCT CCGGGCTGCC CGAACGCAGC
GAATGGCCTG CCCTGCTGGC GCGCCAGCCG GAACACGCCC ACGCCGGTAA GATCCGACAG
ATCCAGGCGA TCGAGGCGCT GGGCGGCGAG GTGTTGGTGC TCAGGGCCGA TGTCACCGAC
ACAGGCGAGA TCGAAGCGGC GGTGGCGCAA ACCATCGAGC GCTTCGGCAG CCTGCACGGG
GTGCTCCACG CCGCCGGCGT GCCGGGCGTT GGCCTAATCT CACTGAAGAC CAGGGAAACA
GCTGCCAGCG TCCTGGCTCC CAAAGTCCAG GGCACGCGCG CCCTTGCGCA TGCGCTGAAC
AACATGCCGC TGGATTTCCT GGCGCTGTTT TCCTCGGTCG CTTCGGCGAC CGGTGGCGGA
GCCGGGCAGG TCGATTATTG CGCGGCCAAC GCCTACCTGG ACGCCTTTGC CCACAGTCGA
GCCGGGCAGC CAAGTCTGGT GGTCTCGATT GGCTGGTGCG AGTGGCTCTG GAATGCTTGG
GATGAAGCCA TGAGCAGCTA TGACAGCGCT ACCCAGGAGT TCTTCCGCGA CTACCGCGAA
CGCTTCGGCC TGCGCTTCGA TGAAGGCAGC CAAGCTCTTG ACCGGGTGCT TTCTCATCGC
TTCCCGAACG TCTTCGTCTC CACCCAAGAC CTGCGCGCGA TCGTGCGCAT GATGGAAAAC
GCCAAGATCG ATGTGCTTGA GCAGCCCGAA GCCCAAGGCG CGCGTCACGC GCGACCGAGC
CTTGGCACAT CATACGTCAA GCCGCAGAGC AAGTTGGAGC AGGCGATCGC CGCGGTGTGG
AGCGAACGCC TTGGCATCGC CGAAATCGGC CTCAACGACA ACTTTTTCGA GCTTGGCGGC
AACTCGCTGA TCGGTGTCGA TCTGCTGAAC CGGCTGCGCA AGCGCCTTCA GATCAGCCAG
ATCCCGGCCT ATGTGCTGTA CGAAGCACCG ACGGTCGGAG CAATGGCGAA ATTCCTTGAA
CCGGGCCAGG ACACAAACGC AGCGATTCAA GAACGTCACG ATCGCGGTGC AAAGCGAAGG
GATCGGCAGG CTCAACTCAA ACTCAGGTCG TAG
 
Protein sequence
MSNPLEQLLK SLTPDKKALL AEYLRPKPEP VAVIGIGCRF PGGLVTPDAF WEFLKQGQDS 
IIEVPSDRWD IDAYYDPNPD APGKMYTRWG SFLTDAPMFD ASFFGLSPRE ALRMDPQHRL
LLEVAWQALE DAGQQIDSLA GSQTGVFIGM INNDYPVRQL YADGAECFND PYFSTGSSSS
MAAGRLAYLL DLQGPTMTLD TACSSTLVAT HLAVQSLQSK ESNLALVGGS NVVILPDSFV
SLCKMRMFSS DGRCKTFDAA ADGFVIGEGC GFVVLKRLSD AIKDGDQVRA IIRGSAVNED
GRSSSITAPN GLAQQAVIRK ALAVAGLKPQ QISYVEAHGS GTSLGDPIEM ESLRAVLGKG
RSPDQPLYVG AVKTNIGHLA AGSGVAGLIK TVLSLQYKQI PPHLNFKTLN PGIPKGGAPF
VVPTSLTPWT VADGPRLAGV SSFGWSGTNA HVILEEAPLA EPLSAARPTH VLLLSARTPT
ALEKATENLL GYLHRNPDCD LADVAHTLQR RRKHFAHRRA VICRDVAEAI AGLSGRHGRI
HTGHVGQERP VAFVFAGVGD HYAGIAKGLY ANEVVFRTAV DHALALLPPL DAAELRAALY
PSNLPAAPAA GHGLLGRNGA AGGADGPLHQ TALAQPAVFV VEYALTQLMM AWGVRPQALL
GYSLGEYVAA TISGVLSLED ALALVARRAQ LIQSLPKGAM LAVAAGADAI QPYLGSEVCL
AAVNSPSTCV LAGPHAEMNA LAERLSQAEI ACRPVETSHA FHSTMLAPAQ EALTAFATTL
TFNSPAIPYL SNVTGTWITV EQATDPAYWA RHMVETVQFA PALGSLLADP AMMLVEIGPG
QALGSFARQH PACDRQRFAD IVATLPAKHE AQSELSAALS ALGRLWVAGA RVDWAAFAGA
ERRRSVALPP YPFERERFWI DLDSVPTTLA PARPARGKQA NIADWFYRPV WEPHALALLA
PTPTSGKHWL VFVDERGLGH ELGARLEHAG DSVVRVQRGD GFARIDHQTF AVRPDTPEDY
LRLLQSLASE AQSHLVVAHL WSLDATPSAT GDQFAATQQV SFYSLLALAQ ALGQVSLAGA
PEIAVVCAAV HSVIGTETIN PDLAAILGPA RVIPQELQGV GCRTIDLALP QRGSAEEREL
LDLLARELLA QSGEPIVAYR NNQRWVAAYK PVHLEAPART ALREHGTYLI TGGLGDIGLI
LAEHLASTVR ARLVLLARSE LPDRGEWPRW LSDRSEDDRT CQRIRAVQRL EELGAEVLTI
TADVADESAL RAAVDRATER FGDIHGVIHA AGIVATAAFR SVQDSDPAIC EQHFQPKVHG
LYALERVLGD RSLDFCVLFS SLSSVLGGLG FSAYVAANSF MDAFAHRHNQ SHPVRWLSVN
WDLWLGSVDK TMSGGLGASL TEYGMTPAEG VAAFERVLSV RDASQIINST GDLDARIAQW
VRMEALAVGA DAPTPAAKLA HARPELNTAY VPPRGEYEQR IAVIWQEALG LDQVGIQDNF
FDLGGNSLVG IQVIARLQKE FKVQLSTVVL FEAPTISALA TYLMERLPQV AGVASQPQPA
RRQQRQAAQG DIAIIGMAGR FPGASSVDEL WRNISQSHEA FSIFSDEELL AAGVAPALVR
DPNYVKRRPI LGDDIGLFDA AFFGYSPREA EFMDPQQRLF HECAWEALET AGYDSQRYDG
LVGVFAGASV STYMLQLAAL PVFNDFGSDP SAYFSNDKDG LTTNVSYKLN LHGPSVAVQT
YCSTSLVATH MACRSLRGGE CDIALAGGVS VRVPVKTGYL YRDGDQVSPD GRCRTFDAEA
GGANFGDGVA IVVLKRLADA LADGDTIHAV IRGSAINNDG GLKVGYTAPS VVGQSKAIAA
ALDDAGVTAD SISYVEAHGT ATKLGDPIEI ASLTKAFRAS TDKVGFCGIS SVKPNIGHLD
RAAGVTGLIK TVLALKHSLI PPTLHFHAPN PEIDFAASPF YVLTEPTPWT RNGTPRRAVV
NSLGVGGTNA HVVVEEAPPA PAANPSRPTQ LLLLSAKTPT ALEAAAARLS DHLGGQDVNL
ADVAYTLQVG RRVFEHRCVM VCEDATDARS LLTKGDTRRV LSRQQKPTSR NLAFVFAGVG
DHYAGMAQGL YETESVFRAT VDECFRILTP LLGADLKQAL YPEGQARRNG NGSGIDLRAL
LGRDRAAGGA AGPLHQTALA QPAVFVVEYA LAQLLMTWGI RPHALLGYSL GEYVAAAVAG
VLSLEDALAL VARRAQLIQA LPSGAMLAVA AGADAVRPYL GGEVCLAVVN SPSTCVLAGP
QQALAAVADQ LEALDISSRW LETSHAFHST MLASVQAELT AFAATLTFHL PTIPYLSNVT
GTWITAEQAT DPGYWAAHMC QTVQFAASVA VLAQDANRVM VEIGPGQALG SFVKQSPAWG
RDRLDLVLST LPSQHEGLSD STALLTALGR LWLLDVPIDW QEFAVGEQRR RVPLPTYPFE
RQRYWLEPSR KVGGLAEQSD KLDLMGLPRD PANEWFYLPA WKQSAPYLPA LATSSDDSPQ
CWVIFEDACV VGQQIGACLR QQGQHVVSVH AGAAFFKNGD SYTINAGQRA DYDELFNDLY
QQNRRPTNIV HLWTVTPPLP HPLPDRLLGP VLDASFYSLI HLGQALGDLD LEACTITVVS
SDMQSVIGSE RQCPEKATLI GPCKLLQFEY AALGCRSVDI VLPERGSWEE EALISHLLGE
LTAATADTQV ALRGNRRWVH SLDRLKLAAQ ESSAPRLRKH GVYLITGGLG GIGLALAEHL
ARTQQARLAL VGRSGLPERS EWPALLARQP EHAHAGKIRQ IQAIEALGGE VLVLRADVTD
TGEIEAAVAQ TIERFGSLHG VLHAAGVPGV GLISLKTRET AASVLAPKVQ GTRALAHALN
NMPLDFLALF SSVASATGGG AGQVDYCAAN AYLDAFAHSR AGQPSLVVSI GWCEWLWNAW
DEAMSSYDSA TQEFFRDYRE RFGLRFDEGS QALDRVLSHR FPNVFVSTQD LRAIVRMMEN
AKIDVLEQPE AQGARHARPS LGTSYVKPQS KLEQAIAAVW SERLGIAEIG LNDNFFELGG
NSLIGVDLLN RLRKRLQISQ IPAYVLYEAP TVGAMAKFLE PGQDTNAAIQ ERHDRGAKRR
DRQAQLKLRS