Gene Haur_3963 details


Gene Information

Locus tag: Haur_3963
Symbol:
ID: 5735824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4995179
End bp: 5004451
Gene Length: 9273 bp
Protein Length: 3090 aa
Translation table: 11
GC content: 66%
IMG OID: 641281113
Product: Beta-ketoacyl synthase
Protein accession: YP_001546723
Protein GI: 159900476
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
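
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 4995179
end_bp = 5004451
gene_length_bp = 9273
protein_length_aa = 3090

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)

print(span_bp == gene_length_bp)                      # coordinate span matches Gene Length
print(remainder == 0)                                 # length is a whole number of codons
print(codons - 1 == protein_length_aa)                # codons minus stop matches Protein Length
```

Under these assumptions all three checks hold for this record.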


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGGC CAACGTCGCA CGATGCTAAT TACGACACTG CGGTGGCGAT CATCGGCATG 
GCCGGGCGCT TCCCCGGCGC CCAGAACGTC GATGAGCTGT GGCGGAACAT CACCGCCGGC
GTGCGTTCGA TCCGCAGCTA CTCGGATGCG CAACTGCTGG CGGCGGGCGT CGATCCCGAA
GTGCTCAAAC TGCCGAACTA TGTGAAGGCG GGGACGTTTC TGGATAGCTA CGACCACTTC
GACGCCAGCT TCTTCGGCTA CACGCCGCGC GAAGCCGAGG TGATGGACCC GCAGCACCGC
CTGTTTCTGG AGTGCGCATG GGAAGCCTTG GAGCAGGCCG CCTGCGATCC TGGCGCCTAC
GCCGGCTCCA TCGGCGTCTT CGCCGGGTCG AGCATCTCGC TGTATATGAT CAACAACCTC
TTCACCAATC CGGAGGTACT GGAACTGGCT GGCGGCTTGC AGATCGACAT TGGCAACTCG
GTTGACGCGT TAGCCTCGAC CGTCTCGTAT AAATTCAATC TGCGCGGGCC GAGCGTGGCG
GTACAAACCT ACTGTTCCAC CTCGCTGGTC GCGGTCCACA TGGCCTGCCA GAGCCTGCTC
ACCTATGAGT GCGGCATCGC CTTGGCCGGC GGCACGGCCA TCAGCATTCC CCATGGCACC
GGCTATCTGT ACCAGGAGGG CGGCATCCTG TCGCCCGACG GCTATTGCCG CACCTTCGAC
GCCAAGGCTC AGGGCAGCGT GGTGGGCAAT GGCGTCGGCA TCGTCGCCCT CAAACGTATG
AAAGACGCCA TCAAAGATGG CGACCATATC TATGCCGTCA TCCGCGGGTC GGCGATCAAC
AACGACGGTA TCCGCAAGGT CGGCTACACC GCCCCGGGTC TCACCGGCCA ATCTGCGGTG
GTTACCCGCG CACTGCGCCG CGCCGGCGTC AAACCGGAAA CGATCGGCTA CATCGAGACC
CACGGCACGG CTACGCCGCT CGGCGACTCA GTGGAGCTTT CGGCGTTGAT CAAGGTCTTC
GAGCGTGGCG CCGAGCGCAA ACAATTCTGC GCGCTGGGGT CGGTGAAGCC CAACATCGGC
CACCTCGATC GGGCCTCGGG TGTCACCGGT CTGATCAAGG CGGCCCTGGC CCTGCACCAC
CGTCAGCTGC CGCCGCATCT TGATTTCGAG ACGACCAGTC CCGATATCGA CCTGGCTAAC
AGCCCATTCT ACGTCAACAC CCAGCTACGT GACTGGCCCG CGGATGGCGC TGCGCCGCGC
CGGGCTGGCG TCAGTTCGTT CGGCCTGGGC GGCACCAACG CCCACGTCGT GCTGGAAGAA
GCACCGGTGC CGATGCCCGC GACCATCATC CCTGCCCGCC CGGCCGAGCT GCTGGTGCTG
TCGGCCAAAA CTGAAACGGC ATTGGAAACG GCGACCGATA ACTTGGCCGC CTACCTCGAA
GCGCAGACCG TTGATCTGGC CGATGTGGCC CACACCCTGC GCGTCGGCCG GGCGCCTTTC
AGCTATCGGC GCGTGCTCGT GGCCAGCGAC TCCCGCGATG CTGCGACTGC CCTCAAAGCC
CGTGATCCCC GCCGCCTACT GAGTTTTCAG CAAGTTACCC GCGATCAATC GCTGGCGTTT
GTGTTCCCCG GTGTGGGCGA CCACTACGCC GGCATGGCCG GGACGCTGTA CGCGACTGAG
ACGGTGTTCC GCGAGGCGGT CGACCGGTGC GCTGAGTTAC TGGTCTCGCG CCTCGGCCAG
GATCTGCGCG CCGCCCTGTA TCCCGCTGAC CAGGCCGTTC CTGCGTCAGC CTCAGCGCTC
CTGTCAGCCC TGCCGGGAGG CAGTGGGATA GCGGCGGGGG CGCTGCATCA GACGGCGTTG
GCCCAGCCGG CGGTGTTTGT GGTGGAGTAT GCGTTAGTCC AACTGCTGGC GAGTTGGGGT
ATCCGGCCGC AGGCGCTGCT CGGCTACAGC GTGGGCGAAT ACGTCGCGGC GACCATCGCC
GGGGTGTTGA GCCTGGAGGA CGCGCTGACC TTGGTGGCCA AACGCGCCCA GTGGATCCAG
GCCCAGCCGC ATGGCGCGAT GCTGGCCGTC TCGCTGGGCG TGGATGCGAT CAGCGCCTAT
GCCGAGGGCG AGGTGGTGCT GGCGGTGGTC AACAGCCCGA TGACCTGTGT TCTGGCTGGC
CCGCACGCCG CATTGGAGGC GGTCAAGGTG CGGCTGGACG CGGACGAGGT GGCGAGTCGC
TGGCTGGAGA CGAGCCACGC CTTCCACTCG CCGATGCTGG CGCCGGTGGC GGCCGAACTG
ACCGCGCTGG TGCGCACGCT GCGGCTCCAT GCCCCGCAGA TTCCCTATAT CTCCAATGTG
ACTGGCACGT GGATCACCGA CGCCCAGGCG ACCGACCCGA GCTACTGGGC GCGGCACATG
GTCGAGACGG TGCAATTTGC CGATGGCGTC GGCACCTTGC TGGCCGATGC CCAACTCGTA
GTGCTGGAAG TGGGGCCGGG GCAGGTGCTG GGGTCGTTTA TCCGCCAGCA CCCGGCCTGT
GGCCGCGAAC GGATGAAGCA TGTGCTGGCG TTGTTGCCGG CGGCTCACGA ACGCCAATCT
GAGCTGGCTC ATGTACTCCA ATCGATTGGT CGGCTCTGGT CGCTGGGTAT TACCATCGAT
TGGGTCGGCT TTGCGCCGGC ACAGCGGCGC CGGCTGCCCT TGCCAACGTA TCCCTTTGAG
CGCAAGCGCT TCTGGGTCGA TGCCGCAAAC CGGCCGGTCG CCGCGACTCT CGCCCCGTCG
CTGGGCCGGC ATGCGGATGT GGCCGACTGG TTCTACCGGC CCGACTGGGC GCCCACCGCG
CTCGGCGCCC CGGCCGCCCC CGGTCGCTGG CTGATCCTGC CCGATGCCCA CGGACTGGGC
ACGGCGGTGG CTGCGTCTCT TCGTGCGGCC GGCCACACCG TCACCCTGGC CGCCGGGCCG
GCTGACGCTG CGGCCTACGG TTCGCTGTTC GCCACGCTGC GCGCCGACGG CGACCTACCC
AGCCACATCC TGTGGCTGGG CGGCTTGACG CCGCTCGACT CGGCCCTGAC CGGCCCGGCC
CGCTTTCAGG CGGCCCAGGC GACCGGCTAC TACGACCTGC TCCAGCTGGC GCAGGCGGTG
AGCGCGCAGG TGATCGACGA AGCGGTGCAG CTCGTGGTGG TGACCGCTGG GATGCAGGCG
GTCGGTGCGA ACACGATCCC CGTGGCCGAA CACGCCACGC TACTGGGGCT GGCGACGGTG
ATCGGTCAGG AAAACCTGAC CATCCGCGTG CGCAGCGTCG ACCTTGCGGT GGCCGATGAC
GCCGCGGTCG GGTTGCTGGC GGCCGAGTGC TTGGCGACCA GTGATGCGCT GCGCGTCGCC
TACCGCGATG GTCAGCGCCT AGAAGAAACT TACCAACCGA TTCGGCTGGA GGCGCCGGGA
TCGCCGGTGG TGCGGTCGGG TGGGGTGTAT GTGATCACAG GTGGGCTGGG CGGGGTCGGG
CTGGTGCTGG CGGAGCATCT GGCGCAGACG GCGCAGGCGA AACTGGTGCT GGTCGGCCGG
CAGGGCTTGC CGGAGCGGGC GGTCTGGGAC GCGTGGCTGC GCGAGCACGG CGCGGACGAC
GCCACCAGCC AGCGTATCCA GCGGGTGCGG ATGATCGAAG CCGCTGGCGG CGTGGTCGAG
GTAGTGGCAG CCGATGTGGC GCAGGTGGCC GACCTTCAAC GCGTGCTCGC GACTGCTGAA
GCGCGGTTTG GCACGCTCCA CGGGGTGCTC CACGCCGCCG GCATATCAGA CCCGCAATCG
TATCAACCTA TCCCTACACT CGGACCGAAG GAATGTGAAT GGCATTTCCA ACCCAAGGCC
TACGGCCTGT ATGCGCTGGA AGCGGCGTTG GGCGACCGGC CGCTAGACTT CTGCGTGGTC
TTTTCGTCGG TGTCCTCGGT GCTGGGCGGG CTGAGCTTTG GCGGCTATGC GGCGGCCAAC
AGCTTTATGA ACGCGTTCAC CCAGCGCCAC AACCGCACGC ACGCGGTGCC TTGGGTCAGC
GTCAACTGGG ATACTTGGCA ACTCAAGGCC GGACACGATG CTATTGGTAC GACCGTCGCC
CTGTACGAGA TGAGTCCAGC CGAGGGCACC GACGCCTTTG AGCGGGCGGT CGCCACGCGC
AACGAGCCGG TGATCATTAA CTCGACCGGC GACCTGGATG CCCGCATTCG CCAGTGGGTC
CGCCTCGAAT CCCTGCGCGC TGATGCCGCA GCGGATGATA CCATGGCGGC TCCGGCTTCG
TTCAGTCCGG TTGGGCAAAC CAGCAGCGAC TATGAGCGGC GGATCACAGA GATCTGGAAA
CATGTCCTTG GTATCGATGA GATTGGCATC CACGACAACT TCTTCGACCT GGGCGGCAAC
TCGCTGATCG CACTCCAGCT GATCGCCCGG CTCAAGAAGG AGTTCAAGAC CCAAGTACCG
GCGGTGGCGA TCTTTGAAGC ACCCACCGTC AGCGCGCTGG TTCAGTACCT GTTGCCAGAC
GCGCCCGCCG TTGTGCCCGC CGATGCGCGG CTGGCCGAGC GGCGGCAACG GGTGCGCCAG
ACCGCCGAGC AGGATGGCAT CGCGATCATC GGCATGGTCG GGCGCTTCCC CGGCGCCTCC
ACCGTCGATG CCCTCTGGCA GAATATACGG AATGGCGTGG AGTCGACGAC CCACTTCACC
GACGCAGAGC TGCTGGCGGC CGGTGTCGAT CCGCTGCTGG TGCAGCACCC CGACTACGTC
AAATCGCGAC CGCTCCTGAA GGATGATGTC AGCTTGTTCG ATGCGGCGTT CTTCGGCTAC
ACGCCGCGCG AGGCGGAGTT TCTCGACCCG CAGCAGCGCT TGTTCCAAGA ATGCGCCTGG
GAGGCCCTAG AGCAGGCTGG CTACGATACC CAGCGCTACC CCGGCCTGGT CGGCGTCTTC
GGCGGCACCA ACATGAATTA CTACTTCCAT CATTTGATGG ACGACCATGC GCTCCGCGAG
CACATGAGCG AAGCGATAAT GTTGCAGAAT GACAAGGACG CGCTGGCGAC CTATGTCTCT
TATAAACTTG ACCTGCGTGG GCCGAGTTTC AGCATCCAGA CCTATTGCTC GACTTCGCTG
GTCGCCACCC ACCTGGCTTG CCGCAGCCTG CGTGCTGGCG ACTGCGATAT CGCCCTCGCT
GGCGGCGTGT CGGTCCGTGT CCCAGTCAAC ACCGGCTATC TGTTCCAGGA AGGCGATCAG
GTGTCACCGG ACGGCCACTG CCGGACGTTC GACGCCAACG CGGGCGGAGC GACCTTCGGC
GACGGGGTGG CGATCGTGGT GCTGAAGCGG CTGGCGGACG CGCTGGCCGA CGGCGACACT
ATCCACGCCG TGATCCGTGG GTCGGCGATC AACAACGACG GCGGCCTCAA GGTCGGCTAC
ACCGCACCCA GCGTGGTCGG GCAGGCGGCG GTGGTGCAGG CTGCCCTTGC CGACGCCAAT
CTGGCCGCCG ATGCCATCTC GTATGTCGAG GCCCACGGCA CCGCCACCAA GCTCGGTGAC
CCGATCGAGG TTGCCTCATT GACCAAGGCC TATCGCACGA TGACTGATAA AGTTGGCTTC
TGCGCGATCA GTTCGGTCAA ACCGAACGTC GGCCACCTCG ACCGGGCGGC CGGCGCGACC
GGCTTGATCA AAACGGTTAT GGCGCTGAAG CACAACGTGA TTCCGCCGAC CTTGCACTTC
CAGGCGCCCA ACCCCGAGAT CGACTTCGCC AGCAGCCCGT TCTTTGTGCC GACCGCGCTC
ACGCCGTGGA CGCGCAATGG CACACCGCGC CGGGCCGGGG TCAACTCACT GGGTGTGGGT
GGAACCAATG CCCACGTCAT CGTGGAGGAA GCACCGCAGG TCGGGCCAAG TGGCCCCGGT
CGGGCGGTCG AACTGCTGGT GCTGTCGGCC AAAACGGCGA CCGCGCTGGA GGCAGCGACC
ACGAATCTGG CGGCTCATCT GGAGGAGCAG CCGACGGTGA ATCTGGCTGA TGTGGCCCAC
ACACTCCAGG TTGGGCGGCG GGTGTTTGAA CATCGCCGGG TCGTGGTCGC CCGCGATGCG
ACGAGCGCTG CGGCGCTGTT GCGGAGCGGC GATGCGCGGC GGGTGCTGAC GCTGGCACAA
AAGCCGACCA GTCGGGGTGT GGCCTTCGTG TTCCCGGGTG TGGGCGACCA CTACGTCGGC
ATGGCGGAGG GGTTGTACGC GACCGAGGGA GTATTCCGCG CGACGGTTGA CCGCTGCTGC
GCGCTGCTGA CACCGCTGCT CGGATCGCCC ATTCGGAAGG AAATCTACCC GGATGGCGGC
GCGCCGGTCT CGGCGAGCAT CGACCTGCGC GTTTTGCTCG GCCGACCGGC AGTGCCGGGG
TCGGCAGGGC GCTTGCACCA GACGGCGTGG GCGCAACCGG CGGTGTTCGT GGTGGAGTAT
GCGTTGGCGC AGCTGCTGGC GAGCTGGGGC ATCCGGCCGC AGGCGTTGCT CGGCTACAGC
GTGGGTGAGT ACGTGGCGGC GACGGTCGCT GGGGTGTTGA GCCTGGAGGA TGCCTTGACC
CTAGTCGCCA AGCGTGCCCA GTGGATTCAG GCCCAGCCGG CCGGGTCGAT GCTGGCGGTG
AGCTTGAGTG CCGAGGCGAT CGGTGCGTAT GTGGGCGGTG CGGTGGCGCT GGCAGTGGTT
AACAGCCCGA TGACCTGTGT CCTGGCCGGT CCCCAGTCCG CGTTGGAGGC AGTGAAAACC
CGCTTGGACG GTGATGAGGT GGCCAGCCGC TGGCTGGAGA CGAGCCACGC CTTCCACTCG
CCGATGTTGG CGCCGGTGCA GGCCGAGCTG ACCGCACTGG CTGGTACGCT GCGGCTCCAG
GCACCGCGCA TCCCGTATGT CTCCAACATC ACCGGCACCT GGATCACCGA TGCGGAAGCG
ACCGACCCGG GCTACTGGGC ACGGCATATG GTCGAGACGG TGCAATTTGC GGACGGCGTT
GGCACGCTGC TGGCCGATGC CCAGCTCGTG GTGCTGGAAG TGGGGCCGGG GCAGGCGCTG
GGGTCGTTTA TCCGGCAGCA CCCGGCCTGC GGACGCGACC GGTTCGGCCA GATCGTGGCC
ACGGTGCGTG GGATGACGGA CACGAGCGAT GACCTGGAGG TGCTGTTGAG CGCGCTGGGG
CGGCTGTGGC TGCACGATGT GGTGGTCGAT TGGGCCGGCT TCCGTGGCAG CGAAGTCCGC
CAGCGTATCC CGTTGCCCAC CTACCCCTTC GAGCGCCAGC GCTTCTGGGT CGAGCCGAAT
CCCAACGCTG TGGCCGTGCG GACGCAGCTG CAATCAGTGC GCCGGCCAGA TGTCGGCGAC
TGGTTTGCGG CCGCCTCGTG GAAGCGCGGC TTGCCCTTCG ATGCTGAGGC GACGGCCGAG
CGCTTGAAGG AATCGCGCTG CTGGTTGGTG TTTCAAGATG CCTGCGGGGT TGGCGCGGGT
CTGGCAGCAT GGCTGGAAGA GCGTGGACAG ACCGTGATCA CGGTGACGCC CAGCGCAGCC
TTTACACAGC TCGGCGACTC CCACTACAGT GTCCGCCCGG CCGAGCGCGA CGATTGTACT
GCGCTGTTGC AAGCACTGGA GCGCCAGGGC CAGACGCCTA GTCGCATCGT CCACGCTTGG
TTGGTCGCTC CTGCCGACCA CGCCTCCGAC CTTTCGGATG TCGTCTTGGA TCAAACCCTT
CAAATGGGCT TTTACAGCCT GCTGGCATTG ACCCAAGCGT TGGGCGATCA AGGCGTTGAT
GGCTGTCAGA TCGACATCAT GACCTCGGAT ATGCAAGAAG TCACTGGCCA CGAGCCATTG
CAAATCGCCA AGGCCACTGT GATCGGCCTA TCCAAAATCA TCCCGCAGGA ATATCCCAAC
CTAACCGCGC GATCCATCGA TCTGAGCCTG CTGGCAGGCG GGTTGCTCTC CCAGCAGTTG
ATTGCGGAAA TCGCGACGGA GCTGGTACAC CCGCCGACCG GCGACCAGAT TGCATTCCGC
GGCATCCACC GCTGGGTCCA GGTTTTCGAA CCGCTCAATT TGCCGGCGGC GCCCGCGTCG
CATCCGCGCT TGCGGATGGG CGGGGTGTAC CTGCTGACCG GCGGCTTGGG CGGGATCGCC
CTCGGCTTGG CCCGCGACCT GGCGGCGACG CTGCGGGCCA AGCTGGTGCT GGTCAACCGC
TCCAGCCTGC CCGATCGCGC CACCTGGTCG GCGTTGCTCG AACGCGACGA TGCCGAGCAG
GGCGTGGGGC GGCGCATCCA GCAGGTGCTG GACTTGGAGG CGCTGGGCGC GGAGGTGCTG
GTCATTCAGG CCGACGTCAC CGACGCGGTG GCAATGGCGC AGGCGGTGAA CCAGGCCCAG
GCATGCTTCG GGACAATCCA CGGCGTGCTC CACACGGCCG GCGTGCCCGG CGTGGGCTTG
ATGCAGCTTA AGGATGCCGC GACGGCGGCG GCTGAGCTGG CGCCCAAGGT TCAGGGCACG
CTGGCGCTGA CCCGTGCGCT GGCCGGGGTG CCACTCGATT TCCTCGTGTT GTTCTCGTCG
GTGACGTCGG CGACAGGCGG CGGGCCGGGC CAAGTGGCCT ACTGTGCCGC CAACGCCTTC
CTCGACGCCT ACGCCCGCAA GCATGTCACC GACCACGGCC AGACCGTCGC GGTGAGCTGG
GGTGAGTGGC GCTGGGATGC CTGGTCTGAG GGCTTGCAGG GCTTCCCGGA GGAGATCCAG
GCTAAGTTCC GTGCCTATCG GTCCACGTTC GGCATTACCT TCGAGGAAGG CGCAGAAACA
TTGCGCCGCT TGTTGGCGCG CCGCTTCCCT CATCTGTTTG TGACCAGCGA TGATTTGCTG
GCGATGGTTG AAGGCAGTAA ACAAATCTTC GCCTCGGGCG GCGGCCTTGT GGGCAATCAG
GAACAGGAAA GCGTCCGCTC TACCTACCCG CGGCCCGAGG TCGGCACCTC GTTTGTCGAG
CCGCAAAGCG ACCTAGAACA CCAAATCGCG GGTCTGTGGA GCGAACTCCT GGGCATCGCG
CCGATCGGGG CCAATGATAA CTTCTTCGAC CTCGGCGGCA ACTCACTGCT TGGTATATCG
CTGTTTGGCC GCATGCGGAA GACCTTGAAG CTGGATAAGC TTCCGGCGCA TGTTTTGTAT
GAAGCGCCCA CCGTCAAAGC GCAGGCGGAT TACATCTCCC AGGAACAGGC CGCGACGGGC
GGCCCGGCGG TGCCGAAGCT CCAGGAGCAA GCTGCAAAGC GTCGCGAGCG GATGAGCGGT
TTCAAGAAGA AAGCTCAGTT GGAGGGTCTA TGA
 
Protein sequence
MSGPTSHDAN YDTAVAIIGM AGRFPGAQNV DELWRNITAG VRSIRSYSDA QLLAAGVDPE 
VLKLPNYVKA GTFLDSYDHF DASFFGYTPR EAEVMDPQHR LFLECAWEAL EQAACDPGAY
AGSIGVFAGS SISLYMINNL FTNPEVLELA GGLQIDIGNS VDALASTVSY KFNLRGPSVA
VQTYCSTSLV AVHMACQSLL TYECGIALAG GTAISIPHGT GYLYQEGGIL SPDGYCRTFD
AKAQGSVVGN GVGIVALKRM KDAIKDGDHI YAVIRGSAIN NDGIRKVGYT APGLTGQSAV
VTRALRRAGV KPETIGYIET HGTATPLGDS VELSALIKVF ERGAERKQFC ALGSVKPNIG
HLDRASGVTG LIKAALALHH RQLPPHLDFE TTSPDIDLAN SPFYVNTQLR DWPADGAAPR
RAGVSSFGLG GTNAHVVLEE APVPMPATII PARPAELLVL SAKTETALET ATDNLAAYLE
AQTVDLADVA HTLRVGRAPF SYRRVLVASD SRDAATALKA RDPRRLLSFQ QVTRDQSLAF
VFPGVGDHYA GMAGTLYATE TVFREAVDRC AELLVSRLGQ DLRAALYPAD QAVPASASAL
LSALPGGSGI AAGALHQTAL AQPAVFVVEY ALVQLLASWG IRPQALLGYS VGEYVAATIA
GVLSLEDALT LVAKRAQWIQ AQPHGAMLAV SLGVDAISAY AEGEVVLAVV NSPMTCVLAG
PHAALEAVKV RLDADEVASR WLETSHAFHS PMLAPVAAEL TALVRTLRLH APQIPYISNV
TGTWITDAQA TDPSYWARHM VETVQFADGV GTLLADAQLV VLEVGPGQVL GSFIRQHPAC
GRERMKHVLA LLPAAHERQS ELAHVLQSIG RLWSLGITID WVGFAPAQRR RLPLPTYPFE
RKRFWVDAAN RPVAATLAPS LGRHADVADW FYRPDWAPTA LGAPAAPGRW LILPDAHGLG
TAVAASLRAA GHTVTLAAGP ADAAAYGSLF ATLRADGDLP SHILWLGGLT PLDSALTGPA
RFQAAQATGY YDLLQLAQAV SAQVIDEAVQ LVVVTAGMQA VGANTIPVAE HATLLGLATV
IGQENLTIRV RSVDLAVADD AAVGLLAAEC LATSDALRVA YRDGQRLEET YQPIRLEAPG
SPVVRSGGVY VITGGLGGVG LVLAEHLAQT AQAKLVLVGR QGLPERAVWD AWLREHGADD
ATSQRIQRVR MIEAAGGVVE VVAADVAQVA DLQRVLATAE ARFGTLHGVL HAAGISDPQS
YQPIPTLGPK ECEWHFQPKA YGLYALEAAL GDRPLDFCVV FSSVSSVLGG LSFGGYAAAN
SFMNAFTQRH NRTHAVPWVS VNWDTWQLKA GHDAIGTTVA LYEMSPAEGT DAFERAVATR
NEPVIINSTG DLDARIRQWV RLESLRADAA ADDTMAAPAS FSPVGQTSSD YERRITEIWK
HVLGIDEIGI HDNFFDLGGN SLIALQLIAR LKKEFKTQVP AVAIFEAPTV SALVQYLLPD
APAVVPADAR LAERRQRVRQ TAEQDGIAII GMVGRFPGAS TVDALWQNIR NGVESTTHFT
DAELLAAGVD PLLVQHPDYV KSRPLLKDDV SLFDAAFFGY TPREAEFLDP QQRLFQECAW
EALEQAGYDT QRYPGLVGVF GGTNMNYYFH HLMDDHALRE HMSEAIMLQN DKDALATYVS
YKLDLRGPSF SIQTYCSTSL VATHLACRSL RAGDCDIALA GGVSVRVPVN TGYLFQEGDQ
VSPDGHCRTF DANAGGATFG DGVAIVVLKR LADALADGDT IHAVIRGSAI NNDGGLKVGY
TAPSVVGQAA VVQAALADAN LAADAISYVE AHGTATKLGD PIEVASLTKA YRTMTDKVGF
CAISSVKPNV GHLDRAAGAT GLIKTVMALK HNVIPPTLHF QAPNPEIDFA SSPFFVPTAL
TPWTRNGTPR RAGVNSLGVG GTNAHVIVEE APQVGPSGPG RAVELLVLSA KTATALEAAT
TNLAAHLEEQ PTVNLADVAH TLQVGRRVFE HRRVVVARDA TSAAALLRSG DARRVLTLAQ
KPTSRGVAFV FPGVGDHYVG MAEGLYATEG VFRATVDRCC ALLTPLLGSP IRKEIYPDGG
APVSASIDLR VLLGRPAVPG SAGRLHQTAW AQPAVFVVEY ALAQLLASWG IRPQALLGYS
VGEYVAATVA GVLSLEDALT LVAKRAQWIQ AQPAGSMLAV SLSAEAIGAY VGGAVALAVV
NSPMTCVLAG PQSALEAVKT RLDGDEVASR WLETSHAFHS PMLAPVQAEL TALAGTLRLQ
APRIPYVSNI TGTWITDAEA TDPGYWARHM VETVQFADGV GTLLADAQLV VLEVGPGQAL
GSFIRQHPAC GRDRFGQIVA TVRGMTDTSD DLEVLLSALG RLWLHDVVVD WAGFRGSEVR
QRIPLPTYPF ERQRFWVEPN PNAVAVRTQL QSVRRPDVGD WFAAASWKRG LPFDAEATAE
RLKESRCWLV FQDACGVGAG LAAWLEERGQ TVITVTPSAA FTQLGDSHYS VRPAERDDCT
ALLQALERQG QTPSRIVHAW LVAPADHASD LSDVVLDQTL QMGFYSLLAL TQALGDQGVD
GCQIDIMTSD MQEVTGHEPL QIAKATVIGL SKIIPQEYPN LTARSIDLSL LAGGLLSQQL
IAEIATELVH PPTGDQIAFR GIHRWVQVFE PLNLPAAPAS HPRLRMGGVY LLTGGLGGIA
LGLARDLAAT LRAKLVLVNR SSLPDRATWS ALLERDDAEQ GVGRRIQQVL DLEALGAEVL
VIQADVTDAV AMAQAVNQAQ ACFGTIHGVL HTAGVPGVGL MQLKDAATAA AELAPKVQGT
LALTRALAGV PLDFLVLFSS VTSATGGGPG QVAYCAANAF LDAYARKHVT DHGQTVAVSW
GEWRWDAWSE GLQGFPEEIQ AKFRAYRSTF GITFEEGAET LRRLLARRFP HLFVTSDDLL
AMVEGSKQIF ASGGGLVGNQ EQESVRSTYP RPEVGTSFVE PQSDLEHQIA GLWSELLGIA
PIGANDNFFD LGGNSLLGIS LFGRMRKTLK LDKLPAHVLY EAPTVKAQAD YISQEQAATG
GPAVPKLQEQ AAKRRERMSG FKKKAQLEGL
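
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the method used to produce this record: the `translate` helper and its names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in their permitted start codons). It translates only the first and last lines of the gene sequence above.

```python
from itertools import product

# Codon-to-amino-acid assignments in the usual T, C, A, G ordering.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the spaces between 10-base groups) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGAGCGGGC CAACGTCGCA CGATGCTAAT TACGACACTG CGGTGGCGAT CATCGGCATG"
last_line = "TTCAAGAAGA AAGCTCAGTT GGAGGGTCTA TGA"

print(translate(first_line))  # MSGPTSHDANYDTAVAIIGM
print(translate(last_line))   # FKKKAQLEGL
```

The two outputs match the first 20 and last 10 residues of the protein sequence, and the final codon TGA is a stop codon. Passing the full gene sequence to the same function would allow the whole protein to be compared.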