Gene Haur_3966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3966 
Symbol 
ID5735827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5027829 
End bp5037155 
Gene Length9327 bp 
Protein Length3108 aa 
Translation table11 
GC content66% 
IMG OID641281116 
ProductBeta-ketoacyl synthase 
Protein accessionYP_001546726 
Protein GI159900479 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGATA GAGCAGCATA TGATGATGCA TACGACACAG CCATTGCCAT CGTAAGCATG 
GCCGGGCGCT TCCCAGGCGC GTCGAATGTT GATGCGTTCT GGCAGAATCT GGTCGCTGGC
GTGCGCTCAA TCCTCCCGAT TGCCGACGAG GATATGCTGG CCATGGGCAT GGATCCCGCA
GTGCTTCGCG ACCCGCAGTA TGTCAAGGCT GGCGCGTTTG TCGACGATGT CGATCTGTTC
GACGCCAGTT TCTTCGGCTA CACGCCGCGC GAGGCCGAGA TCATCGATCC CCAGTACCGG
CTATTTTTGG AATGTGCTTG GGAAGGACTG GAACGGGCTG GCTATGACCT GGAAACCTAC
CGCGGAGCCA TCGGCATGTT CGCCGGCTCC GGCCACCCGA CGTATGCTCT GGAGAATATC
GCCTCCAACC CAAGCATCAG GGAGTGGGCC GGTGACCTTC AGGTTAATAT CAACAACGAA
AAGGACTCGC TAGCGACGAT CGTCTCGTAC AAGCTGAACC TGCGCGGGCC GAGCGTGGCA
GTGCAGACGT TCTGCTCGAC CTCGCTGGTC GCGGTCCACA TGGCCTGCCA GAGCCTGCTG
ACCTACGAGT GTAATATCGC GCTTGGCGGC GGCTCGGCGC TGAATCTGCA ACAGGGCCTC
GGGTATTTCT ACCAAGAAGG CGGCATCTTG TCGCCGGACG GCCGCTGCCG CACGTTCGAC
GCCCAGGCCC AGGGCAGCGT GATGGGCAGC GGCGTCGGCG TCGTCGTGCT CAAGCGCTTC
GAGGACGCGC TCAACGATGG CGACACGATC TACGCCATCA TCCGCGGGTC GGCGATCAAC
AACGATGGCA TCCGCAAGGT CGGCTATACC GCCCCCGGTC TGGGCGGCCA GTCATCGGTG
ATCTCCACTG CGCATCAGCG CGCCGGGGTT GCGCCAGAGT CAATCGGCTA TATCGAGGCC
CACGGTACGG CCACGCCGCT CGGCGACTCG ATCGAGCTGG CCGCCTTGAT CAAAGCCTTC
GAGCGTAAAA CCACTCGCCA GCAATTCTGC GCGCTGGGGT CGGTCAAGCC GAATGTCGGC
CACCTCGACC GGGCGTCGGG GGTGACGGGC CTGATCAAAA CGACCATGGC CCTGTATCAC
CGCCAGCTGC CGCCCAACCT TGATTTCGAG CGCACCAGCC CGGATATCGA TCTGGCCAGC
AGTCCGTTCT ACGTCAACAC GCAACTGCGC GAGTGGGCCG CTGAGGCCGG GAGCGTGCGA
CGGGCCGGCG TCAACTCGTT TGGTCTGGGC GGCACCAACG TCCACGTCGT GCTGGAGGAA
GCACCCGCAC CGGCGCCCGT GGCTCCGGCT CGCCCGGCCC AGTTGCTGGT GCTGTCAGCC
AAGACCGCGA CCGCCCTGGA GGCCATGACC GACAACCTAG CCGCCTATCT GGCCGGCGCC
CCTGCCGATC TGGCCGACGT GGCCTTTACC CTCCAGATCG GCCGCACGGG TTTCAACCAC
CGGCGGATCG TCGCCGGAAC CAGTGCCGCC GATGTTCGTG CCGCCCTGGA GCAGCGCGAT
GGGCGGCGCG TGCTGAGCGC CACGCAGACT GGACGCAACC GGCCGGTGGC GTTTGTGTTC
CCCGGCGTGG GCGACCACTA TGCCGACATG GCCAAGACCC TGTACGCGAC CGAGGCGGTC
TTCCGCGAGG CGGTTGATCA GTGTGCCGAA TTGCTGGCCC CACGCCTTGG CCAGGATCTG
CGCGCCGCCC TGTATCCCGC CGATCAGCCA GCCGCAGCCG CGGCCCACAC GCTGTTTGCG
GCTACTGCGG CGAGCAGTCG TGTGGCGGGA GCGCTGCATC AGACGGCGCT GGCCCAGCCG
GCGGTGTTTG TGGTCGAGTA TGCGTTAGTC CAACTGCTGG CGAACTGGGG TATCCGGCCG
CAAGCGCTGC TCGGCTATAG TCTGGGCGAG TACGTGGCGG CGACGGTCGC CGGCGTGCTG
AGCCTGGAGG ATGCCCTGAC CCTGGTTGCC ATGCGCGCCC AGTGGATTCA GGCCCAGCCG
CATGGCGCGA TGCTGGCCGT CTCGCTGGGC GCCGAGGCCA TCCAGCCCTA CCTGAATACC
GAGGTGGCGC TGGCGGTGGT CAACAGCCCA ATGACCTGCG TCCTGGCCGG TCCGCAGGCG
GCGTTGGAGG CGGTCAAAAT CCGTCTGGAA GAAGATGAGG TGGCCAGCCG CTGGCTGGAG
ACGAGCCACG CGTTCCACTC GCCGATGCTG GCGCCGGTGG CGGCCGAACT GACCGCGCTG
GTGCGCACGC TGCAGCTCCA GACACCCAAA ATTCCCTATA TCTCCAATGT GACTGGCACG
TGGATCACCG ACGCCCAGGC GACCGACCCG AGCTATTGGG CACGGCATAT GGTCGAAACG
GTGCAGTTTG CCGATGGCGT CGGCACCTTG CTGGCCGATG CTCAGCTGGC GCTGCTCGAA
GTGGGACCGG GCCAAGCGCT GGGGTCGTTT ATCCGCCAGC ATCCGACCTG CGGGCGTGAC
CGGTTTGGCC AGATCGTCGC GACCTTGCCA GTGGCGGCCG AGGCCACTAA TGATCTGGTG
GCGCTGCTCA ACGGGCTAGG GCGGTTGTGG CTAGCTGGCG TCCCCGTCGA TTGGGCCGGT
TTCCACGGCG GCGCGGCTCG CCAGCGCGTC CCGCTGCCGA CGTATCCCTT TGAGCGCAAG
CGCTTCTGGA TCGATGTGCA ACGGCCGGAG AAGGCCGCCG CCCTCGCCGA GACGACCGCG
GGCCGCAAAC CCGACATCGC CGACTGGTTC TATCAGCCAG CGTGGGTGCG CGCGGAACTG
CTGGGTGCAC CGGTCAAGCC GGGCTGCTGG CTGGTGTTTG CCGACCAGCA CGGTCTGGGC
AACGCTGTCG GCCAGCGGTT GATACTGGCC GGCCATCGCG TCGTGCAGAT CGAGGCCGGA
TCAGATTTCG TCCAGTTCGA CGAGCAGCAC TTTCAGATCC GACCGGGCCA GATTGACGAT
TACCAGAACC TTTTGAAGAC GGCGATCGGC GCGGGATACC TGCCGACCCA CGTCCTCCAT
CTGTGGAGTC TGGCGCCGGC GGCGGGTCGG GCGACGGGTT CCGGGCGTTT CGCAGCCATT
CAGGAGTACG GCTTCACCAG CTTGCTAAAC CTGGCCTTCG CCCTCGGCAG CCAGCTGATC
GACGATCCGG TTGAGCTGCT GGTGGCGACT GCCAGTATCC AGGCCGTCGA TCCGAGCGAC
CTCCCCGATG CCGACAACGC CACAATCCTT GGTGCCTGCA CTGTGATCGG CCAGGAAAAC
CTTTCTATCA CGGTCCGCAA CATCGACCTA AGCCTGCCAG CTGACCTGAA CAGTGCCGAA
CTGCCGGTCG CCGCGCTGGT CGCCGAGTGT CAGAAGCAAA ACAGCGACCT CCACGTCGCT
TATCGCGGCA ATCAGCGATT TGTCCAGCAG TACCAGCCCT TGCGGCTAGA GGTGCCGGAG
TCGCCGGCGG TGCGGCCGGG CGGGGTGTAT GTGATCACGG GCGGCCTGGG CGGGGTCGGA
CTGGTGCTGG CGGAGCATCT GGCGCGGACG GCGCAGGCGA AGCTGGTGCT GGTCGGCCGG
CAGGGCTTGC CGGAGCGGGC GGCCTGGGAC GCGTGGCTGC GCGAACACGG CGCGGACGAC
GCCACGAGCC AGCGTATCCA GCGGGTGCGG ATGATCGAAG CCGCCGGTGG CAGCGTTCTG
GTTGTTGCCG CCGATGTGGC CATGGTGGCC GGTTTCCAGC GCGTGATTGC GGCGACCGAG
CAGCAGTTCG GTTCGATCAA CGGCGTCTTG CACGCCGCCG GTATCTCGGA TAGCACCGCG
TACAACCTAA TCCAGTTATT GGAACAGGCG ACCTACGCCG CTCACTTCCA GCCGAAGGTG
TACGGGTTGT ACGCACTGGA AGCGGCGCTG GGCGACCGGC CGCTGGACTT CTGTGTGCTG
TTTTCCTCGA TCTCGGCGGT GCTGGGTGGG TTGGGCTTCG CGGGCTACGC GGCGGCCAAC
TGCTTCATCG ATGCCTTTAC CGAGCGCCAC AACCGCACGC ACGCAGTGCC TTGGGTCAGC
GTCAACTGGG ATACTTGGCA ACTCAAGGTC GGCCAGCACG ATGTGATCGG CGCGACCGTC
GCCCAGTACG AGATGAGTCC GGCCGAGGGC GCGGCGGCCT TCGAGCGCGT CGCTGCGGCC
CGGGGCCACA CCCAGATTGT CAACTCGACC GGCGACCTCG ACGCCCGCAT TCGCCAGTGG
GTCCGCCTGG AATCGGTGCG CGCCGACGCC GCCGCAGCGG CCGACACCGC CGTGCAAACG
GTGCCCCGCG GCACACCGCT TTCCAGCAGC GAGTATGAGC AGCGGGTGGC TATGGTCTGG
CAGCAGATTC TGGGTATCGA CGAGGTGGGC ATCGACGACA ACTTCTTTGA CCTGGGCGGC
AATTCGTTGG TCGCGCTCCA ATTAGTGACG CGGCTCAAGA AAGAGCTGAA AACCCAGATC
CCGATGGTCG CCTTGTTTGA GGCGCCGACG GTGCGGGCAA TGTCGCAATT GCTGCGCCCC
GAAACCGCGC CCGATGTCGA CCAGCAGGCG CTGCTCTTGC AACAGCGCCG CGAGCAAACG
CGCCAGACGG TGCAGGCGGA TGGCCTCGCG ATCATCGGCA TGGTCGGGCG TTTTCCCGGT
GCCTCCAGCG TGGAGGAGCT CTGGCAAAAT CTGCATAACG GTGTCGAGTC GACAACCCAC
TTCACCGATG AGGAACTCGT TGCTTCCGGG GTCAATCCCC TGGAAATCCG GCACCCCGAC
TACGTCAAGT CGCGGCCGAT CCTGAAGGAT GATGTCAGCT TGTTCGATGC GGCGTTTTTC
GGCTACACGC CGCGCGAGGC GGAGTTTCTC GACCCGCAGC AGCGCTTGTT CCACGAGTGC
GCTTGGGAGG CCCTGGAGCA GGCCGGCTAC GATACTCAGC GCTATCCCGG CCTGGTCGGC
GTTTTTGGCG GCACCAATAT GAACGCCTAC CTCAACCGCA TCGCCCGCGA CCCACGCTCC
GACGGCCATA TCACTGAGAT CATCACCCTT GAGAACGACA AGGACGCGCT GGCGACCAAC
GTCGCCTACA AGTTGAACCT GCGCGGGCCA AGCTTCGCGG TGCAGACGTT CTGCTCGACC
TCGCTGGTCG CCACCCACTT GGCCTGCCGC AGCCTGCGCC ACGGCGAGTG CGATATCGCC
CTCGCTGGTG GCGTGTCGGT CCGTGTCCCG GTCAACACCG GCTATCTGTA TGAAGAAGGC
GATCAGGTGT CACCGGACGG CCACTGCCGG ACGTTCGATG CCAACGCGGG CGGAGCGACC
TTCGGCGACG GGGTGGCGAT CGTGGTACTG AAGCGGCTGG CGGACGCGCT GGCCGACGGC
GACACTATCC ACGCCGTGAT CCGTGGGTCG GCGATCAACA ACGACGGGGG GCTGAAGGTC
GGCTACACCG CACCCAGCGT GGTCGGGCAG GCGGCGGTGG TGCAGGCGGC GCTGGCTGAT
GCCAATCTGG CCGCCGATGC CATCTCGTAT GTCGAGGCCC ACGGCACCGC CACCAAACTC
GGCGACCCGA TCGAAGTGGC GGCATTGACC AAGGCCTATC GCACGACCAC CGACAAAGTT
GGCTTCTGCG CGATCAGTTC GGTCAAACCG AACGTCGGCC ACCTCGACCG GGCGGCTGGC
GCGACTGGTT TGATCAAGAC GGTTATGGCC CTGAAGCACA ACGTGATTCC GGCCACGCTG
CACTTCCAGA CGCCCAACCC CGAGATCGAC TTCGCCAGCA GCCCGTTCTT TGTGCCGACC
GCGCTCACGC CGTGGACGCG CAATGGCACA CCGCGCCGGG CCGGGGTCAA CTCGCTGGGT
GTGGGTGGAA CCAATGCCCA CGTCATCGTG GAGGAAGCAC CGCAGGTCGG GCCAAGTGGC
CCCGGTCGGG CGGTCGAACT GCTGGTGCTG TCGGCCAAAA CGGCGACCGC GCTGGAGGCA
GCGACCACGA ATCTGGCGGC CCATCTGGAG GAGCAGCCGA CGGTGAATCT GGCTGATGTG
GCCCACACGC TCCAGGTTGG GCGGCGGGTG TTTGAGCACC GCCGGGTCGT GGTCGCCCGC
GATGCGACGA GCGCTGCGGC GCTGTTGCGG AGCGGCGATG CGCGGCGGGT GCTGACGCTG
GCACAAAAGC CGACCAGTCG GGGTGTGGCC TTCGTGTTCC CGGGTGTGGG CGACCACTAC
GTCGGGATGG CGGAGGGATT ATACGCGACC GAGGGAGTAT TCCGCGCGAC GGTTGACCGC
TGCTGCGCGC TGCTGACGCC ACTGCTCGGA TCGCCCATTC GGAAGGAAAT TTACCCCGAT
GGTGGTGTTC CCGCCCAGAC CGGCGTCGAC CTGCGTGCTA TGCTGCGCGA GGACGCGACG
CCGGGGTCGG CGGGGCGCTT GCACCAGACG GCGTGGGCAC AACCGGCGGT GTTCGTGGTG
GAGTATGCGT TGGCGCAGCT GCTGGCGAGC TGGGGCATCC GGCCGCAGGC GTTGCTCGGC
TACAGCGTGG GTGAGTACGT GGCGGCGACG GTCGCTGGGG TGTTGAGCCT GGAGGATGCC
TTGACCCTAG TCGCCAAGCG TGCCCAGTGG ATTCAGGCCC AGCCGGCCGG ATCGATGCTG
GCGGTGAGCT TGAGTGCCGA GGCGATCGGT ACGTATGTGG GCGGCGCGGT GGCGCTGGCG
GTGGTCAATA GCCCGATGAC CTGTGTCCTG GCCGGTCCCC AGTCCGCGTT GGAGGCAGTG
AAAACCCGCT TGGACGGTGA TGAGGTGGCC AGCCGCTGGC TGGAAACGAG CCACGCCTTC
CACTCGCCGA TGTTGGCGCC GGTGCAGGCC GAGCTGACCG CACTGGCTGG TACGCTGCGG
CTCCAAGCAC CGCGCATCCC GTATGTCTCC AACATCACCG GCACCTGGAT CACCGATGCG
GAAGCGACCG ACCCGGGCTA CTGGGCACGG CATATGGTCG AGACGGTGCA GTTTGCGGAC
GGCGTTGGCA CGCTGCTGGC CGATGCCCAG CTCGTGGTGC TGGAAGTGGG GCCGGGGCAG
GCGCTGGGGT CGTTTATCCG GCAGCACCCG GCCTGCGGGC GCGACCGGTT CGGCCAGATC
GTGGCCACGG TGCGTGGGAT GACGGACACG AGCGATGACC TGGAGGTGTT GTTGAGCGCG
CTGGGGCGGC TGTGGCTACA CGATGTGGTG GTCGATTGGG CCGGCTTCCG TGGCAGCGAA
GTCCGCCAGC GTATCCCGTT GCCCACCTAC CCCTTCGAGC GCCAGCGCTT CTGGATCGAG
CCTGATCTGA CCCCGCGGCC GGGTAGCGGG CCGAAACTTC GACGATTTGA TGCGGGCGAC
TGGTATGCCG TGCCCTCGTG GAAGCGCGCG GTCGCCCATA ATATCGTCAA CAACGGCTCG
CTGACCGAGC CGGGGTGCTG GCTGGTGCTG GTGGATGGCG AAGGACTGGC CACCAGGTTG
ACCGCTTGGC TGGAAGATCG CGGCCAGACC GTGATCGCGG TCACACCTGG CGCGGCCTTC
ACCCAGCACA GCGCAACGGC GTATACCGTG CGTCCCGCCA GCCGTGAGGA CTTCACTGCC
TTGTTGCAGA CGCTGGAACG CCAAGGCCAG ACGCCTAGCC GCGTTGTCCA CGCCTGGTTG
GCGACCGCCG GCGACCCCGC CGCGGACGAA ACCGCCGCCG GATTCGATCA TACGCTCCAG
CACGGCTTCT ACAGCCTGCT GGCGCTGGCC CAAGCGCTGG GCGACCAGGG TGTCGAGTGG
TGCGAGATCA ACGCCGTCAC CGTAGCTATG CAGGAGGTCA CTGGACAGGA AGATCTGCGC
ATCGCCGCCG CCACGGTGAT CGGGCCGTGC AAGATTATTC CGGTCGAGTA TCCCAATCTG
ACCGCGCGCT CCATCGATAT CTTGCTGCCG GCCAGCCCCG CTGAGCGGGC GACCCTGGTC
GCGCAGCTGG GCGCCGAACT GGCCACTCCG CCGACCGGTG ACCAGGTCGC CTTCCGCGGC
GCCCATCGCT GGGTTCAGAT GATGGAGCCG GTTGCGCTGC CGGCAGTGCC CGCGTCGCAT
CCGCGCTTGC GGACGGGCGG GGTGTACCTG CTGACCGGCG GCCTGGGCGG GATCGCCCTC
GGCTTGGCCC GCGACCTGGC GGCGACGCTG CGGGCCAAGC TGGTGCTGGT CAACCGCTCC
GGCCTGCCCG ACCGTGCCAC CTGGCCGGCG CTGCTCGAAC GCGACGGTGC CGAGCAGGGC
ATGGGGCGGC GCATCCAGCA GGTGCTGGAT CTGGAGGCGC TGGGCGCGGA GGTGCTGGTC
ATTCAGGCCG ACGTCACCGA TGCGGTGGCG ATGGCGCGGG CGGTGGCTGA GGCCCAGGCA
CGCTTCGGGA CGATCCACGG TGTGCTCCAC ACGGCCGGCG TGCCCGGCGT GGGCTTGATG
CAGCTTAAGG ATGCCGCGAC GGCAGCGGCT GAGCTGGCGC CCAAGGTTCA GGGCACGCTG
GCGCTGACTC GGGCGTTGGC CGGGGTGCCG CTGGATTTCT TGGTGTTGTT CTCGTCGGTG
ACGTCGGCGA CGGGCGGCGG GCCGGGCCAA GTGGCCTACT GTGCCGCCAA CGCCTTCCTC
GACGCCTACG CCCGCAAGCA TGCCACCGAC CACGGTCAGA CCGTCGCGGT GAGCTGGGGC
GAATGGCTAT GGGATGCCTG GTCCGAGGGT TTGCAAGGCT TCGCACCCGA GGTTCAGGCA
GAATTCCGGG CCTACCGCAA GAACTTTGGG ATCACCTTCG CCGAAGGCGC CGAGGCGCTG
CGCCGCATCC TGGCCTGCCA GATTCCGCAC CTGTTTGTGA CGACCGAAGA CCTGTTGAAT
ATGTTTGAGG GCAGCAAACG CTCGGCGGCG CGGACGCAAG CCGAGCCAAC CGAACAACAG
GCGCGAACCC GTTATCCGCG GCCCGAGGTC GGCTCGTTTG TCGAGCCGCA AGGCGAGTTG
GAGCAAAGCA TCGCGCGGGT CTGGGCTGAT GTCTTGGCGA TCGAGGCGAT TGGTGCCAAC
GATAACTTCT TCGATCTCGG TGGCAACTCG CTGCTGGGAA TTGGTCTGAT CAATCAACTG
CGCCGAAATC TCAAATTGGA GAAACTGCCG GCACACGTCC TGTACGAGGC TCCGACCGTC
GCCACACTGG CGGACTATAT CAATCAATCG ACACACCCCA CGGGTTCAAG CGCGGTTCAT
CTTCCCGATC TTGACGAGGA AGCCGACAAG CGCGAAGAAC AGTTCGATTT CTTCAAAGAC
AGAGCTCAAA TGGAGGAGCT GACATGA
 
Protein sequence
MTDRAAYDDA YDTAIAIVSM AGRFPGASNV DAFWQNLVAG VRSILPIADE DMLAMGMDPA 
VLRDPQYVKA GAFVDDVDLF DASFFGYTPR EAEIIDPQYR LFLECAWEGL ERAGYDLETY
RGAIGMFAGS GHPTYALENI ASNPSIREWA GDLQVNINNE KDSLATIVSY KLNLRGPSVA
VQTFCSTSLV AVHMACQSLL TYECNIALGG GSALNLQQGL GYFYQEGGIL SPDGRCRTFD
AQAQGSVMGS GVGVVVLKRF EDALNDGDTI YAIIRGSAIN NDGIRKVGYT APGLGGQSSV
ISTAHQRAGV APESIGYIEA HGTATPLGDS IELAALIKAF ERKTTRQQFC ALGSVKPNVG
HLDRASGVTG LIKTTMALYH RQLPPNLDFE RTSPDIDLAS SPFYVNTQLR EWAAEAGSVR
RAGVNSFGLG GTNVHVVLEE APAPAPVAPA RPAQLLVLSA KTATALEAMT DNLAAYLAGA
PADLADVAFT LQIGRTGFNH RRIVAGTSAA DVRAALEQRD GRRVLSATQT GRNRPVAFVF
PGVGDHYADM AKTLYATEAV FREAVDQCAE LLAPRLGQDL RAALYPADQP AAAAAHTLFA
ATAASSRVAG ALHQTALAQP AVFVVEYALV QLLANWGIRP QALLGYSLGE YVAATVAGVL
SLEDALTLVA MRAQWIQAQP HGAMLAVSLG AEAIQPYLNT EVALAVVNSP MTCVLAGPQA
ALEAVKIRLE EDEVASRWLE TSHAFHSPML APVAAELTAL VRTLQLQTPK IPYISNVTGT
WITDAQATDP SYWARHMVET VQFADGVGTL LADAQLALLE VGPGQALGSF IRQHPTCGRD
RFGQIVATLP VAAEATNDLV ALLNGLGRLW LAGVPVDWAG FHGGAARQRV PLPTYPFERK
RFWIDVQRPE KAAALAETTA GRKPDIADWF YQPAWVRAEL LGAPVKPGCW LVFADQHGLG
NAVGQRLILA GHRVVQIEAG SDFVQFDEQH FQIRPGQIDD YQNLLKTAIG AGYLPTHVLH
LWSLAPAAGR ATGSGRFAAI QEYGFTSLLN LAFALGSQLI DDPVELLVAT ASIQAVDPSD
LPDADNATIL GACTVIGQEN LSITVRNIDL SLPADLNSAE LPVAALVAEC QKQNSDLHVA
YRGNQRFVQQ YQPLRLEVPE SPAVRPGGVY VITGGLGGVG LVLAEHLART AQAKLVLVGR
QGLPERAAWD AWLREHGADD ATSQRIQRVR MIEAAGGSVL VVAADVAMVA GFQRVIAATE
QQFGSINGVL HAAGISDSTA YNLIQLLEQA TYAAHFQPKV YGLYALEAAL GDRPLDFCVL
FSSISAVLGG LGFAGYAAAN CFIDAFTERH NRTHAVPWVS VNWDTWQLKV GQHDVIGATV
AQYEMSPAEG AAAFERVAAA RGHTQIVNST GDLDARIRQW VRLESVRADA AAAADTAVQT
VPRGTPLSSS EYEQRVAMVW QQILGIDEVG IDDNFFDLGG NSLVALQLVT RLKKELKTQI
PMVALFEAPT VRAMSQLLRP ETAPDVDQQA LLLQQRREQT RQTVQADGLA IIGMVGRFPG
ASSVEELWQN LHNGVESTTH FTDEELVASG VNPLEIRHPD YVKSRPILKD DVSLFDAAFF
GYTPREAEFL DPQQRLFHEC AWEALEQAGY DTQRYPGLVG VFGGTNMNAY LNRIARDPRS
DGHITEIITL ENDKDALATN VAYKLNLRGP SFAVQTFCST SLVATHLACR SLRHGECDIA
LAGGVSVRVP VNTGYLYEEG DQVSPDGHCR TFDANAGGAT FGDGVAIVVL KRLADALADG
DTIHAVIRGS AINNDGGLKV GYTAPSVVGQ AAVVQAALAD ANLAADAISY VEAHGTATKL
GDPIEVAALT KAYRTTTDKV GFCAISSVKP NVGHLDRAAG ATGLIKTVMA LKHNVIPATL
HFQTPNPEID FASSPFFVPT ALTPWTRNGT PRRAGVNSLG VGGTNAHVIV EEAPQVGPSG
PGRAVELLVL SAKTATALEA ATTNLAAHLE EQPTVNLADV AHTLQVGRRV FEHRRVVVAR
DATSAAALLR SGDARRVLTL AQKPTSRGVA FVFPGVGDHY VGMAEGLYAT EGVFRATVDR
CCALLTPLLG SPIRKEIYPD GGVPAQTGVD LRAMLREDAT PGSAGRLHQT AWAQPAVFVV
EYALAQLLAS WGIRPQALLG YSVGEYVAAT VAGVLSLEDA LTLVAKRAQW IQAQPAGSML
AVSLSAEAIG TYVGGAVALA VVNSPMTCVL AGPQSALEAV KTRLDGDEVA SRWLETSHAF
HSPMLAPVQA ELTALAGTLR LQAPRIPYVS NITGTWITDA EATDPGYWAR HMVETVQFAD
GVGTLLADAQ LVVLEVGPGQ ALGSFIRQHP ACGRDRFGQI VATVRGMTDT SDDLEVLLSA
LGRLWLHDVV VDWAGFRGSE VRQRIPLPTY PFERQRFWIE PDLTPRPGSG PKLRRFDAGD
WYAVPSWKRA VAHNIVNNGS LTEPGCWLVL VDGEGLATRL TAWLEDRGQT VIAVTPGAAF
TQHSATAYTV RPASREDFTA LLQTLERQGQ TPSRVVHAWL ATAGDPAADE TAAGFDHTLQ
HGFYSLLALA QALGDQGVEW CEINAVTVAM QEVTGQEDLR IAAATVIGPC KIIPVEYPNL
TARSIDILLP ASPAERATLV AQLGAELATP PTGDQVAFRG AHRWVQMMEP VALPAVPASH
PRLRTGGVYL LTGGLGGIAL GLARDLAATL RAKLVLVNRS GLPDRATWPA LLERDGAEQG
MGRRIQQVLD LEALGAEVLV IQADVTDAVA MARAVAEAQA RFGTIHGVLH TAGVPGVGLM
QLKDAATAAA ELAPKVQGTL ALTRALAGVP LDFLVLFSSV TSATGGGPGQ VAYCAANAFL
DAYARKHATD HGQTVAVSWG EWLWDAWSEG LQGFAPEVQA EFRAYRKNFG ITFAEGAEAL
RRILACQIPH LFVTTEDLLN MFEGSKRSAA RTQAEPTEQQ ARTRYPRPEV GSFVEPQGEL
EQSIARVWAD VLAIEAIGAN DNFFDLGGNS LLGIGLINQL RRNLKLEKLP AHVLYEAPTV
ATLADYINQS THPTGSSAVH LPDLDEEADK REEQFDFFKD RAQMEELT