Gene Hoch_0799 details

Gene Information

Locus tag: Hoch_0799
Symbol:
ID: 8543181
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 1044100
End bp: 1053237
Gene Length: 9138 bp
Protein Length: 3045 aa
Translation table: 11
GC content: 73%
IMG OID: 646385573
Product: KR domain protein
Protein accession: YP_003265308
Protein GI: 262194099
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
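
The length fields in this record can be cross-checked against the coordinates. Assuming the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed values), the gene length is End - Start + 1, and an unspliced CDS of that length encodes one residue per codon minus the stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1044100, 1053237)
print(length_bp)                  # 9138
print(protein_length(length_bp))  # 3045
```

Both results agree with the Gene Length and Protein Length fields of this record.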


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.482882
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCACG CCGAAAGACT TCGCCGGGCG GTTTTGCTCA TCAAGAAGAT GGAGGCGCAG 
CTCGCGGCCG AGCGCGCGAG CCGTCGCGCG CCCATCGCCA TCATCGGCAG CGCGTGTCGC
CTGCCGGGCG GCGTCGCCGA CCGCGACGCG CTGTGGAAGG TGCTCCGCGA CGGCACCGAG
ACCGCGGAAG CGGTTCCCGC CGCGCGCTGG CAGATGGCGG CCAACACGCA CGCGGGTGCC
CGTTGGGGCG CGTTCATCGA CGATGTCGAC TGCTTCGACC CCAGCCTGTT CGGCGTGTCG
CCGCGCGAGG CGGCGCAGGC CGACCCGCAA CAGCGGCTCG CACTTGAGCT GGTGTGGGCG
GCGCTCGAAG ATGCCGGCAT GGCGCCCGAC GGGCTGCACG GAACGCGCGC GGGCGTGTGG
CTGGGCATGT CCGGCGGCGA CTACAGCTCG ATGCTGTCGT GGTCGCGTCC CACGGGCCTC
GATCTGCACG GCATGACGGG CGCCATCGCC GCGTTCTTAC CCGGACGACT GTCCTACGCG
CTGGGCCTGC AGGGGCCATC GATGACCGTG GACACCGCGT GCTCCTCGTC CCTGGTCGCG
GTTCATCTGG CGTGCAATTC GCTGCGGGCG GGCGAGACCT CGCTCGCCCT GGTCGGCGGC
ATCAACCTGA TGCTCTGGGG CATGCACAAC GACGGGCTGG CCGGCTCGGG GGCGCTGTCG
CCAGACGGCA AATGCTACAC CTTCGACAGC CGCGCCAACG GCTACGTGCG CGGCGAGGGC
GGCGGTGTCT TCGTGCTCAA ACGCCTCGAG GACGCGCAGC GGGACGACGA TCGCATCCTG
GCGACCATCC TCGGCTCGGC GGTCAATCAC AACGGCCGCT CCACCGGCCT CACCACGCCC
AATGGGCTCG CACAGCAGGA CCTGCTGCGC GACGCGCTGG CGGACGCCGG GCTCAGCGCG
GGCGACCTCG ATGTGCTCGA AGTCCACGGC ACGGGGACGC CGCTCGGCGA CTCGATCGAG
GTGGACGCGA TCCGGGCCGC CCTGGGCACG GGCGGTGAGC GCCCGCTGTG GATGGGCTCG
GTGAAGGCCA ACCTGGGCCA CCTGGAGGCC GCCGCGGGCG TGCCCTCGCT GCTCAAGCTG
ATCCTGGCGC TGCGCCACGA GCAGCTCCCG CCGCAGGCGA CCTTCCGCTC GCTCAATCCC
CTGATCGACC TCGCCGACAG CCGCCTGCGC ATCGCCGAGC GCGCGGTGCC GTGGCCGAGC
GGCGAGCGCC CGCGGCGTGC CGGCATGAGC GGCTTCGGCA TGAGCGGCAC CAACGCGCAC
CTCATCATCG GAGAGGCGCC CGCGCGCGCC GAGCCGGCCG CGGGCCCCGA GCGGCCCGTG
CACGTGCTGC CCCTGGCGAC CACGGATCAG CCCGCGATGG CGCGTCGCAT CGCGCAGATG
AGCGCGTGGA TCCACAGTCA CCCGGAAGCC CGCGCGGGCG ACGTCGCGGC GGCGGCGGCT
CGTCGCATGG CCCGCCTGCC GGTGCGCGCG GCCGTGGTCG CGGCCGATCT CCCGGGTCTC
GAGGCCGGTC TCGAGCGGCT CGCGGCCGGC ACCTCGGGGC CCGGCGCGGT CCGCGGCGCG
GCCATCGAGA TGCGTCCGCG CATCGGCTGG CTGTTCACCG GTCAGGGCGC ACAGCGCGCC
GACATGGGCC GCGGACTGTA CGAGCATCAT CCCGGGTTCC GGGAGACGCT CGATCGCTGC
GCCGACGCCC TCGGGCGCGC GCACGACCTG CGCGAGGTGA TGTGGTCGAG CGATGGCCGC
CTCGACCGCA CCGGCTGGAC CCAGCCGGCG CTGTTCGCGC TCGAAGTCTC GCTGGCCGCG
CTGTGGCGCC AGTGGGGGAT CGAGCCCGAG GTGTTGGTCG GGCACTCGGT GGGCGAGATC
GCGGCCGCCT GCGTGGCCGG CGTGTTCTCC ATCGAGGACG GCATGCGGCT CGTCGAGGCC
CGCGCTCGGC TCATGGATGC GCTGCCCGAG GGCGGCGCGA TGGTGGCGGT CCGGGGCCAA
CCGGCGCGCA TCGAGCGGGC GGTCGCGTCC GCCGAGGGCG TGTCGGTGGC CGCGTTCAAC
GGCCCCGACC AGGTCGTGAT CTCGGGCGCC AGCGACGCCG TGCAGGCGCT GGCGTCTGAG
CTGGCAGAGG CCGGGCTGCG CGCCAAGGCG CTCACCGTGT CGCACGCCTT CCACTCCGAG
TTGATGGAGC CGATGCTGGA GGATTTCCGG GCTGCGCTGC GCGACATCCG TTTCCACCCC
CCCGAGCTGC CCCTGGTGAG CAACCTGCGC GGCGAGTTGG CCGGGCCCGA GGTGGCGAGC
GCCGACTACT GGGTCGAGCA CGTGCGGGCG CCGGTGCGCT TCCTCGAGGG CATGCGTGCC
GCCCACGCCG TGGGCGTCGA TCACTACCTG GAGATCGGTC CGCAGCCGGT CCTGTGCCGT
CTTGGGGCCA CCTGCGTGCC CGCGGGCGGC GGCGAGACCT GGCTGCCCTC GCTCCAGCGC
ACCCGGGACG ACTGGGAGGT CATCGCGAAC ACCGTGGCCC AGCTCCACAG CCTGGGCGCG
GACATCGATT TTGCTGCCTT CGACGAGCCC TTTGCGCGCC GCTGGGTCGA TCTGCCCTGC
TACCCCTTCG AGCGTCGGCG CTTTCCGCTG CCCGCGATCG CCGAGCCCGA TGTCTCCGAG
CTATCCGAGC TATCCGACCT GGGCGACGCC GCGTTCGTGC GTGGTACCGA GCGCATGGCG
TCCCCCCGCC AGCCCGTGCG CGCCGAGCGC TCGAGTGGCC TGGTGCACGT CGTGCACTGG
ACCGCGGTCG CCGATCTGCC GGAGAGCCAG CCCGCGGGCC AGTGGGTGGT GCTGCACGAG
GGCGCGCCGC TGGCCGGCGA GTTGGCGTCT GCGCTGGGCG CGGTTACCGT GGAACTCGGC
CACAGCATCC CGGCCTGCGA TGGCCTGATC ATCCTGCTCG GGCGGGACGG CGAGCCCGCC
GACGAGGCCG CTCGGCAGAC GCGCGCCGCC CTGGACATTC TGGCTGCGCT GCGCGACCGG
CGTCCGGCAC CGCGCACCGT GTGGATCACG CGCGGGGCGA GCGCACCCGG ACGCGAGCCG
AGCGGTCTCG CCGGGGCCGC CCTGGCCGGC ATGCTCCGCT GCCTGGCGCT CGAGCAGCCC
GAGCTGTCGC CGCGCTGGAT CGACCTGGCC GGCGCCGGAG ACGAGGGCGA ACTCGTTGCG
TGTCTGATCC GCGAGCTCGG GGCCAGCGAG CCCGAGGTGC GCCTGTGGGG GACGCGGCGG
CAGCTTCCCC GCCTGGACGC GCTGTCGCGT TTGCCCGGGG TGTTGGAGGC GCCGGCCGGT
CCTCACCTGC GCCTGGTGCG CGGGCAGGGC AACAGCCTCG ACGCCTTGTC GATCGCAGCG
TGGCAGCCCG GTCCGCTCGC GCCCGACGAG GTGCGCGTGC GCGTGCGCGC GGCCGCGCTC
AACTTCCGGG ATGTGTTGTC GGTCCTCGGG ATGTATCCGG ACGACATCGG CGAGCCCGGC
GGCGAGGTCG TCGGCGAGGT CGAAGCGCTC GGCAGCGAGG TCGTCGGACT GTCGGTCGGC
GACCGCGTGA TGGGCGGCGG CCAGGGCGGT CTGGCGGATC GCGTCGACAT CCCCGCCATG
CAGCTCGTCC AGGTCCCCGA CACCCTCAGC GACGTCGAGG CCGCGACCCT GCCCATCGCC
GGCGGCACCG CGCTGTTTGC GCTCGCCGAA CTCGCGCGGG TTCAGCCGGG CCAGCGCGTG
CTCATCCACG CCGGCGCGGG CGGTGTGGGC ATGGCGGCGA TCCACATCGC CCGCATGCTG
GGGGCGGAGG TGCTCGCGAC CGCCAGCCCC GGCAAGCACG CGCTGCTGCG CGAGCTGGGC
GTGGAGCTCA TCGCCAATTC CCGCTCGCCG GACTTCGGCG AGCGACTGCG CGCCGCATCC
GGTGGACGCG GCGTTCACGT GGTCCTGAAC TCGCTGACCG GGCGCTTCAT CGACGAGGGG
CTGTCGCTGC TCGAGCCGGG CGGCGTGTTC GTCGAGCTCG GCAAGCGCGA TGTCCGAGAT
CCCGCCGCGA TCGCGGCCGG CTGGCCCGGC GTGCGCTACG AGGTCCTGGA CCTGACCGCG
TTGCCCATCG AGGAGTTCCA CGCCAACTCG CAGCGGGTCG CCGACATCGC CGCGCGCGGC
GACCTGCCCG CGCTGCGGCG CTCGCTGTAC AGCCTGCAGC GCGCGCCCGA GGCGCTCCGT
CGGCTGGGCG CCGGCGAGAC CGTGGGCAAA GTGGTGGTCA GGCCGCCGAG CGCTAACCCA
GTGTTTCCGA CCGCTTGTCC GGTTTTGATC ACGGGCGGCC GAGGGGCGCT CGGCCGCGCG
GTCGCGCGCT GGCTGGTGCA GCACGGCGTG CGGCATCTGG TGCTCGCCGG GCGCAGCGAG
CCGGGCCAGG ACGACCTGGA GTTCGCCGCC GCGCTCGGCG AGCAGGTCGA GCAGGTCGTA
CTCGACGTGT GCGACGAGCA CGCGCTCGCC GAGCTGCTGG CGCGGCTGGG GCCGCTGGGC
GGCGTGGTGC ACGCGGCCGG CGTGCTCGAC GATGCGATGC TGCACAACAC CGACGCCGAG
CGGGCCGCGC GCGTGATGGC GCCCAAGGTC CAGGGTGCGT GGAATCTGCA TCGCCAGACC
CTGAATCGCC CGCCGGATCT ATTCGTGGTG TTCTCGTCCG TCGCGGGCGT CTTCGGCAAT
CCCGGGCAGA GCGCCTACGC CGCCGGAAAC GCCTTTCTCG ATGCGCTGAT GGCCTGGCGG
GCCGAGATGG GCCTGCCGGG CACGAGTGTG GCCTGGGGCG CCTGGGAGGG CGAGGGCATG
GCCGCCGGCA TCGATCGGAC CCGCATCGAG GCCGCGGGGC TGCGCATGCT GCAGCCGGAG
GCCGCGCTCC AGGCGCTGGC AGAGGCGCTC GCGTCGGGAC TTCCCCGCAC CGTGGCCACG
CCGCTGCCGA CCGGGCGCAT GGGCAAGGGG GCGCCGGCGC TGCTGGCCCA CCTCGGTGAG
GTCCGCGGGC CCGCCGCAGC CACCGGCGCT GCTGCGGCCG CCAGCTCGCA GGTCGCGAAC
CTGGGACCGG ACGCCCTGCG CGAGCAGCTC GCTCGCCGCG TCGCCAACCT GCTCGGCCAC
TCCGGCCTGG TCTCCATCCG GCGTCCGCTC CACGAACTCG GACTGGATTC GCTGATGGCT
GTCGAGCTGC GCAATGGCCT GGCGGCCTGG ACCGGGTTGG AGCTGCCCGC GACCCTGGCC
TTCGACTACC CGAGCGTCGA CGCCATCGCC TCGCTGATCG AGGGCTCGCT CGAGCCGGTC
CAGGCCGCGG AGGAGGCGCC CACCCGCACG GATGAGCCCA TCGCCATCGT CGGCCTCGGG
TGCCGCTTCC CCGGCGCCGA CAACCCGCAG GCGCTGTGGC AGATCGTGCT CAAGCAGCAG
GTCATGGTGC GGCCCGTGCC GGCCGAGCGC TGGCCGGCCG ACGCCTGGCC GGCGCCCGAG
GAGCCCGGCG GGTTGCCGCG CGCGGCAGCC TTTTTGGACC GCGTGGACCA ACTCGATGCA
GGCTTTTTCG ACCTCTCGGC CGGAGAAGCG CGCGACATGG ATCCACAGCA GCGTCTGCTG
CTCGAGGTCG GCTGGGAGGC GCTGGAGGAC GCGGGCATTC CGCCGCTGTC GCTTGAGGGC
AGCAACACCG GCACCTACGT CGGCATCTGC TGCTCGGATT TTCGCGAGCT GTGCGGCGAC
AGCGCGCTCG GTGTCCACGG CGGCACCGGC ACCCTGCACT CGGTCGCCTC CGGCCGTCTC
GCGTACGTGC TGGGGCTCAA CGGGCCGGCG ATCACCGTGG ACACCGCGTG CTCATCCTCG
CTGCTCGCCG TGCACCTCGC CTGCCAGGCG CTGCGCAGCG GCGAGGTCGA CCTGGCGCTC
GCCGGCGGTG CCAACGTCAT CCTGTCGCCG CGCTCGTCGG TCGAGATCAA CCGGCTCAGC
GCGCTCTCGC CCCAGGGCCG AAGCCTCAGC TTCTCGGCCG ACGCTAACGG CTACGGGCGG
GGCGAGGGCT GCGGCGTGGT GGTGCTCAAG CGCCTGTCGG ATGCGGTCCG GGCGGGCGAT
CGGATCGTGT CGTTGATCCT GGGCGACGCG TCTAACCACG ACGGCCGCAG CAACGGCCTC
ACCGCCCCGC ACTCGGGGGC GCAGAAGAAG GTGATCCAGC AGGCGCTGCG GCGCGCGGGC
GTCGCCGGCC GAGAGGTCGA GCTCATCGAG GCCCACGGCA CGGGAACGCC TCTGGGCGAC
CCCATCGAGG TCCAGGCGCT CGATGCCGTG TTGCGCGAGG GACGTGAGCG GCCCGTGGTC
CTCGGTTCGC TCAAGGCCGG TCTCGGCCAC GCCGAAGCCG CGGCCGGCAT CGCGGGCTTG
CTCAAAGTCG CGTTGGCCAT CCAGGCCGGC GTGCTGCCCG CCCAGCCCCC GGTGGGGGCG
CTCAACCCGC ACGTCGCCTG GGATCGCCTC AAGGTGGACG TCATCGAGCA GCCGCAGCCC
TGGCCGAGCG AGCGCCGCAT CGCGGGCGTG AGTTCGTTCG GGCTCAGCGG CACCAACGTC
CACGTCGTCC TCGGGGCGCC GCCGCCCGTG GAGACGTCGC CCACGCCGCT CCAGGGCGTG
CTGCACCTGC CGCTCTCGGC TCGCAGCGCG CCGGCGCTTC GCGAGCTGGC GCGGGCCTGG
GCCGAGCGCC TCGAGCAGAG CTCGCTCGCC GATCTTCTGC ACACCGCGCG CGTGGGCCGC
AGCCATCTCG AGCACCGCGT GTGCGTCTGC GGCCACGACG CCGAGCAGCT CGCCGCGGCG
CTGCGTCTGT TTGCCGAGAG TGGGACGCTC GAGCCGGACG AGCTGGCCCG GCGCTTCATG
GCCGGCGAGC AGGTCGAGTG GCCGCTCATC CCCGGCGCCC GCATCGTCTC GGCTCCAACC
ATGCGCTGGC AACGTCGCAG GTACTGGCCG CGCGGCCTCG ATTCCGCGAG CCTTGTGCCC
GGCGGCGCGT CGGCGCGGGG CGAGACTGCG CCCGACGGTG TCAACGTCAC CGACGTCGCC
GACATCGCCG ACGTCGGGGC GTGGCTGAGC GAGCACTTGG TCGAGCTTCT CAACCTAGAG
CCCGAGGCCG TCACTCGCGA CGCCGAGTTG GCGGCGTTGG GGCTGGACTC GATGCTCAGC
CTGGAACTCG GCGAGGCCAT CCGCGACGAG CTGGAGCTGA CCGTGTACCC GCGAGAACTC
GCCGAGATCC GCACCTTGGC CGAACTCGAG ACGCTTTTGG GACGCCTGGC CGACGAGCGC
GTATCGCTCC AGGCCCGTCC CGCAGCCGCT CCGCATGACG CGGAGCCCGA ACTCGGGGCG
CCGCTCGAGC CCGAACTCGA GTCGCCGCTT GGGCCCGAGG GCGAGCGCCT CCGAGGCGCG
CCCTTGCGAG AGGGGCCGGT GTTCGTACTG AGTGCGCCCC GGTCGGGCTC GACGCTTCTG
CGCGTCATGC TCGCCGGCCA TTCCAGGCTG TTCGCCCCGC CCGAGTTGCA TCTCCTGGTG
GCGGCAGACC TCGCCGCGTG GAGGGACAGC CCCAGGCACC TGGATGAGGG CCTGCTCGAG
GCGTTGGTTC AACTCGGGCA GGGCACGCCG GAGGACGTCC GTGCCCTCAT CGACCAGTGG
GTGGCCGAGG GCCTGTCGAT TGCGGATACC TACCGGCGCC TGATGGACCT GTGTGCGCCC
CTCGCGCTGG TCGACAAGAG CCCCTCGAGC GTGATGGATC GCGACGCGCT CATGCGTGTG
GCCAGGGAGT TCCCCGACGC GCGCTTCGTG TGGCTCGTCC GGCATCCGCT GGCGGTCGTC
GAGTCGATGA TCCGACGCCG TATCCACGCG GTCGTGGGCG CGGTCGAGGA TCCTCAGACC
TTCGCCGAGC AGACCTGGTG TCAGAGCGTG GACAACGCCC TCGCGCTGCG CGACGAAGTC
GGCGCCGAGC GCTTCGTCAC CCTGCGCTAC GAGGCGCTCG TGCGCGACCC GGCGGCCGCG
ATGGCGCACC TGTGCGACGC GCTCGGCCTC GCCTACGAGG ACGCCTTGCT GCGGCCGTAC
GAGGGCGAGC GGATGACCGA CGGGCTGCAC GATGGCTCGC TTTCCATCGG CGATCCTGGC
TTCAAGGAGC GGCGCGACAT CGAGCCGACG CTGGCCGATG CGTGGCGCGA GGTCCGGCTG
CCGCGGCCGC CGAGCGCGGC CTTGTGCGAG CGGGCGCAGA GGCTCGGCTA CCCCGTGGCG
AGCCAGCGCG AACTCGTCTC CGACCTGGTG CTCAGCACCT GGGGACCTGA GAGCGGCGAC
GCGGTGGTGT GCATCCACGG CCACCTCGAT CAGGGGCCGC TGTGGACCCC GGTGGCCGAC
AGACTCGCGG CCCAGGGCCT TCGGGTGCTC GCGCCCGACC TGCGCGGCCA CGGCCGCAGC
CCGCACGGCT CGCTCGGGCT CTTCGAGCAC CTCGCCGACC TCGACGCGCT GCTCGCGGCG
CAGGCGCCGG GCCGCATCGT GCTGGTCGGC CACTCGCTGG GCGCGCTCAT CGCCGCCTTC
TACGCCGCAG CACGCCCCGA GCGCGTCGCC AAGCTGGTGC TGCTCGACCC CGGACTGCCC
TCGCCCTTGA GCGAGGGACC CGGAGCGGCA CTGGCGCGTG CCCTGGACCG TCGCAGGGAC
GCCGCCCACG CGCCGATGGC CGGCCTCGAC GAAGCGGCCC GCCGCCTCCG CCGAGCCATC
CCGGACCTGA GCGAGGCCTG GTCGCGGGAG CTCGCCGAAC GGGTGAGCGA GCAGCGGGGC
GAGCATCGCG TGTGGCGCTG GGATCCGCGC CTGCGTGTGC TCTCAGGCGA AGGATTCGAT
CGCGACACCG CGCTCGAGAT ACTCGCCTCG CAGCACGCGC CCGTCACCGT GGCCTTTGCC
GCGCGTGGCG ATCGCGCGCG CCCCGAGGAT CGCCGGGCCA TCGAGGACGC GCTGGGCAGC
GCGACCTTCG TCGAGCTCGA TACCGCGAGC CATCACCTGC ACCTGGCGCG CACCGAGGAT
GTCGTCGGCC TGATCGTCGA GCGAGCCGCG GCCCAGTCCA CCATGTCGTC GCCCGATCGG
AGTACCAATG CGCCGTAG
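
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined into one string before any computation such as GC content. The sketch below shows one way to do that in Python. The helper names are hypothetical, and only the first line of the sequence is used as an excerpt, so the printed fraction describes that excerpt rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string without whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a short excerpt.
excerpt = clean_sequence(
    "ATGCCTCACG CCGAAAGACT TCGCCGGGCG GTTTTGCTCA TCAAGAAGAT GGAGGCGCAG"
)
print(len(excerpt))                    # 60
print(round(gc_fraction(excerpt), 3))  # GC fraction of the excerpt only
```

Running `gc_fraction` on the full cleaned sequence gives the value to compare with the GC content field.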
 
Protein sequence
MPHAERLRRA VLLIKKMEAQ LAAERASRRA PIAIIGSACR LPGGVADRDA LWKVLRDGTE 
TAEAVPAARW QMAANTHAGA RWGAFIDDVD CFDPSLFGVS PREAAQADPQ QRLALELVWA
ALEDAGMAPD GLHGTRAGVW LGMSGGDYSS MLSWSRPTGL DLHGMTGAIA AFLPGRLSYA
LGLQGPSMTV DTACSSSLVA VHLACNSLRA GETSLALVGG INLMLWGMHN DGLAGSGALS
PDGKCYTFDS RANGYVRGEG GGVFVLKRLE DAQRDDDRIL ATILGSAVNH NGRSTGLTTP
NGLAQQDLLR DALADAGLSA GDLDVLEVHG TGTPLGDSIE VDAIRAALGT GGERPLWMGS
VKANLGHLEA AAGVPSLLKL ILALRHEQLP PQATFRSLNP LIDLADSRLR IAERAVPWPS
GERPRRAGMS GFGMSGTNAH LIIGEAPARA EPAAGPERPV HVLPLATTDQ PAMARRIAQM
SAWIHSHPEA RAGDVAAAAA RRMARLPVRA AVVAADLPGL EAGLERLAAG TSGPGAVRGA
AIEMRPRIGW LFTGQGAQRA DMGRGLYEHH PGFRETLDRC ADALGRAHDL REVMWSSDGR
LDRTGWTQPA LFALEVSLAA LWRQWGIEPE VLVGHSVGEI AAACVAGVFS IEDGMRLVEA
RARLMDALPE GGAMVAVRGQ PARIERAVAS AEGVSVAAFN GPDQVVISGA SDAVQALASE
LAEAGLRAKA LTVSHAFHSE LMEPMLEDFR AALRDIRFHP PELPLVSNLR GELAGPEVAS
ADYWVEHVRA PVRFLEGMRA AHAVGVDHYL EIGPQPVLCR LGATCVPAGG GETWLPSLQR
TRDDWEVIAN TVAQLHSLGA DIDFAAFDEP FARRWVDLPC YPFERRRFPL PAIAEPDVSE
LSELSDLGDA AFVRGTERMA SPRQPVRAER SSGLVHVVHW TAVADLPESQ PAGQWVVLHE
GAPLAGELAS ALGAVTVELG HSIPACDGLI ILLGRDGEPA DEAARQTRAA LDILAALRDR
RPAPRTVWIT RGASAPGREP SGLAGAALAG MLRCLALEQP ELSPRWIDLA GAGDEGELVA
CLIRELGASE PEVRLWGTRR QLPRLDALSR LPGVLEAPAG PHLRLVRGQG NSLDALSIAA
WQPGPLAPDE VRVRVRAAAL NFRDVLSVLG MYPDDIGEPG GEVVGEVEAL GSEVVGLSVG
DRVMGGGQGG LADRVDIPAM QLVQVPDTLS DVEAATLPIA GGTALFALAE LARVQPGQRV
LIHAGAGGVG MAAIHIARML GAEVLATASP GKHALLRELG VELIANSRSP DFGERLRAAS
GGRGVHVVLN SLTGRFIDEG LSLLEPGGVF VELGKRDVRD PAAIAAGWPG VRYEVLDLTA
LPIEEFHANS QRVADIAARG DLPALRRSLY SLQRAPEALR RLGAGETVGK VVVRPPSANP
VFPTACPVLI TGGRGALGRA VARWLVQHGV RHLVLAGRSE PGQDDLEFAA ALGEQVEQVV
LDVCDEHALA ELLARLGPLG GVVHAAGVLD DAMLHNTDAE RAARVMAPKV QGAWNLHRQT
LNRPPDLFVV FSSVAGVFGN PGQSAYAAGN AFLDALMAWR AEMGLPGTSV AWGAWEGEGM
AAGIDRTRIE AAGLRMLQPE AALQALAEAL ASGLPRTVAT PLPTGRMGKG APALLAHLGE
VRGPAAATGA AAAASSQVAN LGPDALREQL ARRVANLLGH SGLVSIRRPL HELGLDSLMA
VELRNGLAAW TGLELPATLA FDYPSVDAIA SLIEGSLEPV QAAEEAPTRT DEPIAIVGLG
CRFPGADNPQ ALWQIVLKQQ VMVRPVPAER WPADAWPAPE EPGGLPRAAA FLDRVDQLDA
GFFDLSAGEA RDMDPQQRLL LEVGWEALED AGIPPLSLEG SNTGTYVGIC CSDFRELCGD
SALGVHGGTG TLHSVASGRL AYVLGLNGPA ITVDTACSSS LLAVHLACQA LRSGEVDLAL
AGGANVILSP RSSVEINRLS ALSPQGRSLS FSADANGYGR GEGCGVVVLK RLSDAVRAGD
RIVSLILGDA SNHDGRSNGL TAPHSGAQKK VIQQALRRAG VAGREVELIE AHGTGTPLGD
PIEVQALDAV LREGRERPVV LGSLKAGLGH AEAAAGIAGL LKVALAIQAG VLPAQPPVGA
LNPHVAWDRL KVDVIEQPQP WPSERRIAGV SSFGLSGTNV HVVLGAPPPV ETSPTPLQGV
LHLPLSARSA PALRELARAW AERLEQSSLA DLLHTARVGR SHLEHRVCVC GHDAEQLAAA
LRLFAESGTL EPDELARRFM AGEQVEWPLI PGARIVSAPT MRWQRRRYWP RGLDSASLVP
GGASARGETA PDGVNVTDVA DIADVGAWLS EHLVELLNLE PEAVTRDAEL AALGLDSMLS
LELGEAIRDE LELTVYPREL AEIRTLAELE TLLGRLADER VSLQARPAAA PHDAEPELGA
PLEPELESPL GPEGERLRGA PLREGPVFVL SAPRSGSTLL RVMLAGHSRL FAPPELHLLV
AADLAAWRDS PRHLDEGLLE ALVQLGQGTP EDVRALIDQW VAEGLSIADT YRRLMDLCAP
LALVDKSPSS VMDRDALMRV AREFPDARFV WLVRHPLAVV ESMIRRRIHA VVGAVEDPQT
FAEQTWCQSV DNALALRDEV GAERFVTLRY EALVRDPAAA MAHLCDALGL AYEDALLRPY
EGERMTDGLH DGSLSIGDPG FKERRDIEPT LADAWREVRL PRPPSAALCE RAQRLGYPVA
SQRELVSDLV LSTWGPESGD AVVCIHGHLD QGPLWTPVAD RLAAQGLRVL APDLRGHGRS
PHGSLGLFEH LADLDALLAA QAPGRIVLVG HSLGALIAAF YAAARPERVA KLVLLDPGLP
SPLSEGPGAA LARALDRRRD AAHAPMAGLD EAARRLRRAI PDLSEAWSRE LAERVSEQRG
EHRVWRWDPR LRVLSGEGFD RDTALEILAS QHAPVTVAFA ARGDRARPED RRAIEDALGS
ATFVELDTAS HHLHLARTED VVGLIVERAA AQSTMSSPDR STNAP
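
The protein sequence can be checked against the gene sequence by translating codon by codon. The following Python sketch builds the codon table from the compact TCAG-ordered amino acid string. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are allowed; since this gene begins with ATG, the sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons with unknown bases become "X".
    Alternative start codons are NOT converted to M (a simplification).
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence: matches the first 20 residues of the protein.
print(translate("ATGCCTCACG CCGAAAGACT TCGCCGGGCG GTTTTGCTCA TCAAGAAGAT GGAGGCGCAG"))
# MPHAERLRRAVLLIKKMEAQ

# Last line of the gene sequence: matches the final residues before the stop codon.
print(translate("AGTACCAATG CGCCGTAG"))
# STNAP
```

The last line can be translated on its own because the preceding 9120 bases are a whole number of codons, so the reading frame is preserved.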