Gene Hoch_4143 details


Gene Information

Locus tag: Hoch_4143
Symbol:
ID: 8546546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5699994
End bp: 5708507
Gene Length: 8514 bp
Protein Length: 2837 aa
Translation table: 11
GC content: 74%
IMG OID: 646388821
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_003268534
Protein GI: 262197325
COG category:
COG ID:
TIGRFAM ID:
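
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon (assumption).
    return gene_length // 3 - 1


length = gene_length_bp(5699994, 5708507)
print(length)                     # 8514
print(protein_length_aa(length))  # 2837
```

Both results match the values listed in the record (8514 bp and 2837 aa).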


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0473869
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGATTT CCGAAGACAC CGACCAGATC GAGATTCCGG TGGACGCCGA TATCGCCGCG 
CGCCTCGAGG CGTTTCGCCG CGAGCCCAGC GATGCCGACG CCTACGCCAG CCTGAGCGGG
ATTCTGCGCC AGGCCGGTCG CCTGCGCGAG CTGGCCGAGG TGCACGAGCT GCACGCGGCC
CATCTGCAAG CGGCCCAAGC CGCCACGGCG TGGACCGAGG CGGCCCGCGC CCGCCTGTCG
GCGACCCAGC GCGAGCGCGC CGAGGAGGAT CTCGGCCGCG CCCTCGACGC CGATCCGGCC
CACGAGGACG CCGCCGGGTT GCTGGCCGAG CTGTACGGCG ACGCCGGCCG CCAGGCCGAG
GCCGCCGAGC TGTTCGAGAA CGAGCTGGCC GCGCTCAAAG CCCAGGCCGA GGCGCTGCCC
GAGCGCAAGC GCGCGCCGAT CAACACCCGC CGCGGCGAGC GCCATCGCCA CCTGGCCGCG
CTCTGGGAGC GCGAGCTGGG CCGCGTCGAC CGCGCCCTCG AGCACTGGCA GCGGGCCTGG
CATCTCGAGC CCGAGCGCAC CGACGCCATC GAGGCGGCGC GCAACATCTA CGCCTCGCTG
GGCGACGATT CCATGGTCGC GCGCCTCTAC GAGGCTCAGC TCGAGGTCAT GCGCAAGGGC
GGCGATGACA AGGCGCGGGG CCAGCTCGAG CTGGCGCTGG GTCGCATCCG CGCCCGCGAG
GGCCGCATGG ACGACGCGGC CACGCACCTC GAGGAGGCGC TGCGGCTGTT GCCGGGCCTG
GACGAGGCGC TCGAGGCCCT GGCCGAGGTC TACACCTCGG CCGCCAGCGC CGAGCGTGCC
GAGCACCGCG AGCGCGCCTG CACGCTGCTC CTCGAGCTCG GCAAGCGCCG GCTGGCGAGC
GCCTCCGAGC CCGCCGACGA AGAGGCCGGC ATCGCCTATC TGCGCCGCGC CCTGGGCGTG
CAGCCGGGCT CGCGCCAGGC CACCGACGCG CTGGCCAGCG CCCTGCGCGA GGGCGAGCGC
TGGGAGGATC TCGACCACCT GTACGAGCAC TTTCTCGACC AGCAGGACGA TAAAGACTCG
CCGCAGGCCC GGGCCCAGCG CATCGACATC CTGGGCAAGC GCGCCGAGCT CTACGATCGC
TACCTGGTCG GCCGCGACAC CCTGCGCGGG CTGCTGGTCG AGCTGAGCGG GCTGACCCCG
CCGCACGACG AGATGTCGCA GAAGCTGCGC GCCTTCTACC GTCAGGAGAA GGACTGGAGC
GCGCTGGCGC AGCACATCGA GCGCGAGCTG CCGGCGCTGG CCCAGGAGCC GATGCGGGCC
GCGGCCGAGA TGCTCGAGCT GGCGACCATT GTGCGCGAGC ACCTGGGCGA CCGCGACCGC
GCCGCCGCCA TCCTGCACCA CATCCTGCGC GAGATCGATC CCAACCACCA GGAGGCGCTG
GCCCGCTACG GCGACCACTT CCGCGAGCGT CGCGACTGGC GCGGGCTGGC CGATCTGCTC
GAGTTCTCGG TCGACAGCGC GCGCAAGGCC GGCGCCCCGC CGCCGACCCT GGCCCGCCAG
CTCGAGGAGA TCGCCGGCAT CGCCGAGCAG CGTCTGGGCG ACATCGAGCG CGCCATCCAC
ACCTGGCGGC GCATCCACGA GCTGGAGCCG CAGAGCCCGC GGCCGGGCGA GGAGGTCCGC
CGCCTCGAGT CGCGGGCCAA GATGTGGGCC TCGCTGGTGG GCGTGCTCGA GCACGAGGCG
CAGAGCGCGC AGACGCCGCA GAAGCGCGCC GAGGCCCTGC GCCGCATCGC CCAGGTGTAT
CGCGAGCGCA ACGTCAACCC GCGCCGCGCC ATCGCCCTCT ACGAAGAGGT CGCCGGCATC
TTCCCCGACG ACCACACCGC GCTCAAGGCG CTGGCCGAGC TGTACGAGCG CGAGGGCGAC
CAGGCCGGGC TGGCCCACAC CCTGCGCCGC CGCCTCGACT ATGACGTGCG CGCGATGGCC
GCCCAGCACC CGGGCGAGAC CCCGAGCGTG CGCGACTGGC CCACGGCCAA GCGGGTCGAG
CGGCTCACCT CGCTGCGTCG CCTGGTCACC ATGTACGAGG GCCTGAGCGA CATCGAGGGC
GTCATCTACG CCTGCACCGG CGTGCTCGAC GCCATCCCGG GCGACCGCGA CGCGCTCGAG
CGGCTCGAGC GAGCGCTCGA CAAGTCGGGC GACGTCGAGC GCCTCGAGCA GACCCTCGTG
TACCACGTCA GCGCCGCCAG CGGCCCGGCC GAGAAGGCCC GGGTGCTGCG CCGCCTGGCG
CGCATCGCGG CCGACAAGCA GGACGATCTG GCGGCCATGC AGCGCTGGGA AGAGGTGCTG
GGCACGGTGC CCAACGACTT CGAGGCCATC GAGACCCTGG CCGATCTGTA CGAGCGCCAC
GGTCGCTGGG CCGACATGGC CCGGGTGCTC GAGCGCGGCC TGCTCAGCCA GCGCTCGCGC
GCCGGTACCA ACTCGGGCAT CCGGCGCATG CTCACCCAGG ACGGGGGCGG CCGCTACACC
ACCGGCGAGA TCCGCATCGG CACCGGGCTC ATCCTCGACC CCAAGAAGCG GCTCGCGCAG
CTTCTGCGCT ACGCCCGCGT GGTCGACGAG AAGCTCGGCG ACGCCGCCCG CTCGACGCGC
GCCTGGAAAG AGATCCTCGA GCTGTCGCCG GGCCATCACC AGGCCCTCGA GGCGCTGGCG
CGGCTGCACG AGCAGGCCGG GCGCTGGCGC GACCTGGTCG ACGTCCTGGC CGCCCGCATC
CCGCTGGTGC GCAAGGACGA ACCCGAGCTG GCCGCGCAGC TCGCGCTGCA GCGCGCGCGC
CTGCTCGAGG AGCGCATCGG CGCCCCGGGC GAGGCCATCA AGGCCCTCGA GGAGATGATC
CGCGAGATCG CGCCCGGCCA CCTCGACGCC CACCGCGCGC TGCGCCGCCT GTACGAGGCC
CGCGGCGACT TCGAGGCCGC CGTGCGCACC GCCGAGCGCG AGCTGTACCT GAGCCGCGAT
CCCGACGACA AGCTGGCGCG CGGCCTGGAA ATCGGACGCC TGTGCCGCGA TCAGCTCCAC
GATCCCACGC GCGCCATCCA GGCCTTCGAG CGCGTGCTGT ACCTCAAGGG CGACCACGAG
GTGGCGCTGG TGGCCGCGGT CGATCTCTAC GCCCGGGTCG AGGACTGGCC CAGCCACGTG
CGCACACTCG AGGCGCTGGT GGCGCAGGCC TCCGAGGGCC AGACCCGGGC CGACCTGATG
ACGCGCATCG CCCAGGTCAC GGCCGAGCGG CTCGACGACC GCGCCGGCGC CTTCTCCTGG
TATCGCCGCG CCCATGAGCA GGCGCCGCGG CCGCAGACCC TGGCCGCCCT GCGCCGCGCG
GCCGAGGCCT ACGAGCTGTG GAGCGAGCTG GCCGAGGTCT ACGAGGGCGA GCGCGGGCGC
TACGTCAACG AGCGCGACGA GCCCACCAAT CCGGTCGCCT ACGTGGCCGC CTGCCGCGAG
CTGGCGGCGC TGGCCGAGCG CCGTCTCGAC GCCCCGGTGC GGGCCATGAA CGTGCTGCTC
GACGCGATCT TGGTGGCCCC GCTCGACGAG GGCTTGCTGT CCGAGGCCGA GCGCATCGCG
GCCCAGGCCG ACCAGCGGCC GCTGTGGCAG ATCCTGCTCG AGTGCCAGAG CGCGGCGCTC
GAGCGCTCGT CGCGGGCGCG GCGTGTGATC CTCCACCGCA GCCGGGCGCG CGTGCTCGAG
GAGCGCCTCG ACGACCCCGA GACCGCGCTT GAGGAGCTGC TCAAGGCCTT TGCCTGGGCC
CCGGACCAGC TCGGCATCCG CCAGTCGCTG TACGAGCTGG CCGAGCGCAG CGGCTCGTTC
ACCGACGTCA TCGCGGTCGA GTCCGCGCTG CTCGAGCGCG CCCCGAGCAC CCACGCGCGG
CTGTCGATCC TGCGCCGCAA GGCCGCCGTC ATCGAGGACA AGCTGCACGA GACGGTGCGC
GCCTTCCGCA CCCATCTCAG CGCGTTTTTG CTGCAGCCCG AGGACAGCGA CACCGTCGCC
CACCTGTGGC GGCTGGGGCG CATCATCGGC GTGAGCTACG CCGACGCCGA TCGCACGCCG
CGGGCCGAGC CGCCGCCGGC CTACGTGCAC CCGCCCGAGC CGGCGCAGAG CCCGCGCCCG
CGCCCGGCCG CGTCCGGCAG CTCGGCCGAG GTGCCGATCG ACGACTTTGC CGACGCGGCG
GACGAATTCC GCTCCAACCC CACCCAGGAG CTGTCGGTCA ACGACCTCAG CGAGCTGATG
ATGAGCCCGG CCGACAGCGA TGAGTTCGCC GAGTCCGGGC GCGGCAGCAC CATCCAGCTC
GACCTCGACG AGATCGTCAT CCAGGAAGAG GAGGAGGCGC GCCGCCGCGA TCCCACCATC
GAGCTGCGCA CCGAGGACCT CATCAGCGCG CTGCGCGAGC CCAGCGGCAA GCGCCCGGCG
CCGCCGCCGC TGCCGGGCGT GAGCGCGCGC GCCCGCAAGC CGCCGCCGCC GCCCTCGGCC
GCGCCGCCGG CCAATCCGGC GCCGCGCCGG GGCAAGCTGC CGCCGCTGCC GAGCATGCCG
GTGCGCGCCT ACGAGGGGCC GTGGGACGAG CTGGCCGCGG CCTACGACCT GCTGCCGGCC
GCGGACAAGA AGGCCAAGAT GCGCTGGCTG TTCCGCGCCG CCGAGGTGTG GGAGAACGGC
GCCGGCGACA TCAGCCGCGC GTTCAACACC CTGGCGCGCG CGCTCGAGCT GGGCGTGGAC
GACACCGAGC CGCGCGCCCG CCTGCAGCGC GTGGCCAGCG AGCACGACTC CTGGGATCGG
CTGGCCGACC TCTACGAGTC CGCGGCCGAG GACGCCAAGA CCGCCGACAC CGCGGTCGGC
CTGCTCATGG AGGTCGCCGA GATCCGCGCC CGCCAGAAGC GCAGCCGCGA GACCGAGGCG
CTGTACCGCC GGGTGCTGGG CATGCGCCCG CGCGACCGCA CCGCGCGCGA GCGCCTCGAG
GGTCTGTACC GCAGCGAGGG CCGCTGGGTC GACCTCGCGG CCTCGCTCGA GGAGCGCACC
GATCCGCGCG TCGGCATCGC CGTGCCCGAG AGCGAGCGCC CGGAGCTCTT GCGCGAGCTG
GCCGACATCT ACCGGCGCAT GAGCCGGCCG CACGACGCCA TCGACGCGCT CCTGCGCATG
CGCGACCTGC TCCCCGAGGA CGTCGACATC CTGCGCGAGC TGGGCGAGCT CTACGCCCAG
GTCGGCCGCT GGAGCAAGGT CATCGAGAGT CTCGGACGCG TGGTCGAAAT CGCCGAGGGC
ACCGACGAGG CGCGCAGCGC GCTGCGCCGC ATCGCCGAGA TCTACGAGCG CGAGCTGGAG
CTGCCCGATC GCGCCATCGA CGCCTACCGC CAGCTCGTGG CGCAGTGGTC GGATGACACC
AGCGCCTACG CCGCGCTCGA CCGCCAGCTC GGCGCGCTCG GGCGCTGGGC CGAGCTGGCC
GACATCCTGC GCCGCCGCGC CGCGCTCACC CGCGCGCCCG AGAAGCGGGC CGCGATCCTG
CGCCGCCGCG CCTCGCTGCT GGTCGACCGC CTGGGCGCGC CCGAGGAGGG CGCGGCCGCC
CTGCGCCACG CGCGCAGCCT CACGCCCGAC GCGCCCGGCC TCGAGAGCGA GCTGGTGCAG
GCCCTGATCG CGGCCGGCCG CACCCGCGAG GCCGCGTCGC TGCTCGACAC CCGCGTCGAC
GCGCTCACCC GCACCGCCAA GGGCGTGCCC GCGCCCGGCA GCGGCGTCGG CGACCTGGCC
GCGCTGCTCA TCCGTCTGGC CGACGCTCAG GTGAGCGCGG GCGACAAAGA CGCCGCCCAG
GCCTCGCTCG CGCGCGCGCT CACCCTGGTC CCCGACCATC CCACCGCGCT CGCGGCCCAG
GCCCGCCTGG TCGAGGACGA GCGCGATCCC CGCGCCTTCG CCGAGGCCCG GCTGCGCGAG
GCCGAAGACC TCGAGGATAT CGACGCCAAG GTCGCCGCGC TGATGGACGC CGGCCTGGCC
TTGCGCGATC GCCTGGACGA CATCGAGGGC GCGCGCGCTG CCTTCGAGGC CGTGCTGCAG
GTGCTGCCCT ATCAGTCGGA GGCGACCTGG GCGCTGGCCG GCTTGGTGGA GCAGGGCGGC
GACCCGGTGC AGGCCGCGCA GGTGCTCGAG AGCCGGCTCG AGGACAGCTC GCTCGAGCCC
GCCGAGGCCG CGCGCATCCA CACCCAGCTC GCCGCGCTGG CCCGCCAGGC CGGCGTCGAG
GCCGCGGCCG AGAGTCACCT CGACGGCGCG CTGCGCGCGG TGCCCGGCCA TCTGCCGGCG
ATCATCGCGC GCGCCGACCT GCTCGGCGAG GCCGAGCGCT TCGAGGACCT CGAGGCCTTC
TTGCGCGAGG CCCTGCCGCG CCTCGAAGAC GCGCCCGCGG CCACCCTGGC CGAGCTCAAC
CGGCGCCTGG CCGTGGCCTG CGAGCACCTG GGCCGCGACG ACGAGGCCTA TCAGATCCTG
CTCGCGGCCG ACAAGCTGCA CCGCGGCCAC CTGATGGTCA AGCTGGCCCT GGGCGAGAAC
CGCTACCGGG CCAAGCGCTG GCGCGAGGCC GCCCTGCACC TGTCGGCGCT GGCCGTGCAC
GTCGACGCCC CGCGCTACCC GGCCGAGGTC GCCGAGGGCC TGTACCACGC CGCGCAGGCC
GAGATCCGCT CGCTCAGGCC CGAGAAGGCG CGGCCGCTGT ACGAGCGCGC GCTCGACCTC
AAGCCCAAGT TCACGCCGGC CCTGCACGCC TTGGCCGAGC TGGCCATGGA GAGCGGCGCC
TACGAGCGCG CGGCCGAGCT GTTGCTGCGC CAGGCCGAGG CCACCGAGGA GCCGAGCGAG
CGCATGCGCC TGTTCGAGGC GCTCGGCGAC ATGGCGCTCG AGACGCTCTC GGACGAGGCG
CGGGCGCTGT CGTGCTACCA GGCCGCGGTC GACGCCGCCT CGCCGCTCGA GGCCAAGCAC
GTCGCGCTGC TCGAGAAGCT GTTGCGCCGC CAGGAGGCCA AGGGCGATCA CCGCGGCGCC
GCGCGCACCG CCGAGCTGAT GGCCTCGTTC GGCAGCGACG GCCCGTCGCG GGCCTCGCGC
TACACCTCGG CGGCCGAGAA CTACCTGGCC GTGGGCGAGC CCGACAAGGC GCTGGCGGCG
GCGCGCAACG CGGTCGACGC CGACCCCTAC GACCTCACCG CCGTGACCGT GCTCAGCGAG
CTGCTGGCCA AGCGCAACGA GCACGAGGAG ATCACCGAGG TGCTGGGCCG GGCGCTGAGC
AAGAGCGACG ACGCCGACGC CTACATCGGC CCGCGCAAGG CCCTGCTCTG GGATCGCCTG
GCCCACGCCC GCCGCGCCCG CGGCGATATC AAAGGCGCGA CCTCGGCCTG GGAGCACGCC
CTGGCGCTGG CCGGACACTC GGACGGCGCC ATGAACGCGC GCCGTGCGCT GCTCGAGATC
TGGAAGAACG AAGCCGACAA GCGCGACACC CTGCTCGAGT TCCGGCGCGT GTTGGCGATG
GACAGCATGT CGGTCAAGGA CGTGGTCACC TACGCGCGCG ACCTGTGCCG CGGCCGGCAC
GACGACGGCG GCCGCGCGGT GCTCGAGCTG GCCCAGACCA TGGGGCACCA GTTCAGCGAG
CTCGACCGCA ACTTCCTCGA GCGCCGCCCG GTCGTGGAGA TGGCGCCCGA CGACGCCTAC
CGCGGTGTGC TCAGCGAGAG CCTGCGCGCC GAGGTGGTGC TCGACCGCAG CGACGACGAC
GACGACGGCA GCCTGCTCGG TGTGGTGCTC TCGACCATCT GGGAGGCGGC GCCGGTGCTG
TGGCCCGAGA TCGCCGAGTC GCTGCTGCGC AACGGCGTGG CCGACGCCAC CCGGGTGACG
CCGCCTTCGG AGGTCGCTGC GGTCAACATG TTCCCGCGCA TCACGTCCGC CCTGGGCGCG
CCCGCGACCA TGCTCTACGC CAGCCGCGCC GCGGACGCGC CCGATATCCA GATCGTGTGC
GCGGCCACGC CGATCGTGGT CTTCGGGCCC AAGCTGCAGC AAGTCGAGGA CCCTGCCCAG
CACAACGCCC TGCGCTTCTT GCTCGGACGC GCGGCCGAAA TGCTGCGGCC CGAGAACATC
ATCGCCGTGG GCATGCCGCA CGAGGACTAC CTCAACCTGC TGGGCGCGCT CTTGCGGCTG
TACGGGCCGC CGCACCTGCA CGACGCGCTG CCCTCGACCA TCACCGACGC CGATGCCCGG
CACGAGTACG ATGAGTCGCT GCGCACCGCG CTGCCGGTCA AGCTGCGCGA GCGGCTCGAG
ATCCTGCTCG AGGACGCCAG CAGCCGCGAT ATCGACCCCG ACCGCTACTG GAGCGCGCTC
GACCGCGCCG CCGACCGCGC CGGCCTGCTG GTGTGCGGCG ATATCGCCAC CGCGCTCAGC
TATGCGGGCG CGACCGATAT GCAGAATCGC CGCGTCACCC GGCACCTGAC CATGACCGCG
CTCAGCCCCG GCTACCTCGA AGCCCGCGCC GCGCTGGGCG TGGGCGTGCG CTGA
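
The GC content reported in the record (74%) can in principle be recomputed from the gene sequence above. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, and it assumes the sequence is pasted as whitespace-separated blocks of bases exactly as printed here. Only the first printed line is used as a short demonstration, so its result is not expected to equal the whole-gene figure.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases among all bases in the sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First printed line of the gene sequence, used only as a demonstration input.
first_line = "ATGTCGATTT CCGAAGACAC CGACCAGATC GAGATTCCGG TGGACGCCGA TATCGCCGCG"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

To check the full record, pass the complete gene sequence text to `clean_sequence` and compare the rounded result with the GC content field.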
 
Protein sequence
MSISEDTDQI EIPVDADIAA RLEAFRREPS DADAYASLSG ILRQAGRLRE LAEVHELHAA 
HLQAAQAATA WTEAARARLS ATQRERAEED LGRALDADPA HEDAAGLLAE LYGDAGRQAE
AAELFENELA ALKAQAEALP ERKRAPINTR RGERHRHLAA LWERELGRVD RALEHWQRAW
HLEPERTDAI EAARNIYASL GDDSMVARLY EAQLEVMRKG GDDKARGQLE LALGRIRARE
GRMDDAATHL EEALRLLPGL DEALEALAEV YTSAASAERA EHRERACTLL LELGKRRLAS
ASEPADEEAG IAYLRRALGV QPGSRQATDA LASALREGER WEDLDHLYEH FLDQQDDKDS
PQARAQRIDI LGKRAELYDR YLVGRDTLRG LLVELSGLTP PHDEMSQKLR AFYRQEKDWS
ALAQHIEREL PALAQEPMRA AAEMLELATI VREHLGDRDR AAAILHHILR EIDPNHQEAL
ARYGDHFRER RDWRGLADLL EFSVDSARKA GAPPPTLARQ LEEIAGIAEQ RLGDIERAIH
TWRRIHELEP QSPRPGEEVR RLESRAKMWA SLVGVLEHEA QSAQTPQKRA EALRRIAQVY
RERNVNPRRA IALYEEVAGI FPDDHTALKA LAELYEREGD QAGLAHTLRR RLDYDVRAMA
AQHPGETPSV RDWPTAKRVE RLTSLRRLVT MYEGLSDIEG VIYACTGVLD AIPGDRDALE
RLERALDKSG DVERLEQTLV YHVSAASGPA EKARVLRRLA RIAADKQDDL AAMQRWEEVL
GTVPNDFEAI ETLADLYERH GRWADMARVL ERGLLSQRSR AGTNSGIRRM LTQDGGGRYT
TGEIRIGTGL ILDPKKRLAQ LLRYARVVDE KLGDAARSTR AWKEILELSP GHHQALEALA
RLHEQAGRWR DLVDVLAARI PLVRKDEPEL AAQLALQRAR LLEERIGAPG EAIKALEEMI
REIAPGHLDA HRALRRLYEA RGDFEAAVRT AERELYLSRD PDDKLARGLE IGRLCRDQLH
DPTRAIQAFE RVLYLKGDHE VALVAAVDLY ARVEDWPSHV RTLEALVAQA SEGQTRADLM
TRIAQVTAER LDDRAGAFSW YRRAHEQAPR PQTLAALRRA AEAYELWSEL AEVYEGERGR
YVNERDEPTN PVAYVAACRE LAALAERRLD APVRAMNVLL DAILVAPLDE GLLSEAERIA
AQADQRPLWQ ILLECQSAAL ERSSRARRVI LHRSRARVLE ERLDDPETAL EELLKAFAWA
PDQLGIRQSL YELAERSGSF TDVIAVESAL LERAPSTHAR LSILRRKAAV IEDKLHETVR
AFRTHLSAFL LQPEDSDTVA HLWRLGRIIG VSYADADRTP RAEPPPAYVH PPEPAQSPRP
RPAASGSSAE VPIDDFADAA DEFRSNPTQE LSVNDLSELM MSPADSDEFA ESGRGSTIQL
DLDEIVIQEE EEARRRDPTI ELRTEDLISA LREPSGKRPA PPPLPGVSAR ARKPPPPPSA
APPANPAPRR GKLPPLPSMP VRAYEGPWDE LAAAYDLLPA ADKKAKMRWL FRAAEVWENG
AGDISRAFNT LARALELGVD DTEPRARLQR VASEHDSWDR LADLYESAAE DAKTADTAVG
LLMEVAEIRA RQKRSRETEA LYRRVLGMRP RDRTARERLE GLYRSEGRWV DLAASLEERT
DPRVGIAVPE SERPELLREL ADIYRRMSRP HDAIDALLRM RDLLPEDVDI LRELGELYAQ
VGRWSKVIES LGRVVEIAEG TDEARSALRR IAEIYERELE LPDRAIDAYR QLVAQWSDDT
SAYAALDRQL GALGRWAELA DILRRRAALT RAPEKRAAIL RRRASLLVDR LGAPEEGAAA
LRHARSLTPD APGLESELVQ ALIAAGRTRE AASLLDTRVD ALTRTAKGVP APGSGVGDLA
ALLIRLADAQ VSAGDKDAAQ ASLARALTLV PDHPTALAAQ ARLVEDERDP RAFAEARLRE
AEDLEDIDAK VAALMDAGLA LRDRLDDIEG ARAAFEAVLQ VLPYQSEATW ALAGLVEQGG
DPVQAAQVLE SRLEDSSLEP AEAARIHTQL AALARQAGVE AAAESHLDGA LRAVPGHLPA
IIARADLLGE AERFEDLEAF LREALPRLED APAATLAELN RRLAVACEHL GRDDEAYQIL
LAADKLHRGH LMVKLALGEN RYRAKRWREA ALHLSALAVH VDAPRYPAEV AEGLYHAAQA
EIRSLRPEKA RPLYERALDL KPKFTPALHA LAELAMESGA YERAAELLLR QAEATEEPSE
RMRLFEALGD MALETLSDEA RALSCYQAAV DAASPLEAKH VALLEKLLRR QEAKGDHRGA
ARTAELMASF GSDGPSRASR YTSAAENYLA VGEPDKALAA ARNAVDADPY DLTAVTVLSE
LLAKRNEHEE ITEVLGRALS KSDDADAYIG PRKALLWDRL AHARRARGDI KGATSAWEHA
LALAGHSDGA MNARRALLEI WKNEADKRDT LLEFRRVLAM DSMSVKDVVT YARDLCRGRH
DDGGRAVLEL AQTMGHQFSE LDRNFLERRP VVEMAPDDAY RGVLSESLRA EVVLDRSDDD
DDGSLLGVVL STIWEAAPVL WPEIAESLLR NGVADATRVT PPSEVAAVNM FPRITSALGA
PATMLYASRA ADAPDIQIVC AATPIVVFGP KLQQVEDPAQ HNALRFLLGR AAEMLRPENI
IAVGMPHEDY LNLLGALLRL YGPPHLHDAL PSTITDADAR HEYDESLRTA LPVKLRERLE
ILLEDASSRD IDPDRYWSAL DRAADRAGLL VCGDIATALS YAGATDMQNR RVTRHLTMTA
LSPGYLEARA ALGVGVR