Gene Rleg_4173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4173 
Symbol 
ID8014963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4261218 
End bp4269737 
Gene Length8520 bp 
Protein Length2839 aa 
Translation table11 
GC content63% 
IMG OID644826743 
Productglycosyltransferase 36 
Protein accessionYP_002977953 
Protein GI241206857 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATTT CTCATCTTTC TCCGAGCCTT CCGCGCGAAC CCGATATGAA GCAGATCGAT 
CACAATGGTT CGATCCGTTC GACCTATCTG TCGATCGAAG ACATCAAGGC GATGGGCGAT
GCGGTCGCCC GCAACGGCGT CGACCAGCTG CCGGCTTTCG CCCCCTTCGA TTTCTTCGCG
CGCCATAAGG AAAACGAGAA GGAAATCCTG CGCGTCTACC GGACAACGGC GACCGACGTC
GAGGCCGGCG AGACGATCAC GCCGGCGGCA GAATGGCTGC TGGACAATCA CTATATCGTC
GAAGAGGCGA TCCAGGAGGT GCGGCGCGAC TTCCCCCGCC GCTTCTACCG CGAATTGCCG
ACCATGGTCG TCGGCGGCGT CGAGGTACCG CGCACCCTCG TGCTTGCCTG GCTCTATGTC
GCTCATACCC ACAGCACGAT CTCGCAGGAA AGTCTGACGG CGCTGGTCGA CGGCTTCCAG
GCTAGCGAGA CGCTCAGGAT CGGCGAACTC TGGGCGTTGC CCTCTCTGGT GCGCTACGTG
CTCGTGGAAA ACCTTCGCCG CATTTCGAGC CGTGTCGAAG CCAGCCGCCG GCTGCGCCGC
CGCGCCAACG AGGCGGCCGA CGAATTGGTG CGCCTGACCG ACCCGGCCGG GGCGGCCGCC
TATCTGAAGA CGCTGGAACC GCTTGCCGAG GACAATACCT TCTCGACGCA GTTCCTCTAT
CGCCTGCGCG ACGGCTCGCA GACGTCGAGC CTGGCGATCA CCTGGCTCGA CGAGCGGCTG
GAAGAGCTCG GCCGCAACAC CGAAGAGGCG ACGACGGCCG AACACAGCCG GCTCTCGTCC
GGCAATGTGA CGATGGGCAA CATCATCCGC AGCCTTCGCG AGATCGATGA TGCCGAATGG
TCGGTCTGGG TCGAGCAGGT CAGCCATGTC GACAAGCTGC TCTGGGAGCA TTCGGATTAC
GGCATCCTCG ATTCCGGCTC GCGCAACAAA TATCGCAAGC AGATCGAGAA GCTGGCCAAG
CGCTCGCCGC TGTCGGAAAT GGAAATCGCC CAGCTGGCGC TCGACATGAC CGATGCCGCC
AAAGCCTCCG ACGAGCCGCA GCCGCATGAA CCGAATGTCG GCGGCTTCCT GTCAGGCGCG
CAGCGGCCGA AGCTCGAGGC GCGGGCGAAC TATCGTCCGA CGGTCACCCA GCATTTCGTG
CGCGCCGTTC GCCGGTTCAA CTGGCTGGCG ATTGCCGTGC CCGTCATGCT GCTGACGGTC
ATCGCCATGG CAATCGTCGG CAAGTTCATG GCGAATGCCG GCATGGGGGC GGTCGAAATC
GCCCTGCTGC TGATCATGTT CTCAGTGCCG GCGTCGGAGG GGGCGACGGG TCTCTTCAAC
ACCTTGCTTT CCTTCTTCGT GACACCGGCG CGCCTCGTCG GCTACGAGTT CAAGGACGGC
ATTCCAGAAG ATGCGCGCAC GCTGCTCGTC GTGCCCTGCC TGATCTCGAA CCGCGACAGC
GTCGACGACC ACGTGCGCAA TCTCGAGGTC CATTATCTCG CCAATCCGCG CGGCGAGATT
TATTTCGCGA TGCTCAGCGA CTGGCCGGAC AGCGAGGTCG AAGAAACGCC TGCCGATCTC
GAGGTTCTCG ATTATGCCAG GCGCGAGATC GCCAATCTCT CCGCCCGTTA TGCCTATGAC
GGCAAGACGC GTTTCTATCT GCTGCATCGC CGCCGCCTCT ATAATCCCTC GGAAGGCGTC
TGGATGGGCT GGGAGCGCAA GCGCGGCAAG CTGCACGAGC TGAACATGCT TCTGCGCGGC
GACCGCGACA CGACCTACCT GCCGGGCGCC AATACCGTGC CAGCCAATGT ACAATATGTC
ATGACGCTCG ATGCCGATAC GCGCCTGATG CGCGATGCCG TCACCAAGCT CGTCGGCAAG
CTCTATCATC CGATCAACCG TCCGGTGATT AATCCGAAAA CCGGCCGTGT CGAAAGCGGC
TACGGCGTGC TGCAGCCGCG CGTCACCCCG TCGCTGACGA CGGGCAAGGA CGCGTCGGTC
TTCCAGCGCG TCTTTTCGAT CAACCGCGGC CTCGATCCTT ACGTTTTCAC CGTATCCGAC
GTCTATCAGG ATCTCGCCGG CGAAGGCACC TTCACCGGCA AGGGCCTTTA CCATGTCGAT
GCCTTCGAAG CGTCGCTGAA GGGCCGGATT GACGAGAACT CCGTGCTCAG CCACGATCTT
CTCGAAGGCT CGATGGCGCG TTGCGCGCTC GTTACCGATC TCGAACTGGT GGAGGATTTC
CCGATCCGCT ACGAGGTGGA AACCTCGCGC CAGCATCGCT GGGCGCGCGG CGACTGGCAG
CTTCTCCCCT ACATGTTCAA TCCAAAATAT GGCGTCACGG CACTCGGCCG CTGGAAGATG
TTCGACAATC TGCGCCGCTC GCTGACGCCG ATCGCCTGGT TCTTCGCCTC CGTGCTCGGC
TGGTATTTCA TGGGTCCGCT CGGCGCGCTC ATCTGGCAGA TTCTGCTGAT CTTCTGCCTC
TTCGTCGCGC CGACGCTATC GCTCATCAAC GGCATCATCC CGCGCACCAG CGATATCATC
GCCCGCGCCC ATCTCTATAC GGTCTGGGCC GATATCACGG CTGCCAATGC GCAGGTTGCG
CTGCGCATCG TCTTCATCGC CGATTCGGCT TGCATGATGG CGGATGCGAT CGGCCGGTCG
CTCTACCGCC TGTTCGTCAG CCGCAAGCTG ATGCTGCAGT GGCGGACCGC CGCCAGCGTT
CAGGCCGGCG GCCAGGGCAC GCTGATTTCC TATTACAAGG CGATGTGGCA TGCACCGGCG
CTGGCGCTGC TGGCGCTCGG TTTTGCCGCT CTTCCCGGCG ACAACGCCTT CCTCGTCGGC
ATTCCCTTCG CGCTGCTGTG GATCCTGTCG CCTGTTGTCG CCTGGTATGT CAGCCAGTCG
GCGGAGACCG AGGACCGGCT CGAGGTCGCC GATTCCGTGT CGAGCGAACT GCGCAAGATC
GCCCGGCGCA CTTGGCGTTA TTTCGAAACT TTCACGACCG CCGAGCAGAA CTATCTGCCG
CCTGACAATA TCCAGGAAAC GCCGCATGTG ATCGTCGCGG CGCGCACCTC GCCGACCAAT
ATCGGCGTCT ATCTGCTTTC CGTCGTTTCC GGCCGGCATT TCGGCTGGTT CTCCTTCGAG
GAGACGCTCG AGCGGCTGGA GCAGACGATC GCGACGATCG ACAAAATGGA GAAATTCCGC
GGTCACCTGT TCAACTGGTA CCACACCGAT ACGCTGCAGA CGCTTGGGCC CCGTTACGTC
TCTGCCGTCG ACAGCGGCAA TCTCGCCGGC CATCTGATCG CCATTTCGTC GGCCTGCCGC
AACTGGGCCG AGGCGCCGTC CGCTCATATG CAGGGCAATC TCGACGGCGT CGGCGACACG
GCCGGCATTC TCCGCGAAGT GCTGGCGGAC CTGCCCGATG ACCGCAAGAC CGTGCGCCCG
CTGCGCCGGC GCCTGGAGGA GCGCATCGTC GGCTTCCAGA ACGCGCTTGC CGCCGTCAAG
CGCGAGCACG AATTTGCCTC GATCCGCATC ATCAACCTCG CGGTTCTCGC GCGCGATATC
GAGAAGCTCG CCGCCAATCT CGACCATGAG GTCAAGTCGA AGCAGAGCGA AGAGGTCACC
CAATGGGCGG CATCGCTGGT GAAGGTCTGC GAGGCGCATA TTTCCGACAG CACCTTCGAC
CTTTCCAAAG TCGATGCACT GAGGCCGCGT CTGGTGGCGC TGCGCGACAA GGCGCGCGAC
CTCGCCTTCT CGATGGATTT CGGCTTCCTG TTCCGGCCGG AGCGGCGTCT GCTGTCGATC
GGCTATCGCG TCGAAAGCGG TGAACTCGAT CAGGCCTGCT ATGACCTTCT CGCCTCGGAA
TGCCGCCTGA CCAGCCTCTT CGGCATCGCC AAGGGCGACC TGCCGACGGA ACACTGGTAC
CGCCTCGGCC GCCAGGTCGT GCCCGTCGGC TCGCGCGGTG CGCTGGTGTC CTGGTCCGGC
TCGATGTTCG AATATCTGAT GCCGCCGCTC GTCATGCAGG AGCGCGGCGG GGGTATTCTC
AACCAGACCA ACAATCTGGT CGTCGTCGAG CAGATGAATT ACGCCCGCAA GCTCGGTATT
CCCTGGGGCA TTTCGGAAGC TGCCTTCAAT GCCCGCGACC ATAACCTCAA TTATCAGTAC
ACGAATTTCG GCGTGCCGAC GCTCGGCCTG AAGCGCGGTC TCGGCCACAA TGCCGTTATC
GCGCCCTATG CCTCGCTGCT TGCCAGCCAA TACGATCCGC CGGCTGCGCT GGAAAACCTG
CAGAGGCTGC GCAAGCTCGG CGCACTCGGC AAGTTCGGCT TCCACGACGC CGTCGACTTC
ACGCCGACGC GCGTGCCGGA AGGCAAGAAA TGCGCGGTGG TCTACAATTA TTATGCCCAC
CATCACGGCA TGTCGATCGC GGCGGTTGCC AATGTCGCCT TCAACGGTCA TCTGCGCGAA
CTCTTCCACG CCGATCCTGT CATCGAGGCG GCGGAACTGC TGCTGCAGGA AAAGGCGCCG
CGCGACATTC CCGTCATGAG CGGCAAGCAC GAATCCGATA CGCCCGCCAG CATCCAGGAC
GATCTCCTGC GGCCGGAACT CAGGAAGATC AGCGATCCGG CTTCGCGCGA CCGCGAACTC
GTCTTCCTGT CGAACGGCCA TTATTCGGTG ATGCTGACGG CGACGGGCGC CGGTTATTCC
CGCTGGAACA ATCTTTCCGT TTCCCGTTGG AAGCCTGATC CGACTGAAGA CCGCTGGGGC
AGTTTCATCT TCCTGCGTGA TACCGCCACC AACGAATGGT GGTCGGCGAC ATCAGAGCCG
AAGGGCGTCG AAGGCGAAAA GACCATGGTC GAATTCGCCG ACGAGAAGGC GCAGTTCACC
AAGATTGTCG GCGACCTCAC AAGCGAGGTG GAGTGCATCG TCGCCACCGA GCATGATGCC
GAGGCCCGTC GTGTTACGCT GCTCAACATG GGCACGGAAG ACCGTTTCAT CGAGGTCACC
TCCTATCTCG AGCCGGTGAT CACCTCCGAC GATACCGACA ATGCGCATCC GGCATTTGCC
CGCATGTTCG TCAAGACAGA GATCGGCAAG CGCGGCGACG TCATCCGTGC CGAACGCAAC
AAGCGCGATC CGAACGAGCC GAACATCAGC ATCGCACATC TGATCGTCGA CAATGCCGGC
GACACCCGCC ATACCGAATT CGAGACCGAC CGGCGCAAGT TCATCGGCCG TGGCCGCAGT
CTTGCCGATG CGGCCGCCTT CGACCCGGGC GCTACGCTTT CCGGCAGCGA CGGCTTCATG
CTCGATGCCG TGATGTCGCT GCGTCGCACC GTCCGCGTAC CGGCCGGCAA GAAAGTCAGC
GTGATCTTCT GGACGATTGC GGCGCCAAGC CGCGAGGAAG TCGACAAGGC GGTCAATCGC
TACCGTCATC CCGACGCCTT CACCCACGAG CTCGTCCAGG CCTGGACACG CACGCAGGTG
CAGATGCGCC ATGTCGGCGT CACCTCCCAG CAGGCGGCGG CCTTCCAGCA TCTTGGACGC
TATCTCGTCT ATCCTGATAT GCAACTGCGC AAGGACGAGG CGACCGTCGA GGCCGGCCTG
CAGTCGCAAT CGGCGCTCTG GCCGCTGGCA ATCTCCGGCG ACTTCCCGAT CTTCACGCTG
CGCATTAACG ACGACATGGA TCTCGACATC GCCCGCGAGG CGCTGCTGGC GCAGGAATAT
CTGCGCTCGC GCGGCGTCAC CGCCGATCTC GTCATCATGA ACGAACGCGC TTCGTCCTAT
GCGCAGGACA TGCAGCATGC GCTTGACGCC ATGTGCGAAA ACGTACGTCG CATGGGCCAG
GCCGACGGGT TGCGCCAGCA TATCTTTGCG GTTCGCAAGG ACCTGATGGA GGAAGCCACT
TATCATGCGC TGATCGCGGC TTCCCGCGTC ACGCTGCACA CCAAGAACGG CAAGGTGGTC
GACCAGATCA ACCGCGCCGT CGCGCTGTTT GCACCCTCGA AGGAGGAACT GCAGGAGATG
GAACGGGCCG AGCGCAACAA GGCCCCCGTC AAGCGTATTG CGCCGGTGCC GCCACCTGTC
GTGCCCGCCG TCGTCATCGA GGAGGAAGGC GATCTCGACT CCTGGAACGG CATCGGCGGC
TTTGCCCGCG ACGGACGCGA ATATGTCGTT CGGCTGCCTG GAGGTCATGC GACGCCGCAG
CCCTGGATCA ATGTCATTTC CAACGACAGC TTCGGCTTCC ACGTGTCGGC CGAGGGCTCG
GGCTTCACCT GGAGCGTCAA TTCGCGCGAC TATCAGCTGA CCTCCTGGTC GAACGATGCA
GTGGTCAACC GTTCGGGCGA GGCGTTCTAT CTCACCGATC TGGACAGCGG TGCTGTCATG
ACGCCGTTCG CAGCGCTTTC CCGCCGTCCG GACATCCGTT TCGAAGCCCG CCACGGGCTC
GGCTATTCCG TCTTCTCCAG CGTTCAGCAC GATATCGCGC TCGAGCTGAC GCAGACGATC
GACCGCGAAA AGCCGGTGAA ACTGCAACGC CTGCGCCTGC GCAACACCGG TTCGACCAGT
CGCAAGCTGC GTCTCTACGG CTATGTCGAA TGGATCCTCG GCAGCAATCC GGGACGCACG
GTGCCTTTCA TCCTGTCGAG CCATGATGAG GAGACGGGGG CGCTATTTGC CACCAATCCC
TACAGCATCG ATTTTTCCAA CCGTACTGCC TTCTTCGCGG CGAGCGAGAC GCTTTCGAGT
TTCTCGGCAA GCCGGCGCGA ATTCATCGGC AAGGCGGGGA CGATCCAGGC ACCGCAGGCC
GTCATTTCCG CCGCCGCCCT TTCCGGCGCG ACCGAACTCG ACGGTGATCC GGCCGCGGCG
CTTGCTATCG ACATCGAGCT CGGCGCCGGC GAGGAGCGCG ACTTCACCTT CTTCCTCGGC
GATACCCCGA CGGAAGAAGA AGCGCGGACT GTTATCGCCG ATATCCGCAA GGCTTCGTTC
GACGAGACCG TCGAGGCAAA CCGCGCCTTC TGGCGGGATT TCACCGGCAG GCTGCAGATA
TCGACTCCGG ATCGCGGGAT GAACAATCTC GTCAACACCT GGCTGCCGTA TCAGAGCCTC
GGCTGCCGCA TCATGGCCCG CACCGCCTTC TACCAGGCAA GCGGCGCCTT CGGCTTCCGC
GACCAGCTGC AGGACACGCT GGCCTTCGTG CTGCACGAAC CCTCGCTTGC CCGCCGGCAG
ATCCTGAATG CCGCTTCGCG GCAATTCCGT GAAGGCGACG TACAGCATTG GTGGTTGCCG
GGAACGGGTG CGGGCGTGCG CACGCTGATC TCCGACGACG TCGTCTGGCT CGCTTACGCG
ATCCATCACT ATTGCAGCGT CACCGGCGAC AAAAACGTCC TCGATGAGGA GATCGCCTTC
CTGGAAGGGC CAGCACTGCT CGAAGGCCAG CACGACTCGT TCTACAAGCC GGAAATTTCC
GAGGACAAGG CGAGCGTCTA CGAACACGCG GCGCTGGCGC TTGATCTTGC CATCGCTCGC
AAGGGGGCAA ACGGCCTGCC GCTGTTCCTC GGCGGCGACT GGAACGACGG GATGAACCGC
GTCGGCATCG GCGGGCGCGG CACCAGCGTC TGGCTCGGCT GGTTTCTGGC GGGCGCATTG
CGCTCCTTCA TTCCCTATGC CGAAGAGCGT GGCGACACGG CCCGCATGGA GCGCTGGGCG
GCGCATCTTA CTGAGCTCAA GAAGGCGCTC GAAACTGCGG CCTGGGACGG CGGCTATTAT
CGCCGCGGCA CGTTCGACGA CGGTGCACTG CTTGGCTCCA GGGAAAGCCC GGAATGCCGG
ATCGATTCGA TCGCGCAGTC CTGGAGCGTG CTGTCGGGCG AGGGCGATCC GGACCGCGCC
GTCACGGCGA TGGAGGCTGT GCTCGACCAA CTGGTCGACG AGGATGCCCG CATCATCCGG
CTGTTCACGC CGCCTTTCGT CAATTCTGCC AGGGATCCTG GCTATATCAA GGCCTATCCG
CCGGGTGTGC GTGAAAATGG CGGGCAATAT ACCCATGCGG CGACCTGGGT GGTGATGGCG
CTGGCGGAGC TGAAGCGGGG CGACGATGCC TTCCGCTGCT TCCAGATCCT CAATCCGATC
ACCCATGCGC TCGACAAGGC TTCAGCGGAG CAATACCGCG TCGAACCCTA TGTCGTGGCG
GCCGACGTCT ACGGAAACGA GCCTTATACG TCACGCGGCG GCTGGACCTG GTATACCGGG
TCAGCCGGCT GGCTCTACCG CGCCGCCGTC GAGGGCATCC TCGGTATCCG GCTGAAGGAT
GGTCGGCTCT ATGTGCGCCC GTCGCTCCCG TCGGAATGGG ACGGTTTTGC GGCAGAGGTG
GAGCAAGGCG GCGGGAAATA CCGCATTTCG GTCTCAAAAG CGTCCAATGA CAGCGGCTAC
ACTCTGGCAA TCAACGGCTG CGAAGTCACC GATCCCGAAG AGGGATATCC GCTCGGATAA
 
Protein sequence
MSISHLSPSL PREPDMKQID HNGSIRSTYL SIEDIKAMGD AVARNGVDQL PAFAPFDFFA 
RHKENEKEIL RVYRTTATDV EAGETITPAA EWLLDNHYIV EEAIQEVRRD FPRRFYRELP
TMVVGGVEVP RTLVLAWLYV AHTHSTISQE SLTALVDGFQ ASETLRIGEL WALPSLVRYV
LVENLRRISS RVEASRRLRR RANEAADELV RLTDPAGAAA YLKTLEPLAE DNTFSTQFLY
RLRDGSQTSS LAITWLDERL EELGRNTEEA TTAEHSRLSS GNVTMGNIIR SLREIDDAEW
SVWVEQVSHV DKLLWEHSDY GILDSGSRNK YRKQIEKLAK RSPLSEMEIA QLALDMTDAA
KASDEPQPHE PNVGGFLSGA QRPKLEARAN YRPTVTQHFV RAVRRFNWLA IAVPVMLLTV
IAMAIVGKFM ANAGMGAVEI ALLLIMFSVP ASEGATGLFN TLLSFFVTPA RLVGYEFKDG
IPEDARTLLV VPCLISNRDS VDDHVRNLEV HYLANPRGEI YFAMLSDWPD SEVEETPADL
EVLDYARREI ANLSARYAYD GKTRFYLLHR RRLYNPSEGV WMGWERKRGK LHELNMLLRG
DRDTTYLPGA NTVPANVQYV MTLDADTRLM RDAVTKLVGK LYHPINRPVI NPKTGRVESG
YGVLQPRVTP SLTTGKDASV FQRVFSINRG LDPYVFTVSD VYQDLAGEGT FTGKGLYHVD
AFEASLKGRI DENSVLSHDL LEGSMARCAL VTDLELVEDF PIRYEVETSR QHRWARGDWQ
LLPYMFNPKY GVTALGRWKM FDNLRRSLTP IAWFFASVLG WYFMGPLGAL IWQILLIFCL
FVAPTLSLIN GIIPRTSDII ARAHLYTVWA DITAANAQVA LRIVFIADSA CMMADAIGRS
LYRLFVSRKL MLQWRTAASV QAGGQGTLIS YYKAMWHAPA LALLALGFAA LPGDNAFLVG
IPFALLWILS PVVAWYVSQS AETEDRLEVA DSVSSELRKI ARRTWRYFET FTTAEQNYLP
PDNIQETPHV IVAARTSPTN IGVYLLSVVS GRHFGWFSFE ETLERLEQTI ATIDKMEKFR
GHLFNWYHTD TLQTLGPRYV SAVDSGNLAG HLIAISSACR NWAEAPSAHM QGNLDGVGDT
AGILREVLAD LPDDRKTVRP LRRRLEERIV GFQNALAAVK REHEFASIRI INLAVLARDI
EKLAANLDHE VKSKQSEEVT QWAASLVKVC EAHISDSTFD LSKVDALRPR LVALRDKARD
LAFSMDFGFL FRPERRLLSI GYRVESGELD QACYDLLASE CRLTSLFGIA KGDLPTEHWY
RLGRQVVPVG SRGALVSWSG SMFEYLMPPL VMQERGGGIL NQTNNLVVVE QMNYARKLGI
PWGISEAAFN ARDHNLNYQY TNFGVPTLGL KRGLGHNAVI APYASLLASQ YDPPAALENL
QRLRKLGALG KFGFHDAVDF TPTRVPEGKK CAVVYNYYAH HHGMSIAAVA NVAFNGHLRE
LFHADPVIEA AELLLQEKAP RDIPVMSGKH ESDTPASIQD DLLRPELRKI SDPASRDREL
VFLSNGHYSV MLTATGAGYS RWNNLSVSRW KPDPTEDRWG SFIFLRDTAT NEWWSATSEP
KGVEGEKTMV EFADEKAQFT KIVGDLTSEV ECIVATEHDA EARRVTLLNM GTEDRFIEVT
SYLEPVITSD DTDNAHPAFA RMFVKTEIGK RGDVIRAERN KRDPNEPNIS IAHLIVDNAG
DTRHTEFETD RRKFIGRGRS LADAAAFDPG ATLSGSDGFM LDAVMSLRRT VRVPAGKKVS
VIFWTIAAPS REEVDKAVNR YRHPDAFTHE LVQAWTRTQV QMRHVGVTSQ QAAAFQHLGR
YLVYPDMQLR KDEATVEAGL QSQSALWPLA ISGDFPIFTL RINDDMDLDI AREALLAQEY
LRSRGVTADL VIMNERASSY AQDMQHALDA MCENVRRMGQ ADGLRQHIFA VRKDLMEEAT
YHALIAASRV TLHTKNGKVV DQINRAVALF APSKEELQEM ERAERNKAPV KRIAPVPPPV
VPAVVIEEEG DLDSWNGIGG FARDGREYVV RLPGGHATPQ PWINVISNDS FGFHVSAEGS
GFTWSVNSRD YQLTSWSNDA VVNRSGEAFY LTDLDSGAVM TPFAALSRRP DIRFEARHGL
GYSVFSSVQH DIALELTQTI DREKPVKLQR LRLRNTGSTS RKLRLYGYVE WILGSNPGRT
VPFILSSHDE ETGALFATNP YSIDFSNRTA FFAASETLSS FSASRREFIG KAGTIQAPQA
VISAAALSGA TELDGDPAAA LAIDIELGAG EERDFTFFLG DTPTEEEART VIADIRKASF
DETVEANRAF WRDFTGRLQI STPDRGMNNL VNTWLPYQSL GCRIMARTAF YQASGAFGFR
DQLQDTLAFV LHEPSLARRQ ILNAASRQFR EGDVQHWWLP GTGAGVRTLI SDDVVWLAYA
IHHYCSVTGD KNVLDEEIAF LEGPALLEGQ HDSFYKPEIS EDKASVYEHA ALALDLAIAR
KGANGLPLFL GGDWNDGMNR VGIGGRGTSV WLGWFLAGAL RSFIPYAEER GDTARMERWA
AHLTELKKAL ETAAWDGGYY RRGTFDDGAL LGSRESPECR IDSIAQSWSV LSGEGDPDRA
VTAMEAVLDQ LVDEDARIIR LFTPPFVNSA RDPGYIKAYP PGVRENGGQY THAATWVVMA
LAELKRGDDA FRCFQILNPI THALDKASAE QYRVEPYVVA ADVYGNEPYT SRGGWTWYTG
SAGWLYRAAV EGILGIRLKD GRLYVRPSLP SEWDGFAAEV EQGGGKYRIS VSKASNDSGY
TLAINGCEVT DPEEGYPLG